#include #include #include #include #include #include #include #include #include #include #include #include #include #include #include #include #include #include #include #include #include #include #include #include #include #include #include #include #include

namespace DB
{

namespace ErrorCodes
{
    extern const int INCORRECT_QUERY;
    extern const int INVALID_SETTING_VALUE;
    extern const int UNKNOWN_SETTING;
    extern const int LOGICAL_ERROR;
}

namespace
{
    struct ExplainAnalyzedSyntaxMatcher
    {
        struct Data : public WithContext
        {
            explicit Data(ContextPtr context_) : WithContext(context_) {}
        };

        /// Do not descend into the children of a SELECT: it is analyzed as a whole in visit().
        static bool needChildVisit(ASTPtr & node, ASTPtr &)
        {
            return !node->as<ASTSelectQuery>();
        }

        static void visit(ASTPtr & ast, Data & data)
        {
            if (auto * select = ast->as<ASTSelectQuery>())
                visit(*select, ast, data);
        }

        static void visit(ASTSelectQuery & select, ASTPtr & node, Data & data)
        {
            InterpreterSelectQuery interpreter(
                node, data.getContext(), SelectQueryOptions(QueryProcessingStage::FetchColumns).analyze().modify());

            const SelectQueryInfo & query_info = interpreter.getQueryInfo();
            if (query_info.view_query)
            {
                ASTPtr tmp;
                StorageView::replaceWithSubquery(select, query_info.view_query->clone(), tmp, query_info.is_parameterized_view);
            }
        }
    };

    using ExplainAnalyzedSyntaxVisitor = InDepthNodeVisitor<ExplainAnalyzedSyntaxMatcher, true>;

}

BlockIO InterpreterExplainQuery::execute()
{
    BlockIO res;
    res.pipeline = executeImpl();
    return res;
}


Block InterpreterExplainQuery::getSampleBlock(const ASTExplainQuery::ExplainKind kind)
{
    if (kind == ASTExplainQuery::ExplainKind::QueryEstimates)
    {
        auto cols = NamesAndTypes{
            {"database", std::make_shared<DataTypeString>()},
            {"table", std::make_shared<DataTypeString>()},
            {"parts", std::make_shared<DataTypeUInt64>()},
            {"rows", std::make_shared<DataTypeUInt64>()},
            {"marks", std::make_shared<DataTypeUInt64>()},
        };
        return Block({
            {cols[0].type->createColumn(), cols[0].type, cols[0].name},
            {cols[1].type->createColumn(), cols[1].type, cols[1].name},
            {cols[2].type->createColumn(), cols[2].type, cols[2].name},
            {cols[3].type->createColumn(), cols[3].type, cols[3].name},
            {cols[4].type->createColumn(),
             cols[4].type, cols[4].name},
        });
    }
    else
    {
        Block res;
        ColumnWithTypeAndName col;
        col.name = "explain";
        col.type = std::make_shared<DataTypeString>();
        col.column = col.type->createColumn();
        res.insert(col);
        return res;
    }
}

/// Split str by line feed and write each line as a separate row to ColumnString.
static void fillColumn(IColumn & column, const std::string & str)
{
    size_t start = 0;
    size_t end = 0;
    size_t size = str.size();

    while (end < size)
    {
        if (str[end] == '\n')
        {
            column.insertData(str.data() + start, end - start);
            start = end + 1;
        }

        ++end;
    }

    /// The last line may not be terminated by a line feed.
    if (start < end)
        column.insertData(str.data() + start, end - start);
}

namespace
{

/// Settings. Different for each explain type.

struct QueryASTSettings
{
    bool graph = false;
    bool optimize = false;

    constexpr static char name[] = "AST";

    std::unordered_map<std::string, std::reference_wrapper<bool>> boolean_settings =
    {
        {"graph", graph},
        {"optimize", optimize}
    };

    std::unordered_map<std::string, std::reference_wrapper<Int64>> integer_settings;
};

struct QueryTreeSettings
{
    bool run_passes = true;
    bool dump_passes = false;
    bool dump_ast = false;
    Int64 passes = -1;

    constexpr static char name[] = "QUERY TREE";

    std::unordered_map<std::string, std::reference_wrapper<bool>> boolean_settings =
    {
        {"run_passes", run_passes},
        {"dump_passes", dump_passes},
        {"dump_ast", dump_ast}
    };

    std::unordered_map<std::string, std::reference_wrapper<Int64>> integer_settings =
    {
        {"passes", passes}
    };
};

struct QueryPlanSettings
{
    QueryPlan::ExplainPlanOptions query_plan_options;

    /// Apply query plan optimizations.
    bool optimize = true;
    bool json = false;

    constexpr static char name[] = "PLAN";

    std::unordered_map<std::string, std::reference_wrapper<bool>> boolean_settings =
    {
        {"header", query_plan_options.header},
        {"description", query_plan_options.description},
        {"actions", query_plan_options.actions},
        {"indexes", query_plan_options.indexes},
        {"optimize", optimize},
        {"json", json},
        {"sorting", query_plan_options.sorting},
    };

    std::unordered_map<std::string, std::reference_wrapper<Int64>> integer_settings;
};

struct QueryPipelineSettings
{
    QueryPlan::ExplainPipelineOptions query_pipeline_options;
    bool graph = false;
    bool compact = true;

    constexpr static char name[] = "PIPELINE";

    std::unordered_map<std::string, std::reference_wrapper<bool>> boolean_settings =
    {
        {"header", query_pipeline_options.header},
        {"graph", graph},
        {"compact", compact},
    };

    std::unordered_map<std::string, std::reference_wrapper<Int64>> integer_settings;
};

template <typename Settings>
struct ExplainSettings : public Settings
{
    using Settings::boolean_settings;
    using Settings::integer_settings;

    bool has(const std::string & name_) const
    {
        return hasBooleanSetting(name_) || hasIntegerSetting(name_);
    }

    bool hasBooleanSetting(const std::string & name_) const
    {
        return boolean_settings.count(name_) > 0;
    }

    bool hasIntegerSetting(const std::string & name_) const
    {
        return integer_settings.count(name_) > 0;
    }

    void setBooleanSetting(const std::string & name_, bool value)
    {
        auto it = boolean_settings.find(name_);
        if (it == boolean_settings.end())
            throw Exception(ErrorCodes::LOGICAL_ERROR, "Unknown setting for ExplainSettings: {}", name_);

        it->second.get() = value;
    }

    void setIntegerSetting(const std::string & name_, Int64 value)
    {
        auto it = integer_settings.find(name_);
        if (it == integer_settings.end())
            throw Exception(ErrorCodes::LOGICAL_ERROR, "Unknown setting for ExplainSettings: {}", name_);

        it->second.get() = value;
    }

    /// Comma-separated names of all supported settings, used in error messages.
    std::string getSettingsList() const
    {
        std::string res;
        for (const auto & setting : boolean_settings)
        {
            if (!res.empty())
                res += ", ";

            res += setting.first;
        }
        for (const auto & setting : integer_settings)
        {
            if (!res.empty())
                res += ", ";

            res += setting.first;
        }

        return res;
    }
};

struct QuerySyntaxSettings
{
    bool oneline = false;

    constexpr static char name[] = "SYNTAX";

    std::unordered_map<std::string, std::reference_wrapper<bool>> boolean_settings =
    {
        {"oneline", oneline},
    };

    std::unordered_map<std::string, std::reference_wrapper<Int64>> integer_settings;
};

template <typename Settings>
ExplainSettings<Settings> checkAndGetSettings(const ASTPtr & ast_settings)
{
    if (!ast_settings)
        return {};

    ExplainSettings<Settings> settings;

    const auto & set_query = ast_settings->as<ASTSetQuery &>();

    for (const auto & change : set_query.changes)
    {
        if (!settings.has(change.name))
            throw Exception(ErrorCodes::UNKNOWN_SETTING, "Unknown setting \"{}\" for EXPLAIN {} query. "
                            "Supported settings: {}", change.name, Settings::name, settings.getSettingsList());

        if (change.value.getType() != Field::Types::UInt64)
            throw Exception(ErrorCodes::INVALID_SETTING_VALUE,
                "Invalid type {} for setting \"{}\": only integer settings are supported",
                change.value.getTypeName(), change.name);

        if (settings.hasBooleanSetting(change.name))
        {
            /// Boolean settings are passed as integers and accept only 0 or 1.
            auto value = change.value.get<UInt64>();
            if (value > 1)
                throw Exception(ErrorCodes::INVALID_SETTING_VALUE, "Invalid value {} for setting \"{}\". "
                                "Expected boolean type", value, change.name);

            settings.setBooleanSetting(change.name, value);
        }
        else
        {
            auto value = change.value.get<UInt64>();
            settings.setIntegerSetting(change.name, value);
        }
    }

    return settings;
}

}

QueryPipeline InterpreterExplainQuery::executeImpl()
{
    const auto & ast = query->as<const ASTExplainQuery &>();

    Block sample_block = getSampleBlock(ast.getKind());
    MutableColumns res_columns = sample_block.cloneEmptyColumns();

    WriteBufferFromOwnString buf;
    bool single_line = false;
    bool insert_buf = true;

    SelectQueryOptions options;
    options.setExplain();

    switch (ast.getKind())
    {
        case ASTExplainQuery::ParsedAST:
        {
            auto settings = checkAndGetSettings<QueryASTSettings>(ast.getSettings());
            if (settings.optimize)
            {
                ExplainAnalyzedSyntaxVisitor::Data data(getContext());
                ExplainAnalyzedSyntaxVisitor(data).visit(query);
            }

            if (settings.graph)
                dumpASTInDotFormat(*ast.getExplainedQuery(), buf);
            else
                dumpAST(*ast.getExplainedQuery(), buf);
            break;
        }
        case ASTExplainQuery::AnalyzedSyntax:
        {
            auto settings = checkAndGetSettings<QuerySyntaxSettings>(ast.getSettings());

            ExplainAnalyzedSyntaxVisitor::Data data(getContext());
            ExplainAnalyzedSyntaxVisitor(data).visit(query);

            ast.getExplainedQuery()->format(IAST::FormatSettings(buf, settings.oneline));
            break;
        }
        case ASTExplainQuery::QueryTree:
        {
            if (ast.getExplainedQuery()->as<ASTSelectWithUnionQuery>() == nullptr)
                throw Exception(ErrorCodes::INCORRECT_QUERY, "Only SELECT is supported for EXPLAIN QUERY TREE query");

            auto settings = checkAndGetSettings<QueryTreeSettings>(ast.getSettings());
            auto query_tree = buildQueryTree(ast.getExplainedQuery(), getContext());

            if (settings.run_passes)
            {
                auto query_tree_pass_manager = QueryTreePassManager(getContext());
                addQueryTreePasses(query_tree_pass_manager);

                /// A negative value of `passes` means "run all passes".
                size_t pass_index = settings.passes < 0 ? query_tree_pass_manager.getPasses().size() : static_cast<size_t>(settings.passes);

                if (settings.dump_passes)
                {
                    query_tree_pass_manager.dump(buf, pass_index);
                    if (pass_index > 0)
                        buf << '\n';
                }

                query_tree_pass_manager.run(query_tree, pass_index);

                query_tree->dumpTree(buf);
            }
            else
            {
                query_tree->dumpTree(buf);
            }

            if (settings.dump_ast)
            {
                buf << '\n';
                buf << '\n';
                query_tree->toAST()->format(IAST::FormatSettings(buf, false));
            }

            break;
        }
        case ASTExplainQuery::QueryPlan:
        {
            if (!dynamic_cast<const ASTSelectWithUnionQuery *>(ast.getExplainedQuery().get()))
                throw Exception(ErrorCodes::INCORRECT_QUERY, "Only SELECT is supported for EXPLAIN query");

            auto settings = checkAndGetSettings<QueryPlanSettings>(ast.getSettings());
            QueryPlan plan;
            ContextPtr context;

            if (getContext()->getSettingsRef().allow_experimental_analyzer)
            {
                InterpreterSelectQueryAnalyzer interpreter(ast.getExplainedQuery(), getContext(), options);
                context = interpreter.getContext();
                plan = std::move(interpreter).extractQueryPlan();
            }
            else
            {
                InterpreterSelectWithUnionQuery interpreter(ast.getExplainedQuery(), getContext(), options);
                interpreter.buildQueryPlan(plan);
                context = interpreter.getContext();
            }

            if (settings.optimize)
                plan.optimize(QueryPlanOptimizationSettings::fromContext(context));

            if (settings.json)
            {
                /// Add extra layers to make the plan look more like PostgreSQL's output.
                auto plan_map = std::make_unique<JSONBuilder::JSONMap>();
                plan_map->add("Plan", plan.explainPlan(settings.query_plan_options));
                auto plan_array = std::make_unique<JSONBuilder::JSONArray>();
                plan_array->add(std::move(plan_map));

                auto format_settings = getFormatSettings(getContext());
                format_settings.json.quote_64bit_integers = false;

                JSONBuilder::FormatSettings json_format_settings{.settings = format_settings};
                JSONBuilder::FormatContext format_context{.out = buf};

                plan_array->format(json_format_settings, format_context);

                /// The JSON document is returned as a single row instead of being split by line feed.
                single_line = true;
            }
            else
                plan.explainPlan(buf, settings.query_plan_options);

            break;
        }
        case ASTExplainQuery::QueryPipeline:
        {
            if (dynamic_cast<const ASTSelectWithUnionQuery *>(ast.getExplainedQuery().get()))
            {
                auto settings = checkAndGetSettings<QueryPipelineSettings>(ast.getSettings());
                QueryPlan plan;
                ContextPtr context;

                if (getContext()->getSettingsRef().allow_experimental_analyzer)
                {
                    InterpreterSelectQueryAnalyzer interpreter(ast.getExplainedQuery(), getContext(), options);
                    context = interpreter.getContext();
                    plan = std::move(interpreter).extractQueryPlan();
                }
                else
                {
                    InterpreterSelectWithUnionQuery interpreter(ast.getExplainedQuery(), getContext(), options);
                    interpreter.buildQueryPlan(plan);
                    context = interpreter.getContext();
                }

                auto pipeline = plan.buildQueryPipeline(
                    QueryPlanOptimizationSettings::fromContext(context),
                    BuildQueryPipelineSettings::fromContext(context));

                if (settings.graph)
                {
                    /// Pipe holds QueryPlan, so it should not go out of scope.
                    QueryPlanResourceHolder resources;
                    auto pipe = QueryPipelineBuilder::getPipe(std::move(*pipeline), resources);
                    const auto & processors = pipe.getProcessors();

                    printPipeline(processors, buf);
                }
                else
                {
                    plan.explainPipeline(buf, settings.query_pipeline_options);
                }
            }
            else if (dynamic_cast<const ASTInsertQuery *>(ast.getExplainedQuery().get()))
            {
                InterpreterInsertQuery insert(ast.getExplainedQuery(), getContext());
                auto io = insert.execute();
                printPipeline(io.pipeline.getProcessors(), buf);
            }
            else
                throw Exception(ErrorCodes::INCORRECT_QUERY, "Only SELECT and INSERT are supported for EXPLAIN PIPELINE query");
            break;
        }
        case ASTExplainQuery::QueryEstimates:
        {
            if (!dynamic_cast<const ASTSelectWithUnionQuery *>(ast.getExplainedQuery().get()))
                throw Exception(ErrorCodes::INCORRECT_QUERY, "Only SELECT is supported for EXPLAIN ESTIMATE query");

            auto settings = checkAndGetSettings<QueryPlanSettings>(ast.getSettings());
            QueryPlan plan;
            ContextPtr context;

            InterpreterSelectWithUnionQuery interpreter(ast.getExplainedQuery(), getContext(), SelectQueryOptions());
            interpreter.buildQueryPlan(plan);
            context = interpreter.getContext();
            // Collect the selected marks, rows and parts while building the query pipeline.
            plan.buildQueryPipeline(
                QueryPlanOptimizationSettings::fromContext(context),
                BuildQueryPipelineSettings::fromContext(context));

            if (settings.optimize)
                plan.optimize(QueryPlanOptimizationSettings::fromContext(context));
            plan.explainEstimate(res_columns);
            /// The result columns are filled directly, so the text buffer is not used.
            insert_buf = false;
            break;
        }
        case ASTExplainQuery::TableOverride:
        {
            /// Check the cast result before reading the name: it is null if the argument is not a function.
            auto * table_function = ast.getTableFunction()->as<ASTFunction>();
            if (!table_function)
                throw Exception(ErrorCodes::INCORRECT_QUERY, "EXPLAIN TABLE OVERRIDE requires a table function");
            if (table_function->name != "mysql")
                throw Exception(ErrorCodes::INCORRECT_QUERY,
                    "EXPLAIN TABLE OVERRIDE is not supported for the {}() table function", table_function->name);

            auto storage = getContext()->getQueryContext()->executeTableFunction(ast.getTableFunction());
            auto metadata_snapshot = storage->getInMemoryMetadata();
            TableOverrideAnalyzer::Result override_info;
            TableOverrideAnalyzer override_analyzer(ast.getTableOverride());
            override_analyzer.analyze(metadata_snapshot, override_info);
            override_info.appendTo(buf);
            break;
        }
        case ASTExplainQuery::CurrentTransaction:
        {
            if (ast.getSettings())
                throw Exception(ErrorCodes::UNKNOWN_SETTING, "Settings are not supported for EXPLAIN CURRENT TRANSACTION query.");

            if (auto txn = getContext()->getCurrentTransaction())
            {
                String dump = txn->dumpDescription();
                buf.write(dump.data(), dump.size());
            }
            else
            {
                writeCString("", buf);
            }

            break;
        }
    }

    if (insert_buf)
    {
        if (single_line)
            res_columns[0]->insertData(buf.str().data(), buf.str().size());
        else
            fillColumn(*res_columns[0], buf.str());
    }

    return QueryPipeline(std::make_shared<SourceFromSingleChunk>(sample_block.cloneWithColumns(std::move(res_columns))));
}

}