azure-databricks

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

Azure Databricks Skill

Azure Databricks技能

This skill provides expert guidance for Azure Databricks. Covers troubleshooting, best practices, decision making, architecture & design patterns, limits & quotas, security, configuration, integrations & coding patterns, and deployment. It combines local quick-reference content with remote documentation fetching capabilities.
本技能为Azure Databricks提供专业指导,涵盖故障排除、最佳实践、决策制定、架构与设计模式、限制与配额、安全、配置、集成与编码模式以及部署。它结合了本地快速参考内容与远程文档获取能力。

How to Use This Skill

如何使用此技能

IMPORTANT for Agent: Use the Category Index below to locate relevant sections. For categories with line ranges (e.g.,
L35-L120
), use
read_file
with the specified lines. For categories with file links (e.g.,
[security.md](security.md)
), use
read_file
on the linked reference file
IMPORTANT for Agent: If
metadata.generated_at
is more than 3 months old, suggest the user pull the latest version from the repository. If
mcp_microsoftdocs
tools are not available, suggest the user install it: Installation Guide
This skill requires network access to fetch documentation content:
  • Preferred: Use
    mcp_microsoftdocs:microsoft_docs_fetch
    with query string
    from=learn-agent-skill
    . Returns Markdown.
  • Fallback: Use
    fetch_webpage
    with query string
    from=learn-agent-skill&accept=text/markdown
    . Returns Markdown.
对Agent的重要提示:使用下方的分类索引定位相关章节。对于带有行范围的分类(如
L35-L120
),使用
read_file
读取指定行。对于带有文件链接的分类(如
[security.md](security.md)
),使用
read_file
读取链接的参考文件
对Agent的重要提示:如果
metadata.generated_at
已超过3个月,建议用户从仓库拉取最新版本。如果
mcp_microsoftdocs
工具不可用,建议用户安装:安装指南
本技能需要网络访问权限来获取文档内容:
  • 首选方式:使用
    mcp_microsoftdocs:microsoft_docs_fetch
    ,查询字符串为
    from=learn-agent-skill
    ,返回Markdown格式内容。
  • 备选方式:使用
    fetch_webpage
    ,查询字符串为
    from=learn-agent-skill&accept=text/markdown
    ,返回Markdown格式内容。

Category Index

分类索引

CategoryLocationDescription
TroubleshootingL37-L127Diagnosing and fixing Databricks issues: compute startup/termination, SQL and connector errors, Lakeflow ingestion/pipelines, model serving, Genie/AI agents, logging, and performance debugging.
Best PracticesL128-L325Best-practice guidance for designing, operating, and optimizing Azure Databricks: governance, security, cost, performance, streaming, Delta, ML/GenAI, BI, Lakeflow, and troubleshooting.
Decision MakingL326-L406Guides for choosing Databricks/Lakebase runtimes, compute, storage and connectors, plus detailed migration paths (workloads, ML, SQL, ETL, data, APIs) and cost/scale configuration decisions.
Architecture & Design PatternsL407-L443Design patterns and reference architectures for Databricks lakehouse, AI agents (RAG, multi-agent, memory), MLOps/LLMOps, governance, Lakebase, and data modeling/replication.
Limits & Quotaslimits-quotas.mdLimits, quotas, and constraints for Databricks runtimes, compute, AI/BI, Lakeflow, connectors, foundation models, Unity Catalog, notebooks, Git, and related performance/scale behaviors.
Securitysecurity.mdIdentity, access control, encryption, networking, compliance, and secure integrations for Azure Databricks, Unity Catalog, Lakeflow, Lakebase, and Delta Sharing across users, apps, and data.
Configurationconfiguration.mdConfiguring and administering Azure Databricks: accounts, workspaces, security, networking, compute, storage, Unity Catalog, Lakeflow, ML/AI, system tables, SQL options, and CLI/bundle-based deployment.
Integrations & Coding Patternsintegrations.mdIntegrating Databricks with external systems (DBs, storage, BI/ETL, agents, Lakebase), using REST/CLI/SDKs, streaming/federated access, and coding patterns for Spark, SQL, ML, and GenAI.
Deploymentdeployment.mdDeploying and managing Azure Databricks workspaces, apps, models, agents, and pipelines via CI/CD, Terraform, VS Code, Model Serving, and migration/region/support considerations.
分类位置描述
故障排除L37-L127诊断并修复Databricks问题:计算资源启动/终止、SQL与连接器错误、Lakeflow摄入/管道、模型服务、Genie/AI代理、日志记录和性能调试。
最佳实践L128-L325Azure Databricks设计、运维和优化的最佳实践指导:治理、安全、成本、性能、流处理、Delta、ML/生成式AI、BI、Lakeflow和故障排除。
决策制定L326-L406指导选择Databricks/Lakebase运行时、计算资源、存储与连接器,以及详细的迁移路径(工作负载、ML、SQL、ETL、数据、API)和成本/规模配置决策。
架构与设计模式L407-L443Databricks湖仓、AI代理(RAG、多代理、内存)、MLOps/LLMOps、治理、Lakebase和数据建模/复制的设计模式与参考架构。
限制与配额limits-quotas.mdDatabricks运行时、计算资源、AI/BI、Lakeflow、连接器、基础模型、Unity Catalog、笔记本、Git以及相关性能/规模行为的限制、配额与约束。
安全security.mdAzure Databricks、Unity Catalog、Lakeflow、Lakebase和Delta Sharing在用户、应用和数据层面的身份、访问控制、加密、网络、合规和安全集成。
配置configuration.md配置与管理Azure Databricks:账户、工作区、安全、网络、计算资源、存储、Unity Catalog、Lakeflow、ML/AI、系统表、SQL选项,以及基于CLI/ bundle的部署。
集成与编码模式integrations.md将Databricks与外部系统(数据库、存储、BI/ETL、代理、Lakebase)集成,使用REST/CLI/SDK、流处理/联邦访问,以及Spark、SQL、ML和生成式AI的编码模式。
部署deployment.md通过CI/CD、Terraform、VS Code、Model Serving以及迁移/区域/支持注意事项,部署和管理Azure Databricks工作区、应用、模型、代理和管道。

Troubleshooting

故障排除

TopicURL
Monitor Genie space activity with audit log querieshttps://learn.microsoft.com/en-us/azure/databricks/ai-bi/admin/audit
Interpret Databricks enhanced security audit log schemashttps://learn.microsoft.com/en-us/azure/databricks/archive/security/monitor-log-schemas
Access guides, migration, and troubleshooting for Serverless GPUhttps://learn.microsoft.com/en-us/azure/databricks/compute/serverless/sgc-guides
Troubleshoot Azure Databricks compute startup issueshttps://learn.microsoft.com/en-us/azure/databricks/compute/troubleshooting/
Resolve Databricks classic compute termination error codeshttps://learn.microsoft.com/en-us/azure/databricks/compute/troubleshooting/cluster-error-codes
Debug Spark applications using Databricks Spark UIhttps://learn.microsoft.com/en-us/azure/databricks/compute/troubleshooting/debugging-spark-ui
Troubleshoot common Delta Sharing errorshttps://learn.microsoft.com/en-us/azure/databricks/delta-sharing/troubleshooting
Drop Delta features to fix compatibility issueshttps://learn.microsoft.com/en-us/azure/databricks/delta/drop-feature
Troubleshoot common Databricks CLI errors and issueshttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/cli/troubleshooting
Use Databricks app details page for monitoring and debugginghttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-apps/view-app-details
Troubleshoot Databricks Connect for Python issueshttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-connect/python/troubleshooting
Troubleshoot Databricks Connect for Scala problemshttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-connect/scala/troubleshooting
Troubleshoot common Databricks Terraform provider errorshttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/terraform/troubleshoot
Resolve common issues with Databricks VS Code extensionhttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/vscode-ext/faqs
Troubleshoot Databricks VS Code extension errorshttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/vscode-ext/troubleshooting
Handle ARITHMETIC_OVERFLOW errors in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/arithmetic-overflow-error-class
Resolve CAST_INVALID_INPUT errors in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/cast-invalid-input-error-class
Understand DC_GA4_RAW_DATA_ERROR in Databricks connectorshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/dc-ga4-raw-data-error-error-class
Understand DC_SFDC_API_ERROR in Databricks connectorshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/dc-sfdc-api-error-error-class
Understand DC_SQLSERVER_ERROR in Databricks connectorshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/dc-sqlserver-error-error-class
Handle DELTA_ICEBERG_COMPAT_V1_VIOLATION errors in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/delta-iceberg-compat-v1-violation-error-class
Handle DIVIDE_BY_ZERO errors in Databricks SQLhttps://learn.microsoft.com/en-us/azure/databricks/error-messages/divide-by-zero-error-class
Reference Databricks error conditions for programmatic handlinghttps://learn.microsoft.com/en-us/azure/databricks/error-messages/error-classes
Diagnose EWKB_PARSE_ERROR issues in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/ewkb-parse-error-error-class
Diagnose EWKT_PARSE_ERROR issues in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/ewkt-parse-error-error-class
Diagnose GEOJSON_PARSE_ERROR issues in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/geojson-parse-error-error-class
Resolve GROUP_BY_AGGREGATE errors in Databricks SQLhttps://learn.microsoft.com/en-us/azure/databricks/error-messages/group-by-aggregate-error-class
Handle H3_INVALID_CELL_ID errors in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/h3-invalid-cell-id-error-class
Handle H3_INVALID_GRID_DISTANCE_VALUE errors in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/h3-invalid-grid-distance-value-error-class
Handle H3_INVALID_RESOLUTION_VALUE errors in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/h3-invalid-resolution-value-error-class
Handle H3_NOT_ENABLED errors and tier requirementshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/h3-not-enabled-error-class
Understand INSUFFICIENT_TABLE_PROPERTY errors in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/insufficient-table-property-error-class
Resolve INVALID_ARRAY_INDEX errors in Databricks SQLhttps://learn.microsoft.com/en-us/azure/databricks/error-messages/invalid-array-index-error-class
Resolve INVALID_ARRAY_INDEX_IN_ELEMENT_AT errorshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/invalid-array-index-in-element-at-error-class
Resolve MISSING_AGGREGATION errors in Databricks SQLhttps://learn.microsoft.com/en-us/azure/databricks/error-messages/missing-aggregation-error-class
Understand ROW_COLUMN_ACCESS errors in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/row-column-access-error-class
Map Databricks errors to SQLSTATE codeshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/sqlstates
Resolve TABLE_OR_VIEW_NOT_FOUND errors in Databricks SQLhttps://learn.microsoft.com/en-us/azure/databricks/error-messages/table-or-view-not-found-error-class
Fix UNRESOLVED_ROUTINE function resolution errors in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/unresolved-routine-error-class
Handle UNSUPPORTED_TABLE_OPERATION errors in Azure Databrickshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/unsupported-table-operation-error-class
Handle UNSUPPORTED_VIEW_OPERATION errors in Azure Databrickshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/unsupported-view-operation-error-class
Troubleshoot WKB_PARSE_ERROR geometry parsing in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/wkb-parse-error-error-class
Troubleshoot WKT_PARSE_ERROR geometry parsing in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/error-messages/wkt-parse-error-error-class
Troubleshoot Mosaic AI Agent Evaluation issueshttps://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-evaluation/troubleshooting
Troubleshoot and debug Databricks AI agent deploymentshttps://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-framework/debug-agent
Troubleshoot common issues in Databricks Genie spaceshttps://learn.microsoft.com/en-us/azure/databricks/genie/troubleshooting
Resolve common Databricks Auto Loader issueshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/cloud-object-storage/auto-loader/faq
Diagnose and fix Databricks Confluence ingestion issueshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/confluence-troubleshoot
Troubleshoot Dynamics 365 data ingestion with Lakeflow Connecthttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/d365-troubleshoot
Troubleshoot Google Ads connector ingestion issueshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/google-ads-troubleshoot
Troubleshoot Google Analytics raw data ingestion issueshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/google-analytics-troubleshoot
Troubleshoot HubSpot connector ingestion problemshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/hubspot-troubleshoot
Troubleshoot Jira connector authentication and OAuth issueshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/jira-troubleshoot
Troubleshoot Meta Ads Lakeflow ingestion issueshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/meta-ads-troubleshoot
Troubleshoot MySQL ingestion with Lakeflow Connecthttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/mysql-troubleshoot
Troubleshoot PostgreSQL ingestion with Lakeflow Connecthttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/postgresql-troubleshoot
Troubleshoot Salesforce ingestion with Lakeflow Connecthttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/salesforce-troubleshoot
Diagnose and fix Databricks ServiceNow connector issueshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/servicenow-troubleshoot
Troubleshoot Microsoft SharePoint ingestion in Lakeflowhttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/sharepoint-troubleshoot
Troubleshoot SQL Server ingestion with Lakeflow Connecthttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/sql-server-troubleshoot
Troubleshoot TikTok Ads connector in Lakeflowhttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/tiktok-ads-troubleshoot
Troubleshoot Workday HCM connector in Lakeflowhttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/workday-hcm-troubleshoot
Diagnose and fix Databricks Workday connector issueshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/workday-reports-troubleshoot
Troubleshoot Databricks Zendesk Support connector errorshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/zendesk-support-troubleshoot
Handle Zerobus Ingest errors and retrieshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/zerobus-errors
Use logging to troubleshoot Databricks init scriptshttps://learn.microsoft.com/en-us/azure/databricks/init-scripts/logs
Troubleshoot and repair Lakeflow Job failureshttps://learn.microsoft.com/en-us/azure/databricks/jobs/repair-job-failures
Monitor and troubleshoot Lakeflow pipelineshttps://learn.microsoft.com/en-us/azure/databricks/ldp/observability
Use query history to debug and optimize pipelineshttps://learn.microsoft.com/en-us/azure/databricks/ldp/query-history
Recover pipelines from streaming checkpoint corruptionhttps://learn.microsoft.com/en-us/azure/databricks/ldp/recover-streaming
Troubleshoot Databricks Feature Store issues and limitationshttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/feature-store/troubleshooting-and-limitations
Debug common issues in Databricks Model Serving endpointshttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/model-serving-debug
Diagnose and resolve Databricks model serving timeoutshttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/model-serving-timeouts
Monitor Lakebase system operations and healthhttps://learn.microsoft.com/en-us/azure/databricks/oltp/projects/operations
Troubleshoot failing Spark jobs and removed executorshttps://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/failing-spark-jobs
Diagnose and fix Spark memory issues on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/spark-memory-issues
Troubleshoot Databricks Partner Connect issueshttps://learn.microsoft.com/en-us/azure/databricks/partner-connect/troubleshoot
Troubleshoot common Databricks Git folders errorshttps://learn.microsoft.com/en-us/azure/databricks/repos/errors-troubleshooting
Fetch cursor rows and handle SQLSTATE in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/control-flow/fetch-stmt
Use GET DIAGNOSTICS for SQL error handling in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/control-flow/get-diagnostics-stmt
Open cursors and handle errors with OPEN in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/control-flow/open-stmt
Re-raise handled conditions with RESIGNAL in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/control-flow/resignal-stmt
Raise custom conditions with SIGNAL in Databricks SQLhttps://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/control-flow/signal-stmt
Validate UTF-8 strings and handle INVALID_UTF8_STRINGhttps://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/functions/validate_utf8
Understand Databricks SQL query performance insightshttps://learn.microsoft.com/en-us/azure/databricks/sql/user/queries/performance-insights
Use query history UI to debug Databricks SQLhttps://learn.microsoft.com/en-us/azure/databricks/sql/user/queries/query-history
Interpret Databricks SQL query profiles for performancehttps://learn.microsoft.com/en-us/azure/databricks/sql/user/queries/query-profile
主题链接
使用审计日志查询监控Genie空间活动https://learn.microsoft.com/en-us/azure/databricks/ai-bi/admin/audit
解读Databricks增强安全审计日志架构https://learn.microsoft.com/en-us/azure/databricks/archive/security/monitor-log-schemas
获取无服务器GPU的指南、迁移和故障排除内容https://learn.microsoft.com/en-us/azure/databricks/compute/serverless/sgc-guides
排查Azure Databricks计算资源启动问题https://learn.microsoft.com/en-us/azure/databricks/compute/troubleshooting/
解决Databricks经典计算资源终止错误代码https://learn.microsoft.com/en-us/azure/databricks/compute/troubleshooting/cluster-error-codes
使用Databricks Spark UI调试Spark应用https://learn.microsoft.com/en-us/azure/databricks/compute/troubleshooting/debugging-spark-ui
排查常见Delta Sharing错误https://learn.microsoft.com/en-us/azure/databricks/delta-sharing/troubleshooting
移除Delta功能以修复兼容性问题https://learn.microsoft.com/en-us/azure/databricks/delta/drop-feature
排查常见Databricks CLI错误与问题https://learn.microsoft.com/en-us/azure/databricks/dev-tools/cli/troubleshooting
使用Databricks应用详情页面进行监控和调试https://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-apps/view-app-details
排查Databricks Connect for Python问题https://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-connect/python/troubleshooting
排查Databricks Connect for Scala问题https://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-connect/scala/troubleshooting
排查常见Databricks Terraform提供程序错误https://learn.microsoft.com/en-us/azure/databricks/dev-tools/terraform/troubleshoot
解决Databricks VS Code扩展的常见问题https://learn.microsoft.com/en-us/azure/databricks/dev-tools/vscode-ext/faqs
排查Databricks VS Code扩展错误https://learn.microsoft.com/en-us/azure/databricks/dev-tools/vscode-ext/troubleshooting
处理Databricks中的ARITHMETIC_OVERFLOW错误https://learn.microsoft.com/en-us/azure/databricks/error-messages/arithmetic-overflow-error-class
解决Databricks中的CAST_INVALID_INPUT错误https://learn.microsoft.com/en-us/azure/databricks/error-messages/cast-invalid-input-error-class
理解Databricks连接器中的DC_GA4_RAW_DATA_ERRORhttps://learn.microsoft.com/en-us/azure/databricks/error-messages/dc-ga4-raw-data-error-error-class
理解Databricks连接器中的DC_SFDC_API_ERRORhttps://learn.microsoft.com/en-us/azure/databricks/error-messages/dc-sfdc-api-error-error-class
理解Databricks连接器中的DC_SQLSERVER_ERRORhttps://learn.microsoft.com/en-us/azure/databricks/error-messages/dc-sqlserver-error-error-class
处理Databricks中的DELTA_ICEBERG_COMPAT_V1_VIOLATION错误https://learn.microsoft.com/en-us/azure/databricks/error-messages/delta-iceberg-compat-v1-violation-error-class
处理Databricks SQL中的DIVIDE_BY_ZERO错误https://learn.microsoft.com/en-us/azure/databricks/error-messages/divide-by-zero-error-class
参考Databricks错误条件以进行程序化处理https://learn.microsoft.com/en-us/azure/databricks/error-messages/error-classes
诊断Databricks中的EWKB_PARSE_ERROR问题https://learn.microsoft.com/en-us/azure/databricks/error-messages/ewkb-parse-error-error-class
诊断Databricks中的EWKT_PARSE_ERROR问题https://learn.microsoft.com/en-us/azure/databricks/error-messages/ewkt-parse-error-error-class
诊断Databricks中的GEOJSON_PARSE_ERROR问题https://learn.microsoft.com/en-us/azure/databricks/error-messages/geojson-parse-error-error-class
解决Databricks SQL中的GROUP_BY_AGGREGATE错误https://learn.microsoft.com/en-us/azure/databricks/error-messages/group-by-aggregate-error-class
处理Databricks中的H3_INVALID_CELL_ID错误https://learn.microsoft.com/en-us/azure/databricks/error-messages/h3-invalid-cell-id-error-class
处理Databricks中的H3_INVALID_GRID_DISTANCE_VALUE错误https://learn.microsoft.com/en-us/azure/databricks/error-messages/h3-invalid-grid-distance-value-error-class
处理Databricks中的H3_INVALID_RESOLUTION_VALUE错误https://learn.microsoft.com/en-us/azure/databricks/error-messages/h3-invalid-resolution-value-error-class
处理H3_NOT_ENABLED错误及层级要求https://learn.microsoft.com/en-us/azure/databricks/error-messages/h3-not-enabled-error-class
理解Databricks中的INSUFFICIENT_TABLE_PROPERTY错误https://learn.microsoft.com/en-us/azure/databricks/error-messages/insufficient-table-property-error-class
解决Databricks SQL中的INVALID_ARRAY_INDEX错误https://learn.microsoft.com/en-us/azure/databricks/error-messages/invalid-array-index-error-class
解决INVALID_ARRAY_INDEX_IN_ELEMENT_AT错误https://learn.microsoft.com/en-us/azure/databricks/error-messages/invalid-array-index-in-element-at-error-class
解决Databricks SQL中的MISSING_AGGREGATION错误https://learn.microsoft.com/en-us/azure/databricks/error-messages/missing-aggregation-error-class
理解Databricks中的ROW_COLUMN_ACCESS错误https://learn.microsoft.com/en-us/azure/databricks/error-messages/row-column-access-error-class
将Databricks错误映射到SQLSTATE代码https://learn.microsoft.com/en-us/azure/databricks/error-messages/sqlstates
解决Databricks SQL中的TABLE_OR_VIEW_NOT_FOUND错误https://learn.microsoft.com/en-us/azure/databricks/error-messages/table-or-view-not-found-error-class
修复Databricks中的UNRESOLVED_ROUTINE函数解析错误https://learn.microsoft.com/en-us/azure/databricks/error-messages/unresolved-routine-error-class
处理Azure Databricks中的UNSUPPORTED_TABLE_OPERATION错误https://learn.microsoft.com/en-us/azure/databricks/error-messages/unsupported-table-operation-error-class
处理Azure Databricks中的UNSUPPORTED_VIEW_OPERATION错误https://learn.microsoft.com/en-us/azure/databricks/error-messages/unsupported-view-operation-error-class
排查Databricks中WKB_PARSE_ERROR几何解析问题https://learn.microsoft.com/en-us/azure/databricks/error-messages/wkb-parse-error-error-class
排查Databricks中WKT_PARSE_ERROR几何解析问题https://learn.microsoft.com/en-us/azure/databricks/error-messages/wkt-parse-error-error-class
排查Mosaic AI代理评估问题https://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-evaluation/troubleshooting
排查和调试Databricks AI代理部署https://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-framework/debug-agent
排查Databricks Genie空间的常见问题https://learn.microsoft.com/en-us/azure/databricks/genie/troubleshooting
解决常见Databricks Auto Loader问题https://learn.microsoft.com/en-us/azure/databricks/ingestion/cloud-object-storage/auto-loader/faq
诊断并修复Databricks Confluence摄入问题https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/confluence-troubleshoot
使用Lakeflow Connect排查Dynamics 365数据摄入问题https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/d365-troubleshoot
排查Google Ads连接器摄入问题https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/google-ads-troubleshoot
排查Google Analytics原始数据摄入问题https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/google-analytics-troubleshoot
排查HubSpot连接器摄入问题https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/hubspot-troubleshoot
排查Jira连接器身份验证和OAuth问题https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/jira-troubleshoot
排查Meta Ads Lakeflow摄入问题https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/meta-ads-troubleshoot
使用Lakeflow Connect排查MySQL摄入问题https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/mysql-troubleshoot
使用Lakeflow Connect排查PostgreSQL摄入问题https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/postgresql-troubleshoot
使用Lakeflow Connect排查Salesforce摄入问题https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/salesforce-troubleshoot
诊断并修复Databricks ServiceNow连接器问题https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/servicenow-troubleshoot
排查Lakeflow中的Microsoft SharePoint摄入问题https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/sharepoint-troubleshoot
使用Lakeflow Connect排查SQL Server摄入问题https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/sql-server-troubleshoot
排查Lakeflow中的TikTok Ads连接器问题https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/tiktok-ads-troubleshoot
排查Lakeflow中的Workday HCM连接器问题https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/workday-hcm-troubleshoot
诊断并修复Databricks Workday连接器问题https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/workday-reports-troubleshoot
排查Databricks Zendesk Support连接器错误https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/zendesk-support-troubleshoot
处理Zerobus摄入错误与重试https://learn.microsoft.com/en-us/azure/databricks/ingestion/zerobus-errors
使用日志排查Databricks初始化脚本问题https://learn.microsoft.com/en-us/azure/databricks/init-scripts/logs
排查并修复Lakeflow Job失败问题https://learn.microsoft.com/en-us/azure/databricks/jobs/repair-job-failures
监控和排查Lakeflow管道https://learn.microsoft.com/en-us/azure/databricks/ldp/observability
使用查询历史调试和优化管道https://learn.microsoft.com/en-us/azure/databricks/ldp/query-history
从流处理检查点损坏中恢复管道https://learn.microsoft.com/en-us/azure/databricks/ldp/recover-streaming
排查Databricks Feature Store问题与限制https://learn.microsoft.com/en-us/azure/databricks/machine-learning/feature-store/troubleshooting-and-limitations
调试Databricks Model Serving端点的常见问题https://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/model-serving-debug
诊断并解决Databricks模型服务超时问题https://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/model-serving-timeouts
监控Lakebase系统操作与健康状态https://learn.microsoft.com/en-us/azure/databricks/oltp/projects/operations
排查失败的Spark作业和已移除的执行器问题https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/failing-spark-jobs
诊断并修复Databricks上的Spark内存问题https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/spark-memory-issues
排查Databricks Partner Connect问题https://learn.microsoft.com/en-us/azure/databricks/partner-connect/troubleshoot
排查常见Databricks Git文件夹错误https://learn.microsoft.com/en-us/azure/databricks/repos/errors-troubleshooting
在Databricks中获取游标行并处理SQLSTATEhttps://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/control-flow/fetch-stmt
在Databricks中使用GET DIAGNOSTICS进行SQL错误处理https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/control-flow/get-diagnostics-stmt
在Databricks中打开游标并使用OPEN处理错误https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/control-flow/open-stmt
在Databricks中使用RESIGNAL重新抛出已处理的条件https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/control-flow/resignal-stmt
在Databricks SQL中使用SIGNAL抛出自定义条件https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/control-flow/signal-stmt
验证UTF-8字符串并处理INVALID_UTF8_STRINGhttps://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/functions/validate_utf8
理解Databricks SQL查询性能洞察https://learn.microsoft.com/en-us/azure/databricks/sql/user/queries/performance-insights
使用查询历史UI调试Databricks SQLhttps://learn.microsoft.com/en-us/azure/databricks/sql/user/queries/query-history
解读Databricks SQL查询配置文件以优化性能https://learn.microsoft.com/en-us/azure/databricks/sql/user/queries/query-profile

Best Practices

最佳实践

TopicURL
Tag Databricks resources for cost attribution and trackinghttps://learn.microsoft.com/en-us/azure/databricks/admin/account-settings/usage-detail-tags
Use Databricks default compute policy families effectivelyhttps://learn.microsoft.com/en-us/azure/databricks/admin/clusters/policy-families
Apply identity best practices in Azure Databrickshttps://learn.microsoft.com/en-us/azure/databricks/admin/users-groups/best-practices
Apply best practices for Databricks serverless workspaceshttps://learn.microsoft.com/en-us/azure/databricks/admin/workspace/serverless-workspaces-best-practices
Migrate Databricks library installs from init scriptshttps://learn.microsoft.com/en-us/azure/databricks/archive/compute/libraries-init-scripts
Apply best practices for Databricks compute policieshttps://learn.microsoft.com/en-us/azure/databricks/archive/compute/policies-best-practices
Use DBIO for transactional writes to cloud storage in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/archive/legacy/dbio-commit
Optimize skewed joins in Databricks using skew hintshttps://learn.microsoft.com/en-us/azure/databricks/archive/legacy/skew-join
Apply Azure Databricks platform administration best practiceshttps://learn.microsoft.com/en-us/azure/databricks/cheat-sheet/administration
Optimize BI performance with Databricks SQL warehouseshttps://learn.microsoft.com/en-us/azure/databricks/cheat-sheet/bi-serving
Prepare and model data for high-performance BI on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/cheat-sheet/bi-serving-data-prep
Configure Databricks SQL warehouses for optimal BI servinghttps://learn.microsoft.com/en-us/azure/databricks/cheat-sheet/bi-serving-sql-serving
Follow best practices for Azure Databricks compute creationhttps://learn.microsoft.com/en-us/azure/databricks/cheat-sheet/compute
Implement best practices for Azure Databricks production jobshttps://learn.microsoft.com/en-us/azure/databricks/cheat-sheet/jobs
Best practices for Power BI dashboards on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/cheat-sheet/power-bi
Apply Databricks compute configuration recommendationshttps://learn.microsoft.com/en-us/azure/databricks/compute/cluster-config-best-practices
Use flexible node types for reliable Databricks computehttps://learn.microsoft.com/en-us/azure/databricks/compute/flexible-node-types
Apply best practices for Databricks poolshttps://learn.microsoft.com/en-us/azure/databricks/compute/pool-best-practices
Apply serverless compute best practices in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/compute/serverless/best-practices
Apply data loading best practices on Databricks Serverless GPUhttps://learn.microsoft.com/en-us/azure/databricks/compute/serverless/sgc-dataloading
Track experiments and monitor GPUs on Databricks Serverlesshttps://learn.microsoft.com/en-us/azure/databricks/compute/serverless/sgc-tracking-observability
Tune Databricks SQL warehouses for BI workloadshttps://learn.microsoft.com/en-us/azure/databricks/compute/sql-warehouse/bi-workload-settings
Control large interactive queries with Query Watchdoghttps://learn.microsoft.com/en-us/azure/databricks/compute/troubleshooting/query-watchdog
Optimize Databricks dashboard performance with cachinghttps://learn.microsoft.com/en-us/azure/databricks/dashboards/caching
Apply observability best practices for Databricks jobs and pipelineshttps://learn.microsoft.com/en-us/azure/databricks/data-engineering/observability-best-practices
Apply schema evolution strategies in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/data-engineering/schema-evolution
Best practices for UDFs in Unity Catalog ABAC policieshttps://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/abac/udf-best-practices
Apply Unity Catalog best practices for data governancehttps://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/best-practices
Monitor fairness and bias for Databricks classification modelshttps://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/data-quality-monitoring/data-profiling/fairness-bias
Update Databricks jobs after Unity Catalog upgradehttps://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/jobs-update
Work with legacy Hive metastore database objectshttps://learn.microsoft.com/en-us/azure/databricks/database-objects/hive-metastore
Apply safe usage patterns for DBFS roothttps://learn.microsoft.com/en-us/azure/databricks/dbfs/dbfs-root
Use and migrate off DBFS mounts safelyhttps://learn.microsoft.com/en-us/azure/databricks/dbfs/mounts
Apply best practices for DBFS and Unity Cataloghttps://learn.microsoft.com/en-us/azure/databricks/dbfs/unity-catalog
Optimize Delta Sharing egress costshttps://learn.microsoft.com/en-us/azure/databricks/delta-sharing/manage-egress
Apply Delta Lake best practices on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/delta/best-practices
Use liquid clustering for Delta layouthttps://learn.microsoft.com/en-us/azure/databricks/delta/clustering
Improve queries with Delta data skippinghttps://learn.microsoft.com/en-us/azure/databricks/delta/data-skipping
Use deletion vectors to speed up Delta updateshttps://learn.microsoft.com/en-us/azure/databricks/delta/deletion-vectors
Safely drop or replace tables in Azure Databrickshttps://learn.microsoft.com/en-us/azure/databricks/delta/drop-table
Use Delta table history and time travel safelyhttps://learn.microsoft.com/en-us/azure/databricks/delta/history
Optimize Delta table layout with OPTIMIZEhttps://learn.microsoft.com/en-us/azure/databricks/delta/optimize
Handle Delta Lake limitations when using AWS S3https://learn.microsoft.com/en-us/azure/databricks/delta/s3-limitations
Control Delta data file size on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/delta/tune-file-size
Use VACUUM to remove stale Delta fileshttps://learn.microsoft.com/en-us/azure/databricks/delta/vacuum
Optimize VARIANT queries with shreddinghttps://learn.microsoft.com/en-us/azure/databricks/delta/variant-shredding
Apply Databricks-recommended CI/CD workflows and patternshttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/ci-cd/best-practices
List Databricks cluster policy families via CLIhttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/cli/reference/policy-families-commands
Apply security and performance best practices for Databricks appshttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-apps/best-practices
Test Scala code using Databricks Connect and ScalaTesthttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-connect/scala/testing
Run Python tests on Databricks via VS Codehttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/vscode-ext/pytest
Choose patterns for external access to Databricks datahttps://learn.microsoft.com/en-us/azure/databricks/external-access/
Choose between volumes and workspace files in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/files/files-recommendations
Customize AI judges for Databricks Agent Evaluationhttps://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-evaluation/advanced-agent-eval
Design effective evaluation sets for Databricks agentshttps://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-evaluation/evaluation-set
Synthetically generate agent evaluation setshttps://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-evaluation/synthesize-evaluation-set
Build and evaluate Databricks retrieval agentshttps://learn.microsoft.com/en-us/azure/databricks/generative-ai/tutorials/agent-framework-notebook
Measure RAG performance with Databricks metricshttps://learn.microsoft.com/en-us/azure/databricks/generative-ai/tutorials/ai-cookbook/evaluate-assess-performance
Create evaluation sets for Databricks RAG appshttps://learn.microsoft.com/en-us/azure/databricks/generative-ai/tutorials/ai-cookbook/evaluate-define-quality
Evaluate and monitor RAG apps on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/generative-ai/tutorials/ai-cookbook/fundamentals-evaluation-monitoring-rag
Optimize Databricks RAG application qualityhttps://learn.microsoft.com/en-us/azure/databricks/generative-ai/tutorials/ai-cookbook/quality-overview
Improve Databricks RAG chain qualityhttps://learn.microsoft.com/en-us/azure/databricks/generative-ai/tutorials/ai-cookbook/quality-rag-chain
Configure Genie Code custom instructionshttps://learn.microsoft.com/en-us/azure/databricks/genie-code/instructions
Apply best practices to improve Genie Code responseshttps://learn.microsoft.com/en-us/azure/databricks/genie-code/tips
Evaluate Genie spaces using benchmarkshttps://learn.microsoft.com/en-us/azure/databricks/genie/benchmarks
Curate effective Azure Databricks Genie spaceshttps://learn.microsoft.com/en-us/azure/databricks/genie/best-practices
Build Genie knowledge stores for accurate responseshttps://learn.microsoft.com/en-us/azure/databricks/genie/knowledge-store
Use trusted assets to provide verified Genie answershttps://learn.microsoft.com/en-us/azure/databricks/genie/trusted-assets
Migrate existing Auto Loader streams to file eventshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/cloud-object-storage/auto-loader/migrating-to-file-events
Apply common Auto Loader data loading patternshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/cloud-object-storage/auto-loader/patterns
Configure Databricks Auto Loader for production workloadshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/cloud-object-storage/auto-loader/production
Configure Auto Loader with Unity Catalog for secure ingestionhttps://learn.microsoft.com/en-us/azure/databricks/ingestion/cloud-object-storage/auto-loader/unity-catalog
Apply common COPY INTO data loading patternshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/cloud-object-storage/copy-into/examples
Ingest local and internet files into Databrickshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/file-upload/
Download and store internet data in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/file-upload/download-internet-files
Apply common patterns to Lakeflow ingestion pipelineshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/common-patterns
Perform full refreshes of Lakeflow target tableshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/full-refresh
Analyze Lakeflow ingestion costs with billing tableshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/monitor-costs
Perform ongoing maintenance for Lakeflow pipelineshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/pipeline-maintenance
Maintain and operate PostgreSQL ingestion pipelines in Lakeflowhttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/postgresql-maintenance
Optimize incremental ingestion of Salesforce formula fieldshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/salesforce-formula-fields
Use init scripts to customize Databricks clustershttps://learn.microsoft.com/en-us/azure/databricks/init-scripts/
Reference external files safely in Databricks init scriptshttps://learn.microsoft.com/en-us/azure/databricks/init-scripts/referencing-files
Test applications using Databricks JDBC Driver (Simba)https://learn.microsoft.com/en-us/azure/databricks/integrations/jdbc/testing
Test applications using the Databricks ODBC Driverhttps://learn.microsoft.com/en-us/azure/databricks/integrations/odbc/testing
Configure compute resources for Lakeflow Jobs efficientlyhttps://learn.microsoft.com/en-us/azure/databricks/jobs/compute
Set up recurring, backfillable jobs with parametershttps://learn.microsoft.com/en-us/azure/databricks/jobs/how-to/create-recurring-job
Apply best practices to classic Lakeflow Jobshttps://learn.microsoft.com/en-us/azure/databricks/jobs/run-classic-jobs
Apply cost optimization best practices on Databricks lakehousehttps://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/cost-optimization/best-practices
Implement data and AI governance best practices on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/data-governance/best-practices
Apply interoperability and usability best practices on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/interoperability-and-usability/best-practices
Apply operational excellence best practices on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/operational-excellence/best-practices
Apply performance efficiency best practices on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/performance-efficiency/best-practices
Apply reliability best practices on Databricks lakehousehttps://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/reliability/best-practices
Implement security, compliance, and privacy best practices on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/security-compliance-and-privacy/best-practices
Optimize pipeline clusters with enhanced autoscalinghttps://learn.microsoft.com/en-us/azure/databricks/ldp/auto-scaling
Apply best practices for Lakeflow Spark Declarative Pipelineshttps://learn.microsoft.com/en-us/azure/databricks/ldp/best-practices
Use advanced AUTO CDC features and monitor processing metricshttps://learn.microsoft.com/en-us/azure/databricks/ldp/cdc-advanced
Apply development and testing best practices to Lakeflow pipelineshttps://learn.microsoft.com/en-us/azure/databricks/ldp/develop
Manage Python dependencies in Lakeflow pipelineshttps://learn.microsoft.com/en-us/azure/databricks/ldp/developer/external-dependencies
Apply advanced expectation patterns and scaling strategieshttps://learn.microsoft.com/en-us/azure/databricks/ldp/expectation-patterns
Reduce pipeline initialization latency by restructuring flowshttps://learn.microsoft.com/en-us/azure/databricks/ldp/fix-high-init
Develop and debug ETL pipelines with the Lakeflow Pipelines Editorhttps://learn.microsoft.com/en-us/azure/databricks/ldp/multi-file-editor
Use legacy notebook experience to develop Lakeflow pipelineshttps://learn.microsoft.com/en-us/azure/databricks/ldp/notebook-devex
Optimize stateful streaming with watermarks in pipelineshttps://learn.microsoft.com/en-us/azure/databricks/ldp/stateful-processing
Design CDC and snapshot patterns in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/ldp/what-is-change-data-capture
Restart Python process to refresh Databricks librarieshttps://learn.microsoft.com/en-us/azure/databricks/libraries/restart-python-process
Apply Hyperopt best practices and troubleshooting on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/automl-hyperparam-tuning/hyperopt-best-practices
Implement point-in-time correct feature joinshttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/feature-store/time-series
Load and prepare data for ML on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/load-data/
Perform batch inference on Spark DataFrames with registered modelshttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-inference/dl-model-inference
Configure Locust-based load tests for Databricks endpointshttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/configure-load-test
Validate models before Databricks Model Serving deploymenthttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/model-serving-pre-deployment-validation
Optimize Databricks Model Serving endpoints for productionhttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/production-optimization
Plan and execute load testing for Databricks serving endpointshttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/what-is-load-test
Tune and scale Ray clusters on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/ray/scale-ray
Implement distributed image inference on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/reference-solutions/images-etl-inference
Follow deep learning best practices on Azure Databrickshttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/train-model/dl-best-practices
Fine-tune Hugging Face models on a single GPU in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/train-model/huggingface/fine-tune-model
Prepare datasets for Hugging Face fine-tuning on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/train-model/huggingface/load-data
Apply data modeling best practices for Databricks metric viewshttps://learn.microsoft.com/en-us/azure/databricks/metric-views/data-modeling/
Apply composability patterns in metric viewshttps://learn.microsoft.com/en-us/azure/databricks/metric-views/data-modeling/composability
Define joins in Databricks metric view YAMLhttps://learn.microsoft.com/en-us/azure/databricks/metric-views/data-modeling/joins
Use semantic metadata in Databricks metric viewshttps://learn.microsoft.com/en-us/azure/databricks/metric-views/data-modeling/semantic-metadata
Implement window measures in metric viewshttps://learn.microsoft.com/en-us/azure/databricks/metric-views/data-modeling/window-measures
Use materialization to optimize metric view querieshttps://learn.microsoft.com/en-us/azure/databricks/metric-views/materialization
Adapt existing Apache Spark workloads to Azure Databrickshttps://learn.microsoft.com/en-us/azure/databricks/migration/spark
Align MLflow LLM judges with human evaluatorshttps://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/eval-monitor/align-judges
Developer workflow for MLflow code-based scorershttps://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/eval-monitor/custom-scorer-dev-workflow
Automatically optimize prompts with MLflow GEPAhttps://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/prompt-version-mgmt/prompt-registry/automatically-optimize-prompts
Evaluate and compare MLflow prompt versionshttps://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/prompt-version-mgmt/prompt-registry/evaluate-prompts
Use manual MLflow tracing for production GenAI appshttps://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/tracing/app-instrumentation/manual-tracing/
Analyze GenAI traces for errors and performancehttps://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/tracing/observe-with-traces/analyze-traces
Run Databricks notebooks safely and efficientlyhttps://learn.microsoft.com/en-us/azure/databricks/notebooks/run-notebook
Monitor and analyze active Lakebase querieshttps://learn.microsoft.com/en-us/azure/databricks/oltp/projects/active-queries
Implement branch-based development in Lakebasehttps://learn.microsoft.com/en-us/azure/databricks/oltp/projects/dev-workflow-tutorial
Analyze Lakebase query performance historyhttps://learn.microsoft.com/en-us/azure/databricks/oltp/projects/query-performance
Follow Databricks performance optimization guidancehttps://learn.microsoft.com/en-us/azure/databricks/optimizations/
Use adaptive query execution on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/optimizations/aqe
Leverage cost-based optimizer in Databricks SQLhttps://learn.microsoft.com/en-us/azure/databricks/optimizations/cbo
Improve read performance with Databricks disk cachehttps://learn.microsoft.com/en-us/azure/databricks/optimizations/disk-cache
Speed up queries with dynamic file pruninghttps://learn.microsoft.com/en-us/azure/databricks/optimizations/dynamic-file-pruning
Optimize Delta MERGE with low shuffle mergehttps://learn.microsoft.com/en-us/azure/databricks/optimizations/low-shuffle-merge
Accelerate data access with predictive I/Ohttps://learn.microsoft.com/en-us/azure/databricks/optimizations/predictive-io
Tune Azure Databricks range join performancehttps://learn.microsoft.com/en-us/azure/databricks/optimizations/range-join
Diagnose Databricks Spark cost and performance in UIhttps://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/
Use Spark jobs timeline to debug Databricks workloadshttps://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/jobs-timeline
Diagnose long-running Spark jobs in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/long-spark-stage
Analyze high I/O Spark stages in Databricks UIhttps://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/long-spark-stage-io
Debug skew and spill in Databricks Spark stageshttps://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/long-spark-stage-page
Handle Databricks spot instance losses effectivelyhttps://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/losing-spot-instances
Resolve long Spark stages with a single taskhttps://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/one-spark-task
Debug slow Spark stages with low I/O in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/slow-spark-stage-low-io
Optimize many small Spark jobs on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/small-spark-jobs
Identify expensive reads in Databricks Spark DAGshttps://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/spark-dag-expensive-read
Mitigate overloaded Spark driver on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/spark-driver-overloaded
Diagnose gaps between Spark jobs in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/spark-job-gaps
Detect unnecessary data rewriting in Databricks Spark writeshttps://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/spark-rewriting-data
Best practices for setting up Databricks Partner Connecthttps://learn.microsoft.com/en-us/azure/databricks/partner-connect/best-practice
Configure networking for Databricks Lakehouse Federationhttps://learn.microsoft.com/en-us/azure/databricks/query-federation/networking
Optimize performance of Databricks Lakehouse Federation querieshttps://learn.microsoft.com/en-us/azure/databricks/query-federation/performance-recommendations
Encrypt inter-node traffic in Databricks clustershttps://learn.microsoft.com/en-us/azure/databricks/security/keys/encrypt-otw
Optimize transformations on complex and nested data typeshttps://learn.microsoft.com/en-us/azure/databricks/semi-structured/complex-types
Use higher-order functions to process arrays in Databricks SQLhttps://learn.microsoft.com/en-us/azure/databricks/semi-structured/higher-order-functions
Use VOID (NULL) type correctly in Databricks SQLhttps://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/data-types/null-type
Work with OBJECT type and VARIANT schemas in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/data-types/object-type
Use TIMESTAMP_NTZ type and Delta support in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/data-types/timestamp-ntz-type
Use VARIANT type and Iceberg compatibility in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/data-types/variant-type
Collect table statistics with ANALYZE TABLE for optimizationhttps://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/sql-ref-syntax-aux-analyze-compute-statistics
Optimize Databricks SQL queries using hintshttps://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/sql-ref-syntax-qry-select-hints
Benchmark Databricks SQL with TPC-DS sample datasetshttps://learn.microsoft.com/en-us/azure/databricks/sql/tpcds-eval
Use Databricks SQL query caching for performancehttps://learn.microsoft.com/en-us/azure/databricks/sql/user/queries/query-caching
Use Databricks SQL query filters effectivelyhttps://learn.microsoft.com/en-us/azure/databricks/sql/user/queries/query-filters
Optimize queries using primary key constraints in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/sql/user/queries/query-optimization-constraints
Work with query parameters in Databricks SQLhttps://learn.microsoft.com/en-us/azure/databricks/sql/user/queries/query-parameters
Create and use query snippets in Databricks SQLhttps://learn.microsoft.com/en-us/azure/databricks/sql/user/queries/query-snippets
Use Structured Streaming checkpoints correctly on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/structured-streaming/checkpoints
Implement Delta Lake streaming reads and writes in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/structured-streaming/delta-lake
Choose Structured Streaming output modes on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/structured-streaming/output-mode
Optimize Databricks Structured Streaming for productionhttps://learn.microsoft.com/en-us/azure/databricks/structured-streaming/production
Optimize stateless Structured Streaming queries on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/structured-streaming/stateless-streaming
Monitor Structured Streaming queries using Databricks toolshttps://learn.microsoft.com/en-us/azure/databricks/structured-streaming/stream-monitoring
Combine Unity Catalog with Structured Streaming workloadshttps://learn.microsoft.com/en-us/azure/databricks/structured-streaming/unity-catalog
Apply watermarks for efficient stateful streaminghttps://learn.microsoft.com/en-us/azure/databricks/structured-streaming/watermarks
Optimize partition discovery for Unity Catalog external tableshttps://learn.microsoft.com/en-us/azure/databricks/tables/external-partition-discovery
Analyze Databricks table size and storage costshttps://learn.microsoft.com/en-us/azure/databricks/tables/size
Aggregate data with batch, streaming, and viewshttps://learn.microsoft.com/en-us/azure/databricks/transform/aggregation
Design data models optimized for Azure Databrickshttps://learn.microsoft.com/en-us/azure/databricks/transform/data-modeling
Use joins effectively in Databricks batch and streaminghttps://learn.microsoft.com/en-us/azure/databricks/transform/join
Optimize join performance for Azure Databricks workloadshttps://learn.microsoft.com/en-us/azure/databricks/transform/optimize-joins
Implement data cleaning and validation on Azure Databrickshttps://learn.microsoft.com/en-us/azure/databricks/transform/validate
Optimize Mosaic AI Vector Search performancehttps://learn.microsoft.com/en-us/azure/databricks/vector-search/vector-search-best-practices
Design and run load tests for vector search endpointshttps://learn.microsoft.com/en-us/azure/databricks/vector-search/vector-search-endpoint-load-test
Improve Mosaic AI Vector Search retrieval qualityhttps://learn.microsoft.com/en-us/azure/databricks/vector-search/vector-search-retrieval-quality
主题链接
标记Databricks资源以进行成本归因和跟踪https://learn.microsoft.com/en-us/azure/databricks/admin/account-settings/usage-detail-tags
有效使用Databricks默认计算资源策略家族https://learn.microsoft.com/en-us/azure/databricks/admin/clusters/policy-families
在Azure Databricks中应用身份管理最佳实践https://learn.microsoft.com/en-us/azure/databricks/admin/users-groups/best-practices
应用Databricks无服务器工作区最佳实践https://learn.microsoft.com/en-us/azure/databricks/admin/workspace/serverless-workspaces-best-practices
将Databricks库安装从初始化脚本迁移出去https://learn.microsoft.com/en-us/azure/databricks/archive/compute/libraries-init-scripts
应用Databricks计算资源策略最佳实践https://learn.microsoft.com/en-us/azure/databricks/archive/compute/policies-best-practices
在Databricks中使用DBIO向云存储进行事务性写入https://learn.microsoft.com/en-us/azure/databricks/archive/legacy/dbio-commit
在Databricks中使用倾斜提示优化倾斜连接https://learn.microsoft.com/en-us/azure/databricks/archive/legacy/skew-join
应用Azure Databricks平台管理最佳实践https://learn.microsoft.com/en-us/azure/databricks/cheat-sheet/administration
使用Databricks SQL仓库优化BI性能https://learn.microsoft.com/en-us/azure/databricks/cheat-sheet/bi-serving
为Databricks上的高性能BI准备和建模数据https://learn.microsoft.com/en-us/azure/databricks/cheat-sheet/bi-serving-data-prep
配置Databricks SQL仓库以实现最佳BI服务https://learn.microsoft.com/en-us/azure/databricks/cheat-sheet/bi-serving-sql-serving
遵循Azure Databricks计算资源创建最佳实践https://learn.microsoft.com/en-us/azure/databricks/cheat-sheet/compute
为Azure Databricks生产作业实施最佳实践https://learn.microsoft.com/en-us/azure/databricks/cheat-sheet/jobs
Databricks上Power BI仪表板的最佳实践https://learn.microsoft.com/en-us/azure/databricks/cheat-sheet/power-bi
应用Databricks计算资源配置建议https://learn.microsoft.com/en-us/azure/databricks/compute/cluster-config-best-practices
使用灵活节点类型实现可靠的Databricks计算资源https://learn.microsoft.com/en-us/azure/databricks/compute/flexible-node-types
应用Databricks资源池最佳实践https://learn.microsoft.com/en-us/azure/databricks/compute/pool-best-practices
在Databricks中应用无服务器计算资源最佳实践https://learn.microsoft.com/en-us/azure/databricks/compute/serverless/best-practices
在Databricks无服务器GPU上应用数据加载最佳实践https://learn.microsoft.com/en-us/azure/databricks/compute/serverless/sgc-dataloading
在Databricks无服务器上跟踪实验并监控GPUhttps://learn.microsoft.com/en-us/azure/databricks/compute/serverless/sgc-tracking-observability
为BI工作负载调优Databricks SQL仓库https://learn.microsoft.com/en-us/azure/databricks/compute/sql-warehouse/bi-workload-settings
使用Query Watchdog控制大型交互式查询https://learn.microsoft.com/en-us/azure/databricks/compute/troubleshooting/query-watchdog
使用缓存优化Databricks仪表板性能https://learn.microsoft.com/en-us/azure/databricks/dashboards/caching
为Databricks作业和管道应用可观测性最佳实践https://learn.microsoft.com/en-us/azure/databricks/data-engineering/observability-best-practices
在Databricks中应用架构演进策略https://learn.microsoft.com/en-us/azure/databricks/data-engineering/schema-evolution
Unity Catalog ABAC策略中UDF的最佳实践https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/abac/udf-best-practices
应用Unity Catalog数据治理最佳实践https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/best-practices
监控Databricks分类模型的公平性与偏差https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/data-quality-monitoring/data-profiling/fairness-bias
Unity Catalog升级后更新Databricks作业https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/jobs-update
处理遗留Hive元数据库对象https://learn.microsoft.com/en-us/azure/databricks/database-objects/hive-metastore
应用DBFS根目录的安全使用模式https://learn.microsoft.com/en-us/azure/databricks/dbfs/dbfs-root
安全使用和迁移DBFS挂载https://learn.microsoft.com/en-us/azure/databricks/dbfs/mounts
应用DBFS与Unity Catalog的最佳实践https://learn.microsoft.com/en-us/azure/databricks/dbfs/unity-catalog
优化Delta Sharing出口成本https://learn.microsoft.com/en-us/azure/databricks/delta-sharing/manage-egress
在Databricks上应用Delta Lake最佳实践https://learn.microsoft.com/en-us/azure/databricks/delta/best-practices
使用液态聚类优化Delta布局https://learn.microsoft.com/en-us/azure/databricks/delta/clustering
使用Delta数据跳跃提升查询性能https://learn.microsoft.com/en-us/azure/databricks/delta/data-skipping
使用删除向量加速Delta更新https://learn.microsoft.com/en-us/azure/databricks/delta/deletion-vectors
在Azure Databricks中安全删除或替换表https://learn.microsoft.com/en-us/azure/databricks/delta/drop-table
安全使用Delta表历史和时间旅行https://learn.microsoft.com/en-us/azure/databricks/delta/history
使用OPTIMIZE优化Delta表布局https://learn.microsoft.com/en-us/azure/databricks/delta/optimize
使用AWS S3时处理Delta Lake限制https://learn.microsoft.com/en-us/azure/databricks/delta/s3-limitations
在Databricks上控制Delta数据文件大小https://learn.microsoft.com/en-us/azure/databricks/delta/tune-file-size
使用VACUUM移除陈旧Delta文件https://learn.microsoft.com/en-us/azure/databricks/delta/vacuum
使用分片优化VARIANT查询https://learn.microsoft.com/en-us/azure/databricks/delta/variant-shredding
应用Databricks推荐的CI/CD工作流和模式https://learn.microsoft.com/en-us/azure/databricks/dev-tools/ci-cd/best-practices
通过CLI列出Databricks集群策略家族https://learn.microsoft.com/en-us/azure/databricks/dev-tools/cli/reference/policy-families-commands
为Databricks应用应用安全与性能最佳实践https://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-apps/best-practices
使用Databricks Connect和ScalaTest测试Scala代码https://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-connect/scala/testing
通过VS Code在Databricks上运行Python测试https://learn.microsoft.com/en-us/azure/databricks/dev-tools/vscode-ext/pytest
选择Databricks数据的外部访问模式https://learn.microsoft.com/en-us/azure/databricks/external-access/
在Databricks中选择卷与工作区文件https://learn.microsoft.com/en-us/azure/databricks/files/files-recommendations
为Databricks代理评估自定义AI评判器https://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-evaluation/advanced-agent-eval
为Databricks代理设计有效的评估集https://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-evaluation/evaluation-set
合成生成代理评估集https://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-evaluation/synthesize-evaluation-set
构建并评估Databricks检索代理https://learn.microsoft.com/en-us/azure/databricks/generative-ai/tutorials/agent-framework-notebook
使用Databricks指标衡量RAG性能https://learn.microsoft.com/en-us/azure/databricks/generative-ai/tutorials/ai-cookbook/evaluate-assess-performance
为Databricks RAG应用创建评估集https://learn.microsoft.com/en-us/azure/databricks/generative-ai/tutorials/ai-cookbook/evaluate-define-quality
在Databricks上评估和监控RAG应用https://learn.microsoft.com/en-us/azure/databricks/generative-ai/tutorials/ai-cookbook/fundamentals-evaluation-monitoring-rag
优化Databricks RAG应用质量https://learn.microsoft.com/en-us/azure/databricks/generative-ai/tutorials/ai-cookbook/quality-overview
提升Databricks RAG链质量https://learn.microsoft.com/en-us/azure/databricks/generative-ai/tutorials/ai-cookbook/quality-rag-chain
配置Genie Code自定义指令https://learn.microsoft.com/en-us/azure/databricks/genie-code/instructions
应用最佳实践提升Genie Code响应质量https://learn.microsoft.com/en-us/azure/databricks/genie-code/tips
使用基准测试评估Genie空间https://learn.microsoft.com/en-us/azure/databricks/genie/benchmarks
精心打造有效的Azure Databricks Genie空间https://learn.microsoft.com/en-us/azure/databricks/genie/best-practices
构建Genie知识存储以提供准确响应https://learn.microsoft.com/en-us/azure/databricks/genie/knowledge-store
使用可信资产提供经过验证的Genie答案https://learn.microsoft.com/en-us/azure/databricks/genie/trusted-assets
将现有Auto Loader流迁移到文件事件模式https://learn.microsoft.com/en-us/azure/databricks/ingestion/cloud-object-storage/auto-loader/migrating-to-file-events
应用常见Auto Loader数据加载模式https://learn.microsoft.com/en-us/azure/databricks/ingestion/cloud-object-storage/auto-loader/patterns
为生产工作负载配置Databricks Auto Loaderhttps://learn.microsoft.com/en-us/azure/databricks/ingestion/cloud-object-storage/auto-loader/production
配置带有Unity Catalog的Auto Loader以实现安全摄入https://learn.microsoft.com/en-us/azure/databricks/ingestion/cloud-object-storage/auto-loader/unity-catalog
应用常见COPY INTO数据加载模式https://learn.microsoft.com/en-us/azure/databricks/ingestion/cloud-object-storage/copy-into/examples
将本地和互联网文件摄入到Databrickshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/file-upload/
在Databricks中下载并存储互联网数据https://learn.microsoft.com/en-us/azure/databricks/ingestion/file-upload/download-internet-files
为Lakeflow摄入管道应用常见模式https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/common-patterns
对Lakeflow目标表执行全量刷新https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/full-refresh
使用账单表分析Lakeflow摄入成本https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/monitor-costs
对Lakeflow管道执行持续维护https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/pipeline-maintenance
在Lakeflow中维护和操作PostgreSQL摄入管道https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/postgresql-maintenance
优化Salesforce公式字段的增量摄入https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/salesforce-formula-fields
使用初始化脚本自定义Databricks集群https://learn.microsoft.com/en-us/azure/databricks/init-scripts/
在Databricks初始化脚本中安全引用外部文件https://learn.microsoft.com/en-us/azure/databricks/init-scripts/referencing-files
使用Databricks JDBC驱动(Simba)测试应用https://learn.microsoft.com/en-us/azure/databricks/integrations/jdbc/testing
使用Databricks ODBC驱动测试应用https://learn.microsoft.com/en-us/azure/databricks/integrations/odbc/testing
高效配置Lakeflow Jobs的计算资源https://learn.microsoft.com/en-us/azure/databricks/jobs/compute
设置带有参数的定期可回填作业https://learn.microsoft.com/en-us/azure/databricks/jobs/how-to/create-recurring-job
为经典Lakeflow Jobs应用最佳实践https://learn.microsoft.com/en-us/azure/databricks/jobs/run-classic-jobs
在Databricks湖仓上应用成本优化最佳实践https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/cost-optimization/best-practices
在Databricks上实施数据与AI治理最佳实践https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/data-governance/best-practices
在Databricks上应用互操作性与可用性最佳实践https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/interoperability-and-usability/best-practices
为Databricks湖仓应用运营卓越最佳实践https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/operational-excellence/best-practices
为Databricks湖仓应用性能效率最佳实践https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/performance-efficiency/best-practices
为Databricks湖仓应用可靠性最佳实践https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/reliability/best-practices
在Databricks上实施安全、合规与隐私最佳实践https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/security-compliance-and-privacy/best-practices
使用增强型自动缩放优化管道集群https://learn.microsoft.com/en-us/azure/databricks/ldp/auto-scaling
为Lakeflow Spark声明式管道应用最佳实践https://learn.microsoft.com/en-us/azure/databricks/ldp/best-practices
使用高级AUTO CDC功能并监控处理指标https://learn.microsoft.com/en-us/azure/databricks/ldp/cdc-advanced
为Lakeflow管道应用开发与测试最佳实践https://learn.microsoft.com/en-us/azure/databricks/ldp/develop
在Lakeflow管道中管理Python依赖项https://learn.microsoft.com/en-us/azure/databricks/ldp/developer/external-dependencies
应用高级期望模式与扩展策略https://learn.microsoft.com/en-us/azure/databricks/ldp/expectation-patterns
通过重构流减少管道初始化延迟https://learn.microsoft.com/en-us/azure/databricks/ldp/fix-high-init
使用Lakeflow管道编辑器开发和调试ETL管道https://learn.microsoft.com/en-us/azure/databricks/ldp/multi-file-editor
使用遗留笔记本体验开发Lakeflow管道https://learn.microsoft.com/en-us/azure/databricks/ldp/notebook-devex
在管道中使用水印优化有状态流处理https://learn.microsoft.com/en-us/azure/databricks/ldp/stateful-processing
在Databricks中设计CDC和快照模式https://learn.microsoft.com/en-us/azure/databricks/ldp/what-is-change-data-capture
重启Python进程以刷新Databricks库https://learn.microsoft.com/en-us/azure/databricks/libraries/restart-python-process
在Databricks上应用Hyperopt最佳实践与故障排除https://learn.microsoft.com/en-us/azure/databricks/machine-learning/automl-hyperparam-tuning/hyperopt-best-practices
实现时间点正确的特征连接https://learn.microsoft.com/en-us/azure/databricks/machine-learning/feature-store/time-series
在Databricks上加载和准备ML数据https://learn.microsoft.com/en-us/azure/databricks/machine-learning/load-data/
使用已注册模型在Spark DataFrame上执行批量推理https://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-inference/dl-model-inference
为Databricks端点配置基于Locust的负载测试https://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/configure-load-test
在Databricks Model Serving部署前验证模型https://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/model-serving-pre-deployment-validation
为生产环境优化Databricks Model Serving端点https://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/production-optimization
规划并执行Databricks服务端点的负载测试https://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/what-is-load-test
在Databricks上调优和扩展Ray集群https://learn.microsoft.com/en-us/azure/databricks/machine-learning/ray/scale-ray
在Databricks上实施分布式图像推理https://learn.microsoft.com/en-us/azure/databricks/machine-learning/reference-solutions/images-etl-inference
在Azure Databricks上遵循深度学习最佳实践https://learn.microsoft.com/en-us/azure/databricks/machine-learning/train-model/dl-best-practices
在Databricks的单个GPU上微调Hugging Face模型https://learn.microsoft.com/en-us/azure/databricks/machine-learning/train-model/huggingface/fine-tune-model
在Databricks上为Hugging Face微调准备数据集https://learn.microsoft.com/en-us/azure/databricks/machine-learning/train-model/huggingface/load-data
为Databricks指标视图应用数据建模最佳实践https://learn.microsoft.com/en-us/azure/databricks/metric-views/data-modeling/
在指标视图中应用组合模式https://learn.microsoft.com/en-us/azure/databricks/metric-views/data-modeling/composability
在Databricks指标视图YAML中定义连接https://learn.microsoft.com/en-us/azure/databricks/metric-views/data-modeling/joins
在Databricks指标视图中使用语义元数据https://learn.microsoft.com/en-us/azure/databricks/metric-views/data-modeling/semantic-metadata
在指标视图中实现窗口度量https://learn.microsoft.com/en-us/azure/databricks/metric-views/data-modeling/window-measures
使用物化优化指标视图查询https://learn.microsoft.com/en-us/azure/databricks/metric-views/materialization
将现有Apache Spark工作负载适配到Azure Databrickshttps://learn.microsoft.com/en-us/azure/databricks/migration/spark
使MLflow LLM评判器与人工评估者保持一致https://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/eval-monitor/align-judges
MLflow基于代码的评分器开发工作流https://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/eval-monitor/custom-scorer-dev-workflow
使用MLflow GEPA自动优化提示词https://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/prompt-version-mgmt/prompt-registry/automatically-optimize-prompts
评估和比较MLflow提示词版本https://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/prompt-version-mgmt/prompt-registry/evaluate-prompts
对生产环境生成式AI应用使用手动MLflow追踪https://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/tracing/app-instrumentation/manual-tracing/
分析生成式AI追踪数据以排查错误和性能问题https://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/tracing/observe-with-traces/analyze-traces
安全高效地运行Databricks笔记本https://learn.microsoft.com/en-us/azure/databricks/notebooks/run-notebook
监控和分析活跃Lakebase查询https://learn.microsoft.com/en-us/azure/databricks/oltp/projects/active-queries
在Lakebase中实现基于分支的开发https://learn.microsoft.com/en-us/azure/databricks/oltp/projects/dev-workflow-tutorial
分析Lakebase查询性能历史https://learn.microsoft.com/en-us/azure/databricks/oltp/projects/query-performance
遵循Databricks性能优化指南https://learn.microsoft.com/en-us/azure/databricks/optimizations/
在Databricks上使用自适应查询执行https://learn.microsoft.com/en-us/azure/databricks/optimizations/aqe
在Databricks SQL中利用基于成本的优化器https://learn.microsoft.com/en-us/azure/databricks/optimizations/cbo
使用Databricks磁盘缓存提升读取性能https://learn.microsoft.com/en-us/azure/databricks/optimizations/disk-cache
使用动态文件修剪加速查询https://learn.microsoft.com/en-us/azure/databricks/optimizations/dynamic-file-pruning
使用低混洗合并优化Delta MERGEhttps://learn.microsoft.com/en-us/azure/databricks/optimizations/low-shuffle-merge
使用预测性I/O加速数据访问https://learn.microsoft.com/en-us/azure/databricks/optimizations/predictive-io
调优Azure Databricks范围连接性能https://learn.microsoft.com/en-us/azure/databricks/optimizations/range-join
在UI中诊断Databricks Spark成本与性能https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/
使用Spark作业时间线调试Databricks工作负载https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/jobs-timeline
诊断Databricks中长时间运行的Spark作业https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/long-spark-stage
在Databricks UI中分析高I/O Spark阶段https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/long-spark-stage-io
调试Databricks Spark阶段中的倾斜和溢出问题https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/long-spark-stage-page
有效处理Databricks抢占式实例丢失问题https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/losing-spot-instances
解决单个任务导致的Spark阶段长时间运行问题https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/one-spark-task
调试Databricks中低I/O的缓慢Spark阶段https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/slow-spark-stage-low-io
优化Databricks上的大量小型Spark作业https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/small-spark-jobs
在Databricks Spark DAG中识别高成本读取https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/spark-dag-expensive-read
缓解Databricks上Spark驱动程序过载问题https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/spark-driver-overloaded
诊断Databricks中Spark作业之间的间隙https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/spark-job-gaps
检测Databricks Spark写入中的不必要数据重写https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/spark-rewriting-data
设置Databricks Partner Connect的最佳实践https://learn.microsoft.com/en-us/azure/databricks/partner-connect/best-practice
为Databricks湖仓联邦配置网络https://learn.microsoft.com/en-us/azure/databricks/query-federation/networking
优化Databricks湖仓联邦查询性能https://learn.microsoft.com/en-us/azure/databricks/query-federation/performance-recommendations
在Databricks集群中加密节点间流量https://learn.microsoft.com/en-us/azure/databricks/security/keys/encrypt-otw
优化复杂和嵌套数据类型的转换https://learn.microsoft.com/en-us/azure/databricks/semi-structured/complex-types
在Databricks SQL中使用高阶函数处理数组https://learn.microsoft.com/en-us/azure/databricks/semi-structured/higher-order-functions
在Databricks SQL中正确使用VOID(NULL)类型https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/data-types/null-type
在Databricks中处理OBJECT类型和VARIANT架构https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/data-types/object-type
在Databricks中使用TIMESTAMP_NTZ类型和Delta支持https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/data-types/timestamp-ntz-type
在Databricks中使用VARIANT类型和Iceberg兼容性https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/data-types/variant-type
使用ANALYZE TABLE收集表统计信息以优化性能https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/sql-ref-syntax-aux-analyze-compute-statistics
使用提示词优化Databricks SQL查询https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/sql-ref-syntax-qry-select-hints
使用TPC-DS示例数据集基准测试Databricks SQLhttps://learn.microsoft.com/en-us/azure/databricks/sql/tpcds-eval
使用Databricks SQL查询缓存提升性能https://learn.microsoft.com/en-us/azure/databricks/sql/user/queries/query-caching
有效使用Databricks SQL查询过滤器https://learn.microsoft.com/en-us/azure/databricks/sql/user/queries/query-filters
在Databricks中使用主键约束优化查询https://learn.microsoft.com/en-us/azure/databricks/sql/user/queries/query-optimization-constraints
在Databricks SQL中使用查询参数https://learn.microsoft.com/en-us/azure/databricks/sql/user/queries/query-parameters
在Databricks SQL中创建和使用查询片段https://learn.microsoft.com/en-us/azure/databricks/sql/user/queries/query-snippets
在Databricks上正确使用结构化流处理检查点https://learn.microsoft.com/en-us/azure/databricks/structured-streaming/checkpoints
在Databricks上实现Delta Lake流处理读写https://learn.microsoft.com/en-us/azure/databricks/structured-streaming/delta-lake
在Databricks上选择结构化流处理输出模式https://learn.microsoft.com/en-us/azure/databricks/structured-streaming/output-mode
为生产环境优化Databricks结构化流处理https://learn.microsoft.com/en-us/azure/databricks/structured-streaming/production
优化Databricks上的无状态结构化流处理查询https://learn.microsoft.com/en-us/azure/databricks/structured-streaming/stateless-streaming
使用Databricks工具监控结构化流处理查询https://learn.microsoft.com/en-us/azure/databricks/structured-streaming/stream-monitoring
将Unity Catalog与结构化流处理工作负载结合使用https://learn.microsoft.com/en-us/azure/databricks/structured-streaming/unity-catalog
使用水印实现高效的有状态流处理https://learn.microsoft.com/en-us/azure/databricks/structured-streaming/watermarks
优化Unity Catalog外部表的分区发现https://learn.microsoft.com/en-us/azure/databricks/tables/external-partition-discovery
分析Databricks表大小和存储成本https://learn.microsoft.com/en-us/azure/databricks/tables/size
使用批处理、流处理和视图聚合数据https://learn.microsoft.com/en-us/azure/databricks/transform/aggregation
为Azure Databricks设计优化的数据模型https://learn.microsoft.com/en-us/azure/databricks/transform/data-modeling
在Databricks批处理和流处理中有效使用连接https://learn.microsoft.com/en-us/azure/databricks/transform/join
为Azure Databricks工作负载优化连接性能https://learn.microsoft.com/en-us/azure/databricks/transform/optimize-joins
在Azure Databricks上实施数据清理和验证https://learn.microsoft.com/en-us/azure/databricks/transform/validate
优化Mosaic AI向量搜索性能https://learn.microsoft.com/en-us/azure/databricks/vector-search/vector-search-best-practices
为向量搜索端点设计并运行负载测试https://learn.microsoft.com/en-us/azure/databricks/vector-search/vector-search-endpoint-load-test
提升Mosaic AI向量搜索检索质量https://learn.microsoft.com/en-us/azure/databricks/vector-search/vector-search-retrieval-quality

Decision Making

决策制定

TopicURL
Plan migration from Databricks Standard to Premium tierhttps://learn.microsoft.com/en-us/azure/databricks/admin/account-settings/standard-tier
Evaluate and create Azure Databricks serverless workspaceshttps://learn.microsoft.com/en-us/azure/databricks/admin/workspace/serverless-workspaces
Decide and migrate from dbx to Databricks bundleshttps://learn.microsoft.com/en-us/azure/databricks/archive/dev-tools/dbx/dbx-migrate
Migrate optimized LLM endpoints to provisioned throughputhttps://learn.microsoft.com/en-us/azure/databricks/archive/machine-learning/migrate-provisioned-throughput
Decide and migrate to Databricks Runtime 10.xhttps://learn.microsoft.com/en-us/azure/databricks/archive/runtime-release-notes/10.x-migration
Migrate workloads to Databricks Runtime 11.xhttps://learn.microsoft.com/en-us/azure/databricks/archive/runtime-release-notes/11.x-migration
Migrate workloads to Databricks Runtime 12.xhttps://learn.microsoft.com/en-us/azure/databricks/archive/runtime-release-notes/12.x-migration
Plan migration to Databricks Runtime 13.xhttps://learn.microsoft.com/en-us/azure/databricks/archive/runtime-release-notes/13.x-migration
Plan migration to Databricks Runtime 14.xhttps://learn.microsoft.com/en-us/azure/databricks/archive/runtime-release-notes/14.x-migration
Use Databricks Runtime 6.4 Extended Support strategicallyhttps://learn.microsoft.com/en-us/azure/databricks/archive/runtime-release-notes/6.4x
Plan migration to Databricks Runtime 7.3 LTShttps://learn.microsoft.com/en-us/azure/databricks/archive/runtime-release-notes/7.3-migration
Migrate workloads from Databricks Runtime 6.x to 7.xhttps://learn.microsoft.com/en-us/azure/databricks/archive/runtime-release-notes/7.x-migration
Plan migration to Databricks Runtime 9.1 LTShttps://learn.microsoft.com/en-us/azure/databricks/archive/runtime-release-notes/9.1-migration
Plan migration of Databricks workloads to Spark 3.xhttps://learn.microsoft.com/en-us/azure/databricks/archive/spark-3.x-migration/
Migrate from Deep Learning Pipelines to newer Databricks MLhttps://learn.microsoft.com/en-us/azure/databricks/archive/spark-3.x-migration/deep-learning-pipelines
Select and manage the default Unity Catalog cataloghttps://learn.microsoft.com/en-us/azure/databricks/catalogs/default
Select the right Databricks compute type for workloadshttps://learn.microsoft.com/en-us/azure/databricks/compute/choose-compute
Decide when and how to use GPU Databricks computehttps://learn.microsoft.com/en-us/azure/databricks/compute/gpu
Decide when to use Databricks pools vs serverlesshttps://learn.microsoft.com/en-us/azure/databricks/compute/pool-index
Plan Databricks SQL warehouse sizing and queuinghttps://learn.microsoft.com/en-us/azure/databricks/compute/sql-warehouse/warehouse-behavior
Choose between Databricks SQL warehouse typeshttps://learn.microsoft.com/en-us/azure/databricks/compute/sql-warehouse/warehouse-types
Choose and configure Azure Databricks data connectionshttps://learn.microsoft.com/en-us/azure/databricks/connect/
Plan and execute upgrade to Unity Cataloghttps://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/upgrade/
Choose between Delta Sharing, Marketplace, and Clean Roomshttps://learn.microsoft.com/en-us/azure/databricks/data-sharing/
Choose Delta Lake protocol and featureshttps://learn.microsoft.com/en-us/azure/databricks/delta/feature-compatibility
Choose local development tools for Azure Databrickshttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/
Migrate from legacy to new Databricks CLI versionshttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/cli/migrate
Manage Databricks account budget policies via CLIhttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/cli/reference/account-budget-policy-commands
Configure Databricks account budgets using CLIhttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/cli/reference/account-budgets-commands
Manage Databricks account usage dashboards via CLIhttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/cli/reference/account-usage-dashboards-commands
Choose appropriate compute size for Databricks Appshttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-apps/compute-size
Migrate Python projects to new Databricks Connect runtimeshttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-connect/python/migrate
Migrate from legacy to new Scala Databricks Connecthttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-connect/scala/migrate
Choose and use Databricks SDKs for automationhttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/sdks
Select SQL connectors and tools for Azure Databrickshttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/sql-drivers-tools
Decide between CDKTF and Databricks Terraform providerhttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/terraform/cdktf
Select Unity Catalog integration approach by enginehttps://learn.microsoft.com/en-us/azure/databricks/external-access/integrations
Migrate Databricks Community Edition to Free Editionhttps://learn.microsoft.com/en-us/azure/databricks/getting-started/ce-migration
Choose between Databricks Free Edition and free trialhttps://learn.microsoft.com/en-us/azure/databricks/getting-started/free-trial-vs-free-edition
Choose between Auto Loader directory listing and file notificationhttps://learn.microsoft.com/en-us/azure/databricks/ingestion/cloud-object-storage/auto-loader/file-detection-modes
Plan migration of existing data to Delta Lake on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/ingestion/data-migration/
Migrate from Simba Spark to Databricks ODBC Driverhttps://learn.microsoft.com/en-us/azure/databricks/integrations/odbc/migration
Migrate from Spark Submit task to supported Lakeflow taskshttps://learn.microsoft.com/en-us/azure/databricks/jobs/spark-submit
Select a development language for Databrickshttps://learn.microsoft.com/en-us/azure/databricks/languages/overview
Choose between triggered and continuous pipeline modeshttps://learn.microsoft.com/en-us/azure/databricks/ldp/pipeline-mode
Migrate online feature tables to Lakebasehttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/feature-store/migrate-from-online-tables
Migrate Databricks models and workflows to Unity Cataloghttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/manage-model-lifecycle/migrate-to-uc
Upgrade Databricks ML workflows to Unity Cataloghttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/manage-model-lifecycle/upgrade-workflows
Choose Databricks options for batch model inferencehttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-inference/
Migrate from legacy MLflow serving to Mosaic AI Model Servinghttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/migrate-model-serving
Decide when to use Spark vs. Ray on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/ray/spark-ray-overview
Decide when and how to use distributed training on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/train-model/distributed-training/
Plan migration of data applications to Azure Databrickshttps://learn.microsoft.com/en-us/azure/databricks/migration/
Assess options for migrating ETL pipelines to Databrickshttps://learn.microsoft.com/en-us/azure/databricks/migration/etl
Choose a migration path from Parquet to Delta Lakehttps://learn.microsoft.com/en-us/azure/databricks/migration/parquet-to-delta-lake
Migrate enterprise data warehouses to the Databricks lakehousehttps://learn.microsoft.com/en-us/azure/databricks/migration/warehouse-to-lakehouse
Decide and migrate from Agent Evaluation to MLflow 3https://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/agent-eval-migration
Quick reference for migrating to MLflow 3https://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/agent-eval-migration-reference
Plan and adjust Lakebase instance capacityhttps://learn.microsoft.com/en-us/azure/databricks/oltp/instances/create/capacity
Evaluate Lakebase Postgres Autoscaling capabilities and use caseshttps://learn.microsoft.com/en-us/azure/databricks/oltp/projects/about
Choose Lakebase backup and restore methodshttps://learn.microsoft.com/en-us/azure/databricks/oltp/projects/backup-methods
Choose how to connect applications to Lakebase Postgreshttps://learn.microsoft.com/en-us/azure/databricks/oltp/projects/connect-application
Understand default autoscaling behavior for new Lakebase instanceshttps://learn.microsoft.com/en-us/azure/databricks/oltp/upgrade-to-autoscaling
Choose and configure incremental refresh for materialized viewshttps://learn.microsoft.com/en-us/azure/databricks/optimizations/incremental-refresh
Choose pandas options and patterns on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/pandas/
Use pandas API on Spark effectively on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/pandas/pandas-on-spark
Migrate legacy Databricks query federation to Lakehouse Federationhttps://learn.microsoft.com/en-us/azure/databricks/query-federation/migrate
Choose appropriate Azure Databricks preview release typehttps://learn.microsoft.com/en-us/azure/databricks/release-notes/release-types
Decide on Databricks runtime and feature lifecycle supporthttps://learn.microsoft.com/en-us/azure/databricks/release-notes/runtime/databricks-runtime-ver
Interpret serverless DBU consumption by Azure Databricks SKUhttps://learn.microsoft.com/en-us/azure/databricks/resources/pricing
Decide between VARIANT and JSON strings for semi-structured datahttps://learn.microsoft.com/en-us/azure/databricks/semi-structured/variant-json-diff
Decide between Spark Connect and Spark Classichttps://learn.microsoft.com/en-us/azure/databricks/spark/connect-vs-classic
Choose between SparkR and sparklyr on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/sparkr/sparkr-vs-sparklyr
Migrate to the latest Databricks SQL REST APIhttps://learn.microsoft.com/en-us/azure/databricks/sql/dbsql-api-latest
Choose synchronous vs asynchronous state checkpointing in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/structured-streaming/async-checkpointing
Choose interactive vs non-interactive Databricks transactionshttps://learn.microsoft.com/en-us/azure/databricks/transactions/transaction-modes
Choose cost-efficient Mosaic AI Vector Search endpointshttps://learn.microsoft.com/en-us/azure/databricks/vector-search/vector-search-cost-management
主题链接
规划从Databricks标准版到高级版的迁移https://learn.microsoft.com/en-us/azure/databricks/admin/account-settings/standard-tier
评估并创建Azure Databricks无服务器工作区https://learn.microsoft.com/en-us/azure/databricks/admin/workspace/serverless-workspaces
决定并从dbx迁移到Databricks bundleshttps://learn.microsoft.com/en-us/azure/databricks/archive/dev-tools/dbx/dbx-migrate
将优化后的LLM端点迁移到预配吞吐量https://learn.microsoft.com/en-us/azure/databricks/archive/machine-learning/migrate-provisioned-throughput
决定并迁移到Databricks Runtime 10.xhttps://learn.microsoft.com/en-us/azure/databricks/archive/runtime-release-notes/10.x-migration
将工作负载迁移到Databricks Runtime 11.xhttps://learn.microsoft.com/en-us/azure/databricks/archive/runtime-release-notes/11.x-migration
将工作负载迁移到Databricks Runtime 12.xhttps://learn.microsoft.com/en-us/azure/databricks/archive/runtime-release-notes/12.x-migration
规划迁移到Databricks Runtime 13.xhttps://learn.microsoft.com/en-us/azure/databricks/archive/runtime-release-notes/13.x-migration
规划迁移到Databricks Runtime 14.xhttps://learn.microsoft.com/en-us/azure/databricks/archive/runtime-release-notes/14.x-migration
战略性使用Databricks Runtime 6.4扩展支持https://learn.microsoft.com/en-us/azure/databricks/archive/runtime-release-notes/6.4x
规划迁移到Databricks Runtime 7.3 LTShttps://learn.microsoft.com/en-us/azure/databricks/archive/runtime-release-notes/7.3-migration
将工作负载从Databricks Runtime 6.x迁移到7.xhttps://learn.microsoft.com/en-us/azure/databricks/archive/runtime-release-notes/7.x-migration
规划迁移到Databricks Runtime 9.1 LTShttps://learn.microsoft.com/en-us/azure/databricks/archive/runtime-release-notes/9.1-migration
规划Databricks工作负载到Spark 3.x的迁移https://learn.microsoft.com/en-us/azure/databricks/archive/spark-3.x-migration/
从深度学习管道迁移到较新的Databricks MLhttps://learn.microsoft.com/en-us/azure/databricks/archive/spark-3.x-migration/deep-learning-pipelines
选择并管理默认Unity Catalog目录https://learn.microsoft.com/en-us/azure/databricks/catalogs/default
为工作负载选择合适的Databricks计算资源类型https://learn.microsoft.com/en-us/azure/databricks/compute/choose-compute
决定何时以及如何使用GPU Databricks计算资源https://learn.microsoft.com/en-us/azure/databricks/compute/gpu
决定何时使用Databricks资源池与无服务器https://learn.microsoft.com/en-us/azure/databricks/compute/pool-index
规划Databricks SQL仓库大小和排队https://learn.microsoft.com/en-us/azure/databricks/compute/sql-warehouse/warehouse-behavior
在Databricks SQL仓库类型之间选择https://learn.microsoft.com/en-us/azure/databricks/compute/sql-warehouse/warehouse-types
选择并配置Azure Databricks数据连接https://learn.microsoft.com/en-us/azure/databricks/connect/
规划并执行Unity Catalog升级https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/upgrade/
在Delta Sharing、Marketplace和Clean Rooms之间选择https://learn.microsoft.com/en-us/azure/databricks/data-sharing/
选择Delta Lake协议和功能https://learn.microsoft.com/en-us/azure/databricks/delta/feature-compatibility
为Azure Databricks选择本地开发工具https://learn.microsoft.com/en-us/azure/databricks/dev-tools/
从旧版迁移到新版Databricks CLIhttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/cli/migrate
通过CLI管理Databricks账户预算策略https://learn.microsoft.com/en-us/azure/databricks/dev-tools/cli/reference/account-budget-policy-commands
使用CLI配置Databricks账户预算https://learn.microsoft.com/en-us/azure/databricks/dev-tools/cli/reference/account-budgets-commands
通过CLI管理Databricks账户使用情况仪表板https://learn.microsoft.com/en-us/azure/databricks/dev-tools/cli/reference/account-usage-dashboards-commands
为Databricks Apps选择合适的计算资源大小https://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-apps/compute-size
将Python项目迁移到新版Databricks Connect运行时https://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-connect/python/migrate
从旧版迁移到新版Scala Databricks Connecthttps://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-connect/scala/migrate
选择并使用Databricks SDK进行自动化https://learn.microsoft.com/en-us/azure/databricks/dev-tools/sdks
为Azure Databricks选择SQL连接器和工具https://learn.microsoft.com/en-us/azure/databricks/dev-tools/sql-drivers-tools
在CDKTF与Databricks Terraform提供程序之间选择https://learn.microsoft.com/en-us/azure/databricks/dev-tools/terraform/cdktf
按引擎选择Unity Catalog集成方式https://learn.microsoft.com/en-us/azure/databricks/external-access/integrations
将Databricks社区版迁移到免费版https://learn.microsoft.com/en-us/azure/databricks/getting-started/ce-migration
在Databricks免费版与免费试用版之间选择https://learn.microsoft.com/en-us/azure/databricks/getting-started/free-trial-vs-free-edition
在Auto Loader目录列出与文件通知模式之间选择https://learn.microsoft.com/en-us/azure/databricks/ingestion/cloud-object-storage/auto-loader/file-detection-modes
规划现有数据到Databricks上Delta Lake的迁移https://learn.microsoft.com/en-us/azure/databricks/ingestion/data-migration/
从Simba Spark迁移到Databricks ODBC驱动https://learn.microsoft.com/en-us/azure/databricks/integrations/odbc/migration
从Spark Submit任务迁移到受支持的Lakeflow任务https://learn.microsoft.com/en-us/azure/databricks/jobs/spark-submit
为Databricks选择开发语言https://learn.microsoft.com/en-us/azure/databricks/languages/overview
在触发式与持续管道模式之间选择https://learn.microsoft.com/en-us/azure/databricks/ldp/pipeline-mode
将在线特征表迁移到Lakebasehttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/feature-store/migrate-from-online-tables
将Databricks模型和工作流迁移到Unity Cataloghttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/manage-model-lifecycle/migrate-to-uc
将Databricks ML工作流升级到Unity Cataloghttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/manage-model-lifecycle/upgrade-workflows
为批处理模型推理选择Databricks选项https://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-inference/
从旧版MLflow服务迁移到Mosaic AI Model Servinghttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/migrate-model-serving
决定何时在Databricks上使用Spark与Rayhttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/ray/spark-ray-overview
决定何时以及如何在Databricks上使用分布式训练https://learn.microsoft.com/en-us/azure/databricks/machine-learning/train-model/distributed-training/
规划数据应用到Azure Databricks的迁移https://learn.microsoft.com/en-us/azure/databricks/migration/
评估将ETL管道迁移到Databricks的选项https://learn.microsoft.com/en-us/azure/databricks/migration/etl
选择从Parquet到Delta Lake的迁移路径https://learn.microsoft.com/en-us/azure/databricks/migration/parquet-to-delta-lake
将企业数据仓库迁移到Databricks湖仓https://learn.microsoft.com/en-us/azure/databricks/migration/warehouse-to-lakehouse
决定并从Agent Evaluation迁移到MLflow 3https://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/agent-eval-migration
迁移到MLflow 3的快速参考https://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/agent-eval-migration-reference
规划并调整Lakebase实例容量https://learn.microsoft.com/en-us/azure/databricks/oltp/instances/create/capacity
评估Lakebase Postgres自动缩放功能和用例https://learn.microsoft.com/en-us/azure/databricks/oltp/projects/about
选择Lakebase备份与恢复方法https://learn.microsoft.com/en-us/azure/databricks/oltp/projects/backup-methods
选择如何将应用连接到Lakebase Postgreshttps://learn.microsoft.com/en-us/azure/databricks/oltp/projects/connect-application
了解新版Lakebase实例的默认自动缩放行为https://learn.microsoft.com/en-us/azure/databricks/oltp/upgrade-to-autoscaling
选择并配置物化视图的增量刷新https://learn.microsoft.com/en-us/azure/databricks/optimizations/incremental-refresh
在Databricks上选择pandas选项和模式https://learn.microsoft.com/en-us/azure/databricks/pandas/
在Databricks上有效使用Spark上的pandas APIhttps://learn.microsoft.com/en-us/azure/databricks/pandas/pandas-on-spark
将旧版Databricks查询联邦迁移到湖仓联邦https://learn.microsoft.com/en-us/azure/databricks/query-federation/migrate
选择合适的Azure Databricks预览发布类型https://learn.microsoft.com/en-us/azure/databricks/release-notes/release-types
决定Databricks运行时和功能生命周期支持https://learn.microsoft.com/en-us/azure/databricks/release-notes/runtime/databricks-runtime-ver
按Azure Databricks SKU解读无服务器DBU消耗https://learn.microsoft.com/en-us/azure/databricks/resources/pricing
在VARIANT与JSON字符串之间为半结构化数据选择https://learn.microsoft.com/en-us/azure/databricks/semi-structured/variant-json-diff
在Spark Connect与Spark Classic之间选择https://learn.microsoft.com/en-us/azure/databricks/spark/connect-vs-classic
在Databricks上选择SparkR与sparklyrhttps://learn.microsoft.com/en-us/azure/databricks/sparkr/sparkr-vs-sparklyr
迁移到最新Databricks SQL REST APIhttps://learn.microsoft.com/en-us/azure/databricks/sql/dbsql-api-latest
在Databricks中选择同步与异步状态检查点https://learn.microsoft.com/en-us/azure/databricks/structured-streaming/async-checkpointing
在交互式与非交互式Databricks事务之间选择https://learn.microsoft.com/en-us/azure/databricks/transactions/transaction-modes
选择具有成本效益的Mosaic AI向量搜索端点https://learn.microsoft.com/en-us/azure/databricks/vector-search/vector-search-cost-management

Architecture & Design Patterns

架构与设计模式

TopicURL
Implement fan-in and fan-out patterns in Lakeflow pipelineshttps://learn.microsoft.com/en-us/azure/databricks/data-engineering/fan-in-fan-out
Design multi-agent supervisor systems with Agent Brickshttps://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-bricks/multi-agent-supervisor
Build Databricks multi-agent orchestrator appshttps://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-framework/multi-agent-apps
Create Genie-based multi-agent systems on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-framework/multi-agent-genie
Build non-conversational Databricks AI agents with MLflowhttps://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-framework/non-conversational-agents
Implement AI agent memory with Databricks Lakehousehttps://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-framework/stateful-agents
Implement AI agent memory on Databricks Model Servinghttps://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-framework/stateful-agents-model-serving
Apply Databricks design patterns for AI agentshttps://learn.microsoft.com/en-us/azure/databricks/generative-ai/guide/agent-system-design-patterns
Design and tune Databricks RAG inference chainshttps://learn.microsoft.com/en-us/azure/databricks/generative-ai/tutorials/ai-cookbook/fundamentals-inference-chain-rag
Architect cost-optimized Databricks lakehouse solutionshttps://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/cost-optimization/
Design data and AI governance architecture for the lakehousehttps://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/data-governance/
Apply guiding architectural principles for Databricks lakehousehttps://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/guiding-principles
Architect interoperability and usability for Databricks lakehousehttps://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/interoperability-and-usability/
Architect operational excellence for the Databricks lakehousehttps://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/operational-excellence/
Architect performance efficiency for Databricks lakehousehttps://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/performance-efficiency/
Use Databricks lakehouse reference architectures on Azurehttps://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/reference
Architect reliability for the Databricks data lakehousehttps://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/reliability/
Apply Databricks well-architected lakehouse frameworkhttps://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/well-architected
Apply Databricks data lakehouse architecture patternhttps://learn.microsoft.com/en-us/azure/databricks/lakehouse/
Apply medallion lakehouse architecture on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/lakehouse/medallion
Replicate external RDBMS tables to Databricks using AUTO CDChttps://learn.microsoft.com/en-us/azure/databricks/ldp/database-replication
Choose Databricks ML model deployment patternshttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/mlops/deployment-patterns
Design LLMOps workflows on Azure Databrickshttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/mlops/llmops
Implement MLOps workflows on Azure Databrickshttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/mlops/mlops-workflow
Choose and train deep-learning recommenders in Databrickshttps://learn.microsoft.com/en-us/azure/databricks/machine-learning/train-recommender-models
Use Lakebase branches for safe database evolutionhttps://learn.microsoft.com/en-us/azure/databricks/oltp/projects/branches
Understand Lakebase autoscaling, branches, and read replicashttps://learn.microsoft.com/en-us/azure/databricks/oltp/projects/core-concepts
Design high availability for Lakebase Postgres computeshttps://learn.microsoft.com/en-us/azure/databricks/oltp/projects/high-availability
Scale reads with Lakebase Postgres read replicashttps://learn.microsoft.com/en-us/azure/databricks/oltp/projects/read-replicas
Understand and apply Databricks catalog federationhttps://learn.microsoft.com/en-us/azure/databricks/query-federation/catalog-federation
Plan Hive metastore federation with Unity Cataloghttps://learn.microsoft.com/en-us/azure/databricks/query-federation/hms-federation-concepts
Choose patterns for modeling semi-structured data on Databrickshttps://learn.microsoft.com/en-us/azure/databricks/semi-structured/
Decide when to partition Databricks tableshttps://learn.microsoft.com/en-us/azure/databricks/tables/partitions
主题链接
在Lakeflow管道中实现扇入和扇出模式https://learn.microsoft.com/en-us/azure/databricks/data-engineering/fan-in-fan-out
使用Agent Bricks设计多代理监督系统https://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-bricks/multi-agent-supervisor
构建Databricks多代理编排器应用https://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-framework/multi-agent-apps
在Databricks上创建基于Genie的多代理系统https://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-framework/multi-agent-genie
使用MLflow构建非对话式Databricks AI代理https://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-framework/non-conversational-agents
使用Databricks湖仓实现AI代理内存https://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-framework/stateful-agents
在Databricks Model Serving上实现AI代理内存https://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-framework/stateful-agents-model-serving
为AI代理应用Databricks设计模式https://learn.microsoft.com/en-us/azure/databricks/generative-ai/guide/agent-system-design-patterns
设计并调优Databricks RAG推理链https://learn.microsoft.com/en-us/azure/databricks/generative-ai/tutorials/ai-cookbook/fundamentals-inference-chain-rag
构建具有成本优化的Databricks湖仓解决方案https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/cost-optimization/
为湖仓设计数据与AI治理架构https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/data-governance/
为Databricks湖仓应用指导性架构原则https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/guiding-principles
为Databricks湖仓构建互操作性与可用性https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/interoperability-and-usability/
为Databricks湖仓构建运营卓越性https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/operational-excellence/
为Databricks湖仓构建性能效率https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/performance-efficiency/
在Azure上使用Databricks湖仓参考架构https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/reference
为Databricks数据湖仓构建可靠性https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/reliability/
应用Databricks架构完善的湖仓框架https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/well-architected
应用Databricks数据湖仓架构模式https://learn.microsoft.com/en-us/azure/databricks/lakehouse/
在Databricks上应用medallion湖仓架构https://learn.microsoft.com/en-us/azure/databricks/lakehouse/medallion
使用AUTO CDC将外部RDBMS表复制到Databrickshttps://learn.microsoft.com/en-us/azure/databricks/ldp/database-replication
选择Databricks ML模型部署模式https://learn.microsoft.com/en-us/azure/databricks/machine-learning/mlops/deployment-patterns
在Azure Databricks上设计LLMOps工作流https://learn.microsoft.com/en-us/azure/databricks/machine-learning/mlops/llmops
在Azure Databricks上实施MLOps工作流https://learn.microsoft.com/en-us/azure/databricks/machine-learning/mlops/mlops-workflow
在Databricks中选择并训练深度学习推荐器https://learn.microsoft.com/en-us/azure/databricks/machine-learning/train-recommender-models
使用Lakebase分支实现安全的数据库演进https://learn.microsoft.com/en-us/azure/databricks/oltp/projects/branches
了解Lakebase自动缩放、分支和只读副本https://learn.microsoft.com/en-us/azure/databricks/oltp/projects/core-concepts
为Lakebase Postgres计算资源设计高可用性https://learn.microsoft.com/en-us/azure/databricks/oltp/projects/high-availability
使用Lakebase Postgres只读副本扩展读取https://learn.microsoft.com/en-us/azure/databricks/oltp/projects/read-replicas
理解并应用Databricks目录联邦https://learn.microsoft.com/en-us/azure/databricks/query-federation/catalog-federation
规划Hive元数据与Unity Catalog的联邦https://learn.microsoft.com/en-us/azure/databricks/query-federation/hms-federation-concepts
为Databricks上的半结构化数据选择建模模式https://learn.microsoft.com/en-us/azure/databricks/semi-structured/
决定何时对Databricks表进行分区https://learn.microsoft.com/en-us/azure/databricks/tables/partitions