bigquery

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Chinese

BigQuery is Google's serverless, highly scalable, and cost-effective multi-cloud data warehouse. It processes terabytes in seconds.

BigQuery是谷歌推出的无服务器、高可扩展且经济高效的多云数据仓库。它能在数秒内处理TB级数据。

Serverless Analytics: No infrastructure to manage. Just run SQL.
Real-time Analytics: High-speed streaming ingestion.
ML Integration:
```
CREATE MODEL
```
lets you train ML models using standard SQL (BigQuery ML).

sql

-- Standard SQL
SELECT name, COUNT(*) as count
FROM `bigquery-public-data.usa_names.usa_1910_2013`
GROUP BY name
ORDER BY count DESC
LIMIT 10;

sql

-- Standard SQL
SELECT name, COUNT(*) as count
FROM `bigquery-public-data.usa_names.usa_1910_2013`
GROUP BY name
ORDER BY count DESC
LIMIT 10;

A "Slot" is a unit of computational capacity. BigQuery autoscales slots, or you can reserve them for flat-rate pricing.

“Slot”是计算能力的单位。BigQuery可以自动扩展计算槽，您也可以通过预留计算槽享受固定费率定价。

Optimized for aggregation queries. Reading one column is much cheaper/faster than reading all columns (

SELECT *

is expensive).

专为聚合查询优化。读取单列数据比读取所有列（

SELECT *

）成本更低、速度更快。

Partitioning: Splits table by Date/Int (e.g., Daily partitions). Prunes data scanning massive cost savings.
Clustering: Sorts data within partitions for faster filtering.

Do:

Partition by Date: Almost mandatory for time-series logs.
Use BigQuery ML: Train models (Regression, K-Means) directly where data lives.
Estimate Cost:
```
Dry Run
```
your query to see how many bytes it will scan before running it.

Don't:

Don't run
SELECT *
: You pay per column read. Select only what you need.
Don't treat it like an OLTP: Single row inserts are slow (unless using Streaming API). It is for bulk analytics.

建议做法：

避免做法：