datasets
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseGoldsky Dataset Reference
Goldsky数据集参考
Reference tables for blockchain datasets available in Turbo pipelines.
For quick dataset questions (e.g., "what dataset for Solana transfers?"), answer directly: identify the chain prefix (see Popular Chain Prefixes below), identify the dataset type (see Common Datasets), and return a YAML snippet like:
yaml
sources:
my_source:
type: dataset
dataset_name: <chain>.<dataset>
version: 1.0.0
start_at: earliestTip: Useto verify a dataset exists (fast, ~3 seconds). Avoidgoldsky turbo validatewhich is slow (30-60+ seconds).goldsky dataset list
Turbo流水线中可用的区块链数据集参考表。
对于快速数据集问题(例如“Solana转账对应的数据集是什么?”),请直接回答:确认链前缀(见下方常用链前缀),确认数据集类型(见常用数据集),然后返回如下YAML片段:
yaml
sources:
my_source:
type: dataset
dataset_name: <chain>.<dataset>
version: 1.0.0
start_at: earliest提示: 使用验证数据集是否存在(速度快,约3秒)。避免使用速度很慢的goldsky turbo validate(需要30-60秒以上)。goldsky dataset list
Dataset Reference Files
数据集参考文件
Detailed dataset and chain information is in thefolder.data/
| File | Contents |
|---|---|
| All validated datasets with versions, schemas, and use cases |
| All chain prefixes, chain IDs, and common mistakes |
Data location: (relative to this skill's directory)
data/详细的数据集和链信息存放在文件夹中。data/
| 文件 | 内容说明 |
|---|---|
| 所有已验证的数据集,包含版本、schema和使用场景 |
| 所有链前缀、链ID以及常见错误说明 |
数据位置: (相对于本技能的目录)
data/Quick Reference
快速参考
| Action | Command | Notes |
|---|---|---|
| Validate dataset | | Preferred - fast (3s) |
| Search for dataset | | Slow (30-60s), use sparingly |
| List all datasets | | Very slow - avoid |
| 操作 | 命令 | 说明 |
|---|---|---|
| 验证数据集 | | 推荐使用 - 速度快(3秒) |
| 搜索数据集 | | 速度慢(30-60秒),谨慎使用 |
| 列出所有数据集 | | 速度极慢 - 避免使用 |
Common Datasets
常用数据集
| What You Need | Dataset | Example |
|---|---|---|
| Token transfers (ERC-20) | | |
| NFT transfers (ERC-721) | | |
| Transactions | | |
| Event logs | | |
| Solana tokens | | v1.0.0 |
| Bitcoin transactions | | v1.0.0 |
| Stellar transfers | | v1.1.0 |
Important: Use, NOTraw_transactionstransactions
| 所需数据类型 | 数据集名称 | 示例 |
|---|---|---|
| 代币转账(ERC-20) | | |
| NFT转账(ERC-721) | | |
| 交易记录 | | |
| 事件日志 | | |
| Solana代币 | | v1.0.0 |
| 比特币交易记录 | | v1.0.0 |
| Stellar转账 | | v1.1.0 |
重要提示: 请使用,不要使用raw_transactionstransactions
Popular Chain Prefixes
常用链前缀
| Chain | Prefix | Note |
|---|---|---|
| Ethereum | | |
| Base | | |
| Polygon | | NOT |
| Arbitrum | | |
| Optimism | | |
| BSC | | |
| Avalanche | | |
| Solana | | Uses |
| Bitcoin | | Uses |
| Stellar | | Uses |
| Sui | | Uses |
| NEAR | | Uses |
| Starknet | | Uses |
| Fogo | | Uses |
See for complete list with chain IDs.
data/chain-prefixes.json| 链名称 | 前缀 | 注意事项 |
|---|---|---|
| Ethereum | | |
| Base | | |
| Polygon | | 不要使用 |
| Arbitrum | | |
| Optimism | | |
| BSC | | |
| Avalanche | | |
| Solana | | 使用 |
| Bitcoin | | 和EVM一样使用 |
| Stellar | | 和EVM一样使用 |
| Sui | | 和EVM一样使用 |
| NEAR | | 和EVM一样使用 |
| Starknet | | 和EVM一样使用 |
| Fogo | | 和EVM一样使用 |
完整的链ID列表请查看。
data/chain-prefixes.jsonCommon Dataset Types
通用数据集类型
EVM Chains
EVM链
| Dataset Type | Description | Use Case |
|---|---|---|
| Block headers with metadata | Block explorers, timing analysis |
| Transaction data | Wallet activity, gas analysis |
| Raw event logs | Custom event filtering |
| Internal transaction traces | MEV analysis, contract interactions |
| Fungible token transfers | Token tracking, DeFi analytics |
| NFT transfers | NFT marketplaces, ownership tracking |
| Multi-token transfers | Gaming, multi-token standards |
| ABI-decoded event logs | Specific contract events |
Important: Use, NOTraw_transactions. Usetransactions, NOTraw_logs(thoughlogsworks as an alias on some chains).logs
| 数据集类型 | 说明 | 使用场景 |
|---|---|---|
| 带元数据的区块头 | 区块浏览器、时序分析 |
| 交易数据 | 钱包活动、gas费用分析 |
| 原始事件日志 | 自定义事件过滤 |
| 内部交易追踪 | MEV分析、合约交互 |
| 同质化代币转账 | 代币追踪、DeFi分析 |
| NFT转账 | NFT市场、所有权跟踪 |
| 多类型代币转账 | 游戏、多代币标准 |
| ABI解码后的事件日志 | 特定合约事件查询 |
重要提示: 请使用,不要使用raw_transactions。请使用transactions,不要使用raw_logs(部分链支持logs作为别名)。logs
Solana
Solana
| Dataset Type | Description | Use Case |
|---|---|---|
| Block data with leader info | Chain analysis |
| Transaction data with balances | Wallet activity |
| Transactions + nested instructions | Multi-instruction analysis |
| Individual instructions | Program-specific analysis |
| SPL token transfers | Token tracking |
| SOL balance changes | Whale tracking |
| SPL token balance changes | Portfolio tracking |
| Validator rewards | Staking analysis |
| 数据集类型 | 说明 | 使用场景 |
|---|---|---|
| 包含出块节点信息的区块数据 | 链数据分析 |
| 包含余额的交易数据 | 钱包活动 |
| 交易+嵌套指令 | 多指令分析 |
| 单条指令 | 特定程序分析 |
| SPL代币转账 | 代币追踪 |
| SOL余额变动 | 巨鲸追踪 |
| SPL代币余额变动 | 投资组合追踪 |
| 验证节点奖励 | 质押分析 |
Bitcoin
Bitcoin
| Dataset Type | Description | Use Case |
|---|---|---|
| Block data (hash, difficulty, size) | Network analysis |
| Transactions (inputs, outputs, values) | Payment tracking |
| 数据集类型 | 说明 | 使用场景 |
|---|---|---|
| 区块数据(哈希、难度、大小) | 网络分析 |
| 交易数据(输入、输出、金额) | 支付跟踪 |
Stellar
Stellar
All datasets use version :
1.1.0| Dataset Type | Description | Use Case |
|---|---|---|
| All network transactions | Account monitoring |
| All transfer events | Asset tracking |
| All events (contract + operation) | Contract monitoring |
| Operations within transactions | Action tracking |
| Ledger state changes | State analysis |
| Ledger metadata | Network analysis |
| Account balance changes | Balance tracking |
所有数据集版本均为:
1.1.0| 数据集类型 | 说明 | 使用场景 |
|---|---|---|
| 全网络交易记录 | 账户监控 |
| 所有转账事件 | 资产追踪 |
| 所有事件(合约+操作) | 合约监控 |
| 交易内的操作 | 行为跟踪 |
| 账本状态变更 | 状态分析 |
| 账本元数据 | 网络分析 |
| 账户余额变更 | 余额追踪 |
Sui
Sui
| Dataset Type | Description | Use Case |
|---|---|---|
| Checkpoint data | Chain analysis |
| Transaction data | Activity monitoring |
| Move contract events | dApp event tracking |
| Deployed Move packages | Package discovery |
| Epoch data with validators | Staking/validator analysis |
| 数据集类型 | 说明 | 使用场景 |
|---|---|---|
| 检查点数据 | 链数据分析 |
| 交易数据 | 活动监控 |
| Move合约事件 | dApp事件跟踪 |
| 已部署的Move包 | 包发现 |
| 包含验证节点信息的Epoch数据 | 质押/验证节点分析 |
NEAR
NEAR
| Dataset Type | Description | Use Case |
|---|---|---|
| Execution receipts | Contract interaction tracking |
| Signed transactions | Activity monitoring |
| Execution results | Success/failure analysis |
| 数据集类型 | 说明 | 使用场景 |
|---|---|---|
| 执行收据 | 合约交互跟踪 |
| 已签名交易 | 活动监控 |
| 执行结果 | 成功/失败分析 |
Starknet
Starknet
| Dataset Type | Description | Use Case |
|---|---|---|
| Block data | Chain analysis |
| Transaction data | Activity monitoring |
| Contract events | dApp event tracking |
| L1↔L2 messages | Bridge monitoring |
| 数据集类型 | 说明 | 使用场景 |
|---|---|---|
| 区块数据 | 链数据分析 |
| 交易数据 | 活动监控 |
| 合约事件 | dApp事件跟踪 |
| L1↔L2消息 | 跨链桥监控 |
Fogo
Fogo
| Dataset Type | Description | Use Case |
|---|---|---|
| Transactions with instructions | Full activity tracking |
| Validator rewards | Staking analysis |
| Block data | Chain analysis |
| 数据集类型 | 说明 | 使用场景 |
|---|---|---|
| 带指令的交易 | 全量活动追踪 |
| 验证节点奖励 | 质押分析 |
| 区块数据 | 链数据分析 |
Dataset Schemas
数据集Schema
Source: docs.goldsky.com. Do not use field names not listed here — ask the user to runto inspect unknown schemas.goldsky dataset list
来源: docs.goldsky.com。请勿使用此处未列出的字段名——如果遇到未知schema,请让用户执行查看。goldsky dataset list
Solana
Solana
solana.transactions
solana.transactionssolana.transactions
solana.transactions| Field | Type | Notes |
|---|---|---|
| string | |
| integer | tx position in block |
| integer | slot number |
| string | |
| timestamp | |
| string | transaction signature |
| string | |
| integer | in lamports |
| integer | 1 = success |
| string | null | error if failed |
| string[] | all involved accounts |
| object[] | |
| string[] | program execution logs |
| integer |
Noorfrom_addresson Solana transactions — useto_addressarray instead.accounts
| 字段 | 类型 | 说明 |
|---|---|---|
| string | |
| integer | 交易在区块中的位置 |
| integer | 槽位号 |
| string | |
| timestamp | |
| string | 交易签名 |
| string | |
| integer | 单位为lamports |
| integer | 1 = 交易成功 |
| string | null | 交易失败时返回错误信息 |
| string[] | 所有涉及的账户 |
| object[] | |
| string[] | 程序执行日志 |
| integer |
Solana交易没有或from_address字段 —— 请使用to_address数组代替。accounts
solana.transactions_with_instructions
solana.transactions_with_instructionssolana.transactions_with_instructions
solana.transactions_with_instructionsAll fields from plus:
solana.transactions| Field | Type | Notes |
|---|---|---|
| object[] | token balances before tx |
| object[] | token balances after tx |
| object[] | see below |
Instruction object fields: , , , , , , , , , (base58), (string[]), ,
idindexparent_indexblock_slotblock_timestampblock_hashtx_feetx_indexprogram_iddataaccountsstatuserr包含的所有字段,额外新增:
solana.transactions| 字段 | 类型 | 说明 |
|---|---|---|
| object[] | 交易前的代币余额 |
| object[] | 交易后的代币余额 |
| object[] | 见下方说明 |
指令对象字段: , , , , , , , , , (base58), (string[]), ,
idindexparent_indexblock_slotblock_timestampblock_hashtx_feetx_indexprogram_iddataaccountsstatuserrsolana.instructions
solana.instructionssolana.instructions
solana.instructions| Field | Type | Notes |
|---|---|---|
| string | |
| integer | position in tx |
| integer | null | for inner instructions |
| integer | |
| timestamp | |
| string | |
| string | executing program address |
| string | base58 encoded |
| string[] | instruction accounts |
| integer | |
| string | null |
| 字段 | 类型 | 说明 |
|---|---|---|
| string | |
| integer | 指令在交易中的位置 |
| integer | null | 内部指令对应的父指令索引 |
| integer | |
| timestamp | |
| string | |
| string | 执行程序的地址 |
| string | base58编码 |
| string[] | 指令涉及的账户 |
| integer | |
| string | null |
solana.token_transfers
solana.token_transferssolana.token_transfers
solana.token_transfers| Field | Type | Notes |
|---|---|---|
| string | |
| string | mint address |
| string | source token account |
| string | dest token account |
| number | raw amount |
| integer | token decimals |
| integer | |
| timestamp | |
| string | tx signature |
| 字段 | 类型 | 说明 |
|---|---|---|
| string | |
| string | 代币发行地址 |
| string | 源代币账户 |
| string | 目标代币账户 |
| number | 原始金额 |
| integer | 代币精度 |
| integer | |
| timestamp | |
| string | 交易签名 |
solana.native_balances
solana.native_balancessolana.native_balances
solana.native_balances| Field | Type | Notes |
|---|---|---|
| string | |
| integer | slot number |
| string | |
| timestamp | |
| integer | transaction position in block |
| string | transaction signature |
| string | account pubkey |
| integer | lamports |
| integer | lamports |
| string | Goldsky internal operation type |
| 字段 | 类型 | 说明 |
|---|---|---|
| string | |
| integer | 槽位号 |
| string | |
| timestamp | |
| integer | 交易在区块中的位置 |
| string | 交易签名 |
| string | 账户公钥 |
| integer | 单位为lamports |
| integer | 单位为lamports |
| string | Goldsky内部操作类型 |
solana.blocks
solana.blockssolana.blocks
solana.blocks| Field | Type | Notes |
|---|---|---|
| string | |
| integer | |
| integer | |
| string | |
| timestamp | |
| integer | |
| string | |
| integer | |
| string | validator pubkey |
| integer | lamports |
| boolean |
| 字段 | 类型 | 说明 |
|---|---|---|
| string | |
| integer | |
| integer | |
| string | |
| timestamp | |
| integer | |
| string | |
| integer | |
| string | 验证节点公钥 |
| integer | 单位为lamports |
| boolean |
solana.rewards
solana.rewardssolana.rewards
solana.rewards| Field | Type | Notes |
|---|---|---|
| string | |
| integer | |
| string | |
| timestamp | |
| string | validator pubkey |
| integer | reward amount |
| integer | balance after reward |
| string | |
| integer |
| 字段 | 类型 | 说明 |
|---|---|---|
| string | |
| integer | |
| string | |
| timestamp | |
| string | 验证节点公钥 |
| integer | 奖励金额 |
| integer | 奖励后的余额 |
| string | |
| integer |
solana.token_balances
solana.token_balancessolana.token_balances
solana.token_balancesSchema not fully documented — do not guess field names. Inspect with.goldsky dataset list | grep solana.token_balances
Schema未完全文档化 —— 不要猜测字段名。请使用查看。goldsky dataset list \| grep solana.token_balances
EVM Chains
EVM链
<chain>.raw_logs
/ <chain>.logs
<chain>.raw_logs<chain>.logs<chain>.raw_logs
/ <chain>.logs
<chain>.raw_logs<chain>.logs| Field | Type | Notes |
|---|---|---|
| string | |
| integer | |
| string | |
| string | |
| integer | |
| integer | |
| string | contract address (lowercase) |
| string | hex encoded event data |
| string | comma-separated hex topic hashes |
| integer | unix timestamp |
is a comma-separated string, not an array. Topic 0 is the event signature hash.topics
| 字段 | 类型 | 说明 |
|---|---|---|
| string | |
| integer | |
| string | |
| string | |
| integer | |
| integer | |
| string | 合约地址(小写) |
| string | 十六进制编码的事件数据 |
| string | 逗号分隔的十六进制主题哈希 |
| integer | Unix时间戳 |
是逗号分隔的字符串,不是数组。第0个主题是事件签名哈希。topics
<chain>.raw_transactions
<chain>.raw_transactions<chain>.raw_transactions
<chain>.raw_transactions| Field | Type | Notes |
|---|---|---|
| string | |
| string | |
| integer | |
| string | |
| integer | |
| integer | |
| string | |
| string | |
| decimal | ETH value in wei |
| decimal | |
| decimal | |
| string | hex calldata |
| integer | |
| integer | unix timestamp |
| decimal | |
| string | null | if contract creation |
| integer | 1 = success |
| decimal |
L2 chains also include:,receipt_l1_fee,receipt_l1_gas_used,receipt_l1_gas_pricereceipt_l1_fee_scalar
| 字段 | 类型 | 说明 |
|---|---|---|
| string | |
| string | |
| integer | |
| string | |
| integer | |
| integer | |
| string | |
| string | |
| decimal | ETH金额,单位为wei |
| decimal | |
| decimal | |
| string | 十六进制calldata |
| integer | |
| integer | Unix时间戳 |
| decimal | |
| string | null | 合约创建时返回该字段 |
| integer | 1 = 成功 |
| decimal |
L2链还包含以下字段:,receipt_l1_fee,receipt_l1_gas_used,receipt_l1_gas_pricereceipt_l1_fee_scalar
<chain>.blocks
<chain>.blocks<chain>.blocks
<chain>.blocks| Field | Type | Notes |
|---|---|---|
| string | |
| integer | block number |
| string | |
| string | |
| string | |
| integer | |
| integer | |
| integer | unix timestamp |
| integer | |
| integer | |
| double |
| 字段 | 类型 | 说明 |
|---|---|---|
| string | |
| integer | 区块号 |
| string | |
| string | |
| string | |
| integer | |
| integer | |
| integer | Unix时间戳 |
| integer | |
| integer | |
| double |
<chain>.erc20_transfers
<chain>.erc20_transfers<chain>.erc20_transfers
<chain>.erc20_transfers| Field | Type | Notes |
|---|---|---|
| string | |
| string | from address |
| string | to address |
| decimal | token amount |
| string | token contract address |
| integer | |
| integer | unix timestamp |
| string | |
| string | |
| integer | |
| integer |
| 字段 | 类型 | 说明 |
|---|---|---|
| string | |
| string | 发送方地址 |
| string | 接收方地址 |
| decimal | 代币金额 |
| string | 代币合约地址 |
| integer | |
| integer | Unix时间戳 |
| string | |
| string | |
| integer | |
| integer |
<chain>.erc721_transfers
<chain>.erc721_transfers<chain>.erc721_transfers
<chain>.erc721_transfers| Field | Type | Notes |
|---|---|---|
| string | |
| string | |
| string | |
| decimal | |
| string | NFT contract address |
| integer | |
| integer | unix timestamp |
| string | |
| string | |
| integer | |
| integer |
| 字段 | 类型 | 说明 |
|---|---|---|
| string | |
| string | |
| string | |
| decimal | |
| string | NFT合约地址 |
| integer | |
| integer | Unix时间戳 |
| string | |
| string | |
| integer | |
| integer |
Dataset Name Format
数据集名称格式
All datasets follow the pattern:
<chain_prefix>.<dataset_type>Examples:
- - ERC-20 transfers on Ethereum mainnet
ethereum.erc20_transfers - - All event logs on Base
base.logs - - Block data on Polygon
matic.blocks - - SPL token transfers on Solana
solana.token_transfers
所有数据集遵循以下格式:
<chain_prefix>.<dataset_type>示例:
- - 以太坊主网的ERC-20转账记录
ethereum.erc20_transfers - - Base链的所有事件日志
base.logs - - Polygon的区块数据
matic.blocks - - Solana的SPL代币转账记录
solana.token_transfers
Finding Dataset Versions
查找数据集版本
Datasets are versioned. To find available versions:
bash
goldsky dataset list | grep "base.erc20"Common versions:
- - Initial version
1.0.0 - - Enhanced schema (common for ERC-20 transfers)
1.2.0
When in doubt, use the latest version shown in .
goldsky dataset list数据集是版本化管理的。要查找可用版本:
bash
goldsky dataset list \| grep "base.erc20"常用版本:
- - 初始版本
1.0.0 - - 增强schema(ERC-20转账数据集常用)
1.2.0
如有疑问,请使用中展示的最新版本。
goldsky dataset listCommon Discovery Patterns
常见使用场景示例
"I want to track USDC transfers on Base"
"我想追踪Base链上的USDC转账"
- Dataset:
base.erc20_transfers - Filter by contract address in your pipeline transform:
yaml
transforms:
usdc_only:
type: sql
primary_key: id
sql: |
SELECT * FROM source_name
WHERE address = lower('0x833589fCD6eDb6E08f4c7C32D4f71b54bdA02913')- 数据集:
base.erc20_transfers - 在流水线转换中按合约地址过滤:
yaml
transforms:
usdc_only:
type: sql
primary_key: id
sql: |
SELECT * FROM source_name
WHERE address = lower('0x833589fCD6eDb6E08f4c7C32D4f71b54bdA02913')"I want all NFT activity on Ethereum"
"我想要以太坊上的所有NFT活动"
Dataset:
ethereum.erc721_transfers数据集:
ethereum.erc721_transfers"I want to monitor a specific smart contract"
"我想监控某个特定的智能合约"
- Dataset: for raw events, or
<chain>.logsfor decoded events<chain>.decoded_logs - Filter by contract address in your transform
- 数据集:原始事件用,解码后事件用
<chain>.logs<chain>.decoded_logs - 在转换中按合约地址过滤
"I need multi-chain data"
"我需要多链数据"
Use multiple sources in your pipeline:
yaml
sources:
eth_transfers:
type: dataset
dataset_name: ethereum.erc20_transfers
version: 1.0.0
start_at: latest
base_transfers:
type: dataset
dataset_name: base.erc20_transfers
version: 1.2.0
start_at: latest在你的流水线中使用多个数据源:
yaml
sources:
eth_transfers:
type: dataset
dataset_name: ethereum.erc20_transfers
version: 1.0.0
start_at: latest
base_transfers:
type: dataset
dataset_name: base.erc20_transfers
version: 1.2.0
start_at: latestTroubleshooting
问题排查
Dataset not found
找不到数据集
Error: Source 'my_source' references unknown dataset 'invalid.dataset'Fix:
- Check the chain prefix is correct (e.g., not
matic)polygon - Check the dataset type exists (e.g., not
erc20_transfers)erc20 - Run to see all available options
goldsky dataset list
Error: Source 'my_source' references unknown dataset 'invalid.dataset'解决方案:
- 检查链前缀是否正确(例如用而非
matic)polygon - 检查数据集类型是否存在(例如用而非
erc20_transfers)erc20 - 执行查看所有可用选项
goldsky dataset list
Chain not listed
链未列出
If you can't find a chain in the tables above:
bash
goldsky dataset list | grep -i "<chain_name>"Some chains use non-obvious prefixes (e.g., Polygon uses ).
matic如果你在上方表格中找不到对应的链:
bash
goldsky dataset list \| grep -i "<chain_name>"部分链的前缀不直观(例如Polygon使用)。
maticVersion mismatch
版本不匹配
Error: Version '2.0.0' not found for dataset 'base.erc20_transfers'Fix: Check available versions:
bash
goldsky dataset list | grep "base.erc20_transfers"Use a version that exists in the output.
Error: Version '2.0.0' not found for dataset 'base.erc20_transfers'解决方案: 检查可用版本:
bash
goldsky dataset list \| grep "base.erc20_transfers"使用输出中存在的版本即可。
Related
相关内容
- — Interactive wizard to build pipelines using these datasets
/turbo-builder
- —— 使用这些数据集构建流水线的交互式向导
/turbo-builder