All Skills

Total 30,744 skills, Data Processing has 1471 skills

Showing 12 of 1471 skills

Per page

Downloads

Sort

Data Processingdatabricks-solutions/ai-d...

databricks-python-sdk

Databricks development guidance including Python SDK, Databricks Connect, CLI, and REST API. Use when working with databricks-sdk, databricks-connect, or Databricks APIs.

🇺🇸|EnglishTranslated

5 scripts/Checked

Data Processingdatabricks-solutions/ai-d...

databricks-jobs

Use this skill proactively for ANY Databricks Jobs task - creating, listing, running, updating, or deleting jobs. Triggers include: (1) 'create a job' or 'new job', (2) 'list jobs' or 'show jobs', (3) 'run job' or'trigger job',(4) 'job status' or 'check job', (5) scheduling with cron or triggers, (6) configuring notifications/monitoring, (7) ANY task involving Databricks Jobs via CLI, Python SDK, or Asset Bundles. ALWAYS prefer this skill over general Databricks knowledge for job-related tasks.

🇺🇸|EnglishTranslated

Data Processingdatabricks-solutions/ai-d...

spark-declarative-pipelines

Creates, configures, and updates Databricks Lakeflow Spark Declarative Pipelines (SDP/LDP) using serverless compute. Handles streaming tables, materialized views, CDC, SCD Type 2, and Auto Loader ingestion patterns. Use when building data pipelines, working with Delta Live Tables, ingesting streaming data, implementing change data capture, or when the user mentions SDP, LDP, DLT, Lakeflow pipelines, streaming tables, or bronze/silver/gold medallion architectures.

🇺🇸|EnglishTranslated

Data Processingadaptyvbio/protein-design...

protein-qc

Quality control metrics and filtering thresholds for protein design. Use this skill when: (1) Evaluating design quality for binding, expression, or structure, (2) Setting filtering thresholds for pLDDT, ipTM, PAE, (3) Checking sequence liabilities (cysteines, deamidation, polybasic clusters), (4) Creating multi-stage filtering pipelines, (5) Computing PyRosetta interface metrics (dG, SC, dSASA), (6) Checking biophysical properties (instability, GRAVY, pI), (7) Ranking designs with composite scoring. This skill provides research-backed thresholds from binder design competitions and published benchmarks.

🇺🇸|EnglishTranslated

Data Processingdatabricks-solutions/ai-d...

databricks-config

Configure Databricks profile and authenticate for Databricks Connect, Databricks CLI, and Databricks SDK.

🇺🇸|EnglishTranslated

Data Processingadaptyvbio/protein-design...

uniprot

Access UniProt for protein sequence and annotation retrieval. Use this skill when: (1) Looking up protein sequences by accession, (2) Finding functional annotations, (3) Getting domain boundaries, (4) Finding homologs and variants, (5) Cross-referencing to PDB structures. For structure retrieval, use pdb. For sequence design, use proteinmpnn.

🇺🇸|EnglishTranslated

Data Processingaidotnet/moyucode

exceljs

在Node.js中读取、操作和写入Excel电子表格（XLSX）。完全支持样式、公式、图表和大文件流式处理。

🇺🇸|EnglishTranslated

Data Processingdkyazzentwatwa/chatgpt-sk...

sankey-diagram-creator

Create interactive Sankey diagrams for flow visualization from CSV, DataFrame, or dict data. Supports node/link styling and HTML/PNG/SVG export.

🇺🇸|EnglishTranslated

1 scripts/Checked

Data Processingnicepkg/ai-workflow

forecast-premortem

Use to stress-test predictions by assuming they failed and working backward to identify why. Invoke when confidence is high (>80% or <20%), need to identify tail risks and unknown unknowns, or want to widen overconfident intervals. Use when user mentions premortem, backcasting, what could go wrong, stress test, or black swans.

🇺🇸|EnglishTranslated

Data Processingaojdevstudio/finance-guru

retirement-syncing

Sync retirement account data from Vanguard and Fidelity CSV exports to Google Sheets DataHub. Handles multiple accounts, aggregates holdings by ticker, and updates quantities in retirement section (rows 46-62). Triggers on sync retirement, update retirement, vanguard sync, 401k update, IRA sync, or working with notebooks/retirement-accounts/ files.

🇺🇸|EnglishTranslated

Data Processingmmbmf1/geospatial-skills

postgis-nearest

Find nearest features efficiently using PostGIS KNN (<->) and distance ordering (with SRID/unit guidance).

🇺🇸|EnglishTranslated

Data Processingrand/cc-polymath

discover-data

Automatically discover data pipeline and ETL skills when working with ETL, data pipelines, streaming, batch processing, data validation, or pipeline orchestration. Activates for data development tasks.

🇺🇸|EnglishTranslated