Total 50,396 skills, Data Processing has 2557 skills
Showing 12 of 2557 skills
Create publication-quality plots and visualizations using matplotlib and seaborn. Works with ANY LLM provider (GPT, Gemini, Claude, etc.).
Expert data analysis and manipulation for customer support operations using pandas
Generate styled word clouds from text with custom shapes, colors, fonts, and stopword filtering. Supports PNG/SVG export and frequency dictionaries.
Comprehensive data quality patterns using Great Expectations, DLT expectations, and custom validators for ensuring data reliability and trust.
Process Excel files, supporting reading, analysis, statistics and export of xlsx data
读取、写入和操作Excel文件(.xlsx、.xls)。创建电子表格、读取数据并导出为各种格式。
Build apps on Databricks Apps platform. Use when asked to create dashboards, data apps, analytics tools, or visualizations. Invoke BEFORE starting implementation.
Extract tables from PDFs and images to CSV or Excel. Support for scanned documents with OCR, multi-page PDFs, and complex table structures.
Diagnose ClickHouse issues by analyzing system.part_log (part creation, merges, mutations, downloads, removals, moves). Use for too many parts / micro-batch inserts, merge backlog or slow merges, mutation storms (ALTER DELETE/UPDATE), unusual replication DownloadPart churn, unexpected RemovePart spikes, or ZooKeeper/Keeper znode growth correlated with part activity.
Split Excel workbooks into separate Excel files by worksheet, with each worksheet generating an individual file. Application scenarios: (1) Split multi-worksheet Excel files into separate files, (2) Extract specific worksheets as independent files, (3) Distribute worksheets from merged workbooks, (4) Create copies of worksheets for separate processing or distribution.
Optimize strategy parameters using VectorBT. Tests parameter combinations and generates heatmaps.
Write and debug TypeQL queries for TypeDB 3.8+. Use when working with TypeDB schemas, data queries, insertions, deletions, or functions. Covers schema definition, CRUD operations, pattern matching, aggregations, and common pitfalls.