Total 30,738 skills, Data Processing has 1471 skills
Showing 12 of 1471 skills
Generate styled word clouds from text with custom shapes, colors, fonts, and stopword filtering. Supports PNG/SVG export and frequency dictionaries.
Extract vendor, date, items, amounts, and total from receipt images using OCR and pattern matching with structured JSON output.
Auto-generate features with encodings, scaling, polynomial features, and interaction terms for ML pipelines.
Statistical scoring with z-scores, percentiles, freshness decay, and cross-category normalization. Rank and compare items with confidence scoring.
Extract structured data from 40+ websites including Amazon, LinkedIn, Instagram, TikTok, Facebook, YouTube, and more. Uses Bright Data's Web Data APIs with automatic polling. Returns clean JSON with product details, profiles, reviews, posts, and comments.
MANDATORY when working with time-series data, hypertables, continuous aggregates, or compression - enforces TimescaleDB 2.24.0 best practices including lightning-fast recompression, UUIDv7 continuous aggregates, and Direct Compress
Install ADBC (Arrow Database Connectivity) drivers with dbc. Use when the user wants to install database drivers and connect to databases.
Apache Airflow workflow orchestration. Use for data pipelines.
Apache Cassandra distributed database for high availability. Use for distributed systems.
R statistical programming for data analysis, visualization, and modeling. Use for .r files.
Use this skill for AIRR-seq (Adaptive Immune Receptor Repertoire / VDJ-seq) data analysis with immunarch + immundata in R, including ingestion, receptor schema design, immutable transformations, clonality/diversity/public overlap metrics, and Seurat/AnnData integration.
Perl text processing and scripting with regular expressions. Use for .pl files.