Search Results: dataset

Found 331 Skills

tabular-eda

Profile a new tabular dataset before modeling. Find target leakage, missing data patterns, high-cardinality categoricals, near-constant features, redundant pairs, and non-linear relationships that Pearson correlation misses. Use whenever the user hands you a CSV or parquet and asks "what should I do with this?" Always run this skill before training any model on data you haven't seen before.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningnvidia/skills

vss-generate-video-calibration

Use to run AutoMagicCalib on local MP4s, RTSP, or the bundled sample dataset, and to deploy vss-auto-calibration when needed. Not for non-AMC calibration or runtime analytics.

🇺🇸|EnglishTranslated

Data Processingastronomer/agents

profiling-tables

Deep-dive data profiling for a specific table. Use when the user asks to profile a table, wants statistics about a dataset, asks about data quality, or needs to understand a table's structure and content. Requires a table name.

🇺🇸|EnglishTranslated

Backend Developmentaj-geddes/useful-ai-promp...

background-job-processing

Implement background job processing systems with task queues, workers, scheduling, and retry mechanisms. Use when handling long-running tasks, sending emails, generating reports, and processing large datasets asynchronously.

🇺🇸|EnglishTranslated

Data Processingdkyazzentwatwa/chatgpt-sk...

dataset-comparer

Compare two datasets to find differences, added/removed rows, changed values. Use for data validation, ETL verification, or tracking changes.

🇺🇸|EnglishTranslated

1 scripts/Checked

Frontend Developmentponpon55837/mariokartworl...

react-table-search-pagination

Complete Development Guide for Tables, Search, and Pagination Features in React/Next.js Projects. Covers core technologies such as race condition handling, search system implementation, pagination systems, infinite scrolling, CRUD synchronization, Intersection Observer API, and state management selection. Key Features: - Handle race condition issues in asynchronous requests - Implement high-performance search and autocomplete features - Build professional-grade pagination systems and caching strategies - Develop smooth infinite scrolling experiences - Ensure data consistency for CRUD operations - Select the most suitable state management solution Applicable Scenarios: - React/Next.js applications requiring search and pagination features - List display and CRUD operations for large datasets - Need for high-performance infinite scrolling or virtualized lists - Facing complex data management issues such as race conditions and state synchronization - Projects needing to select an appropriate state management solution

🇨🇳|ChineseTranslated

Data Processingphuryn/pm-skills

dummy-dataset

Generate realistic dummy datasets for testing with customizable columns, constraints, and output formats (CSV, JSON, SQL, Python script). Use when creating test data, building mock datasets, or generating sample data for development and demos.

🇺🇸|EnglishTranslated

Automationnstbrowser/nstbrowser-ai-...

nstbrowser-ai-agent

Browser automation CLI with Nstbrowser integration for AI agents. Use when the user needs advanced browser fingerprinting, profile management, proxy configuration, batch operations on multiple browser profiles, or cursor-based pagination for large datasets. Triggers include requests to "use NST profile", "configure proxy for profile", "manage browser profiles", "batch update profiles", "start multiple browsers", "list profiles with pagination", or any task requiring Nstbrowser's anti-detection features.

🇺🇸|EnglishTranslated

3 scripts/Attention

Frontend Developmentantvis/l7

antv-l7

Comprehensive guide for AntV L7 geospatial visualization library. Use when users need to: (1) Create interactive maps with WebGL rendering (2) Visualize geographic data (points, lines, polygons, heatmaps) (3) Build location-based data dashboards (4) Add map layers, interactions, or animations (5) Process and display GeoJSON, CSV, or other spatial data (6) Integrate maps with AMap (GaodeMap), Mapbox, Maplibre, or standalone L7 Map (7) Optimize performance for large-scale geographic datasets

🇺🇸|EnglishTranslated

Data Processingwithqwerty/nutmeg

nutmeg-wrangle

Transform, filter, reshape, join, and manipulate football data. Use when the user needs to clean data, merge datasets, convert between formats, handle missing values, work with large datasets, or do any data manipulation task on football data.

🇺🇸|EnglishTranslated

Data Processingagentic-reserve/blockint-...

blockchain-spider-toolkit

Points to the BlockchainSpider open-source Python/Scrapy toolkit for collecting on-chain data—transfer subgraphs around an address or tx, EVM and Solana block/transaction ingestion, receipts/logs, and optional label plugins. Use when the user wants to build datasets, offline traces, or research pipelines alongside blockchain-analytics-operations and solana-tracing-specialist—not as a substitute for RPC provider ToS, rate limits, or legal review of sensitive crawls.

🇺🇸|EnglishTranslated

AI & Machine Learningvllm-project/vllm-skills

vllm-bench-random-synthetic

Run vLLM performance benchmark using synthetic random data to measure throughput, TTFT (Time to First Token), TPOT (Time per Output Token), and other key performance metrics. Use when the user wants to quickly test vLLM serving performance without downloading external datasets.

🇺🇸|EnglishTranslated