Found 10 Skills
Complete guide for Apache Spark data processing including RDDs, DataFrames, Spark SQL, streaming, MLlib, and production deployment
Use when reading from or writing to Neo4j with Apache Spark or Databricks using the Neo4j Connector for Apache Spark (org.neo4j:neo4j-connector-apache-spark). Covers SparkSession setup, DataFrame reads via labels/Cypher/relationship scan, DataFrame writes with SaveMode, node.keys for MERGE, relationship write mapping, partition and batch tuning, PySpark and Scala examples, Databricks cluster config, Databricks secrets for credentials, Delta Lake to Neo4j pipelines. Does NOT handle Cypher authoring — use neo4j-cypher-skill. Does NOT handle the Python bolt driver — use neo4j-driver-python-skill. Does NOT handle GDS algorithms — use neo4j-gds-skill.
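The connector workflow this entry describes (label-based DataFrame reads, writes with SaveMode, `node.keys` for MERGE semantics) can be sketched as follows. This is a minimal PySpark sketch, not the skill's own code: the bolt URL, credentials, connector version, and label/property names are placeholder assumptions, and it needs a live Neo4j instance plus the connector package on the classpath to actually run.

```python
# Sketch (assumes a reachable Neo4j at localhost:7687; all credentials,
# labels, and property names below are illustrative placeholders).
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .appName("neo4j-connector-sketch")
    # Connector coordinates from the description above; pick the artifact
    # version matching your Spark/Scala build.
    .config("spark.jars.packages",
            "org.neo4j:neo4j-connector-apache-spark_2.12:5.3.0_for_spark_3")
    .getOrCreate()
)

# Label-based read: load all :Person nodes as a DataFrame.
people = (
    spark.read.format("org.neo4j.spark.DataSource")
    .option("url", "neo4j://localhost:7687")
    .option("authentication.basic.username", "neo4j")
    .option("authentication.basic.password", "secret")
    .option("labels", "Person")
    .load()
)

# Write back with MERGE semantics: node.keys names the property the
# connector matches on, so existing nodes are updated, not duplicated.
(
    people.write.format("org.neo4j.spark.DataSource")
    .mode("Overwrite")
    .option("url", "neo4j://localhost:7687")
    .option("authentication.basic.username", "neo4j")
    .option("authentication.basic.password", "secret")
    .option("labels", ":Person")
    .option("node.keys", "id")
    .save()
)
```

On Databricks, the hard-coded password would instead come from a secret scope (e.g. `dbutils.secrets.get(...)`), as the entry's mention of Databricks secrets suggests.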
Use when building Apache Spark applications, distributed data processing pipelines, or optimizing big data workloads. Invoke for DataFrame API, Spark SQL, RDD operations, performance tuning, streaming analytics.
Apache Spark distributed computing. Use for big data processing.
Scala 3.4+ development specialist covering Akka, Cats Effect, ZIO, and Spark patterns. Use when building distributed systems, big data pipelines, or functional programming applications.
Expert data engineering covering data pipelines, ETL/ELT, data warehousing, streaming, and data quality.
Query a running Apache Spark History Server from Copilot CLI. Use this whenever the user wants to inspect SHS applications, jobs, stages, executors, SQL executions, environment details, or event logs, especially when they mention Spark History Server, SHS, event log history, benchmark runs, or application IDs.
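The queries this entry describes go through the History Server's REST API (`/api/v1/applications` and its sub-resources, documented in Spark's monitoring guide). A minimal stdlib sketch, assuming an SHS on its default port 18080; the host, port, and query parameters are illustrative, and the live calls are wrapped so the script degrades gracefully when no server is reachable:

```python
# Sketch: querying a Spark History Server's REST API with the Python
# stdlib. localhost:18080 is the default SHS port, assumed here.
import json
import urllib.error
import urllib.request

SHS = "http://localhost:18080"

def endpoint(path: str) -> str:
    """Build a full History Server REST URL from a relative API path."""
    return f"{SHS}/api/v1/{path}"

def shs_get(path: str):
    """GET a History Server endpoint and parse the JSON response."""
    with urllib.request.urlopen(endpoint(path)) as resp:
        return json.load(resp)

try:
    # List recently completed applications, then drill into one app's jobs.
    apps = shs_get("applications?status=completed&limit=10")
    for app in apps:
        print(app["id"], app["name"])
    if apps:
        jobs = shs_get(f"applications/{apps[0]['id']}/jobs")
        print(f"{len(jobs)} jobs in {apps[0]['id']}")
except urllib.error.URLError:
    print("No History Server reachable at", SHS)
```

The same pattern extends to the other resources the entry lists: stages, executors, SQL executions, and environment details are each a sub-path under `applications/{app-id}/`.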
Apache Spark, Hadoop, distributed computing, and large-scale data processing for petabyte-scale workloads
Data pipeline expert for ETL, Apache Spark, Airflow, dbt, and data quality
Data engineering patterns for ETL pipelines, data warehousing, Apache Spark, and data quality validation