Search Results: checkpointing

Found 16 Skills

DevOps & Cloud Services | microsoft/agent-skills

azure-eventhub-py

Azure Event Hubs SDK for Python streaming. Use for high-throughput event ingestion, producers, consumers, and checkpointing. Triggers: "event hubs", "EventHubProducerClient", "EventHubConsumerClient", "streaming", "partitions".


1 script/Checked

AI & Machine Learning | existential-birds/beagle

langgraph-code-review

Reviews LangGraph code for bugs, anti-patterns, and improvements. Use when reviewing code that uses StateGraph, nodes, edges, checkpointing, or other LangGraph features. Catches common mistakes in state management, graph structure, and async patterns.


AI & Machine Learning | existential-birds/beagle

langgraph-implementation

Implements stateful agent graphs using LangGraph. Use when building graphs, adding nodes/edges, defining state schemas, implementing checkpointing, handling interrupts, or creating multi-agent systems with LangGraph.


Backend Development | tursodatabase/turso

transaction-correctness

How WAL mechanics, checkpointing, concurrency rules, and recovery work in tursodb.


AI & Machine Learning | kiterlin/intelligent-dete...

pytorch-fsdp2

Adds PyTorch FSDP2 (fully_shard) to training scripts with correct init, sharding, mixed precision/offload config, and distributed checkpointing. Use when models exceed single-GPU memory or when you need DTensor-based sharding with DeviceMesh.


AI & Machine Learning | yonatangross/orchestkit

langgraph

LangGraph workflow patterns for state management, routing, parallel execution, supervisor-worker, tool calling, checkpointing, human-in-the-loop, streaming, subgraphs, and functional API. Use when building LangGraph pipelines, multi-agent systems, or AI workflows.


AI & Machine Learning | yonatangross/orchestkit

langgraph-checkpoints

LangGraph checkpointing and persistence. Use when implementing fault-tolerant workflows, resuming interrupted executions, debugging with state history, or avoiding re-running expensive operations.


AI & Machine Learning | cexll/myclaude

harness

This skill should be used for multi-session autonomous agent work requiring progress checkpointing, failure recovery, and task dependency management. Triggers on '/harness' command, or when a task involves many subtasks needing progress persistence, sleep/resume cycles across context windows, recovery from mid-task failures with partial state, or distributed work across multiple agent sessions. Synthesized from Anthropic and OpenAI engineering practices for long-running agents.


10 scripts/Attention

Documentation & Writing | lingzhi227/claude-skills

paper-assembly

Orchestrate the full paper pipeline end-to-end. Manage state propagation between phases (literature → plan → code → experiments → figures → tables → writing → review), support checkpointing and resumption. Use for assembling a complete paper from components.


1 script/Checked

AI & Machine Learning | mathews-tom/praxis-skills

gpu-optimizer

Expert GPU optimization for modern consumer GPUs (8-24GB VRAM). Use this skill when you need to optimize GPU training, speed up CUDA code, reduce OOM errors, tune XGBoost for GPU, migrate NumPy to CuPy, make a model faster, manage GPU memory, optimize VRAM usage, or benchmark PyTorch. Covers mixed precision, gradient checkpointing, XGBoost GPU acceleration, CuPy/cuDF migration, vectorization, torch.compile, and diagnostics. NVIDIA GPUs only. PyTorch, XGBoost, and RAPIDS frameworks.


Data Processing | absolutelyskilled/absolut...

real-time-streaming

Use this skill when building real-time data pipelines, stream processing jobs, or change data capture systems. Triggers on tasks involving Apache Kafka (producers, consumers, topics, partitions, consumer groups, Connect, Streams), Apache Flink (DataStream API, windowing, checkpointing, stateful processing), event sourcing implementations, CDC with Debezium, stream processing patterns (windowing, watermarks, exactly-once semantics), and any pipeline that processes unbounded data in motion rather than data at rest.


AI & Machine Learning | bobmatnyc/claude-mpm-skil...

langgraph

LangGraph framework for building stateful, multi-agent AI applications with cyclical workflows, human-in-the-loop patterns, and persistent checkpointing.
