Browse Claude Code Skills — Search, Filter & Download Free

Adapting Transfer Learning Models

v1.0.0

|

Jeremy Longshore

3

Adk Agent Builder

v1.0.0

Build production-ready AI agents using Google's Agent Development Kit with Claude integration, React patterns, multi-agent orchestration, and comprehensive tool libraries

Jeremy Longshore

3

Adk Deployment Specialist

v1.0.0

|

Jeremy Longshore

5

Adk Engineer

v1.0.0

|

Jeremy Longshore

4

Fine Tuning With Trl

v1.0.0

Fine-tune LLMs using reinforcement learning with TRL - SFT for instruction tuning, DPO for preference alignment, PPO/GRPO for reward optimization, and reward model training. Use when need RLHF, align model with preferences, or train from human feedback. Works with HuggingFace Transformers.

Orchestra Research

2

Unsloth

v1.0.0

Expert guidance for fast fine-tuning with Unsloth - 2-5x faster training, 50-80% less memory, LoRA/QLoRA optimization

Orchestra Research

5

Verl Rl Training

v1.0.0

Provides guidance for training LLMs with reinforcement learning using verl (Volcano Engine RL). Use when implementing RLHF, GRPO, PPO, or other RL algorithms for LLM post-training at scale with flexible infrastructure backends.

Orchestra Research

4

Serving Llms Vllm

v1.0.0

Serves LLMs with high throughput using vLLM's PagedAttention and continuous batching. Use when deploying production LLM APIs, optimizing inference latency/throughput, or serving models with limited GPU memory. Supports OpenAI-compatible endpoints, quantization (GPTQ/AWQ/FP8), and tensor parallelism.

Orchestra Research

4

Whisper

v1.0.0

OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M params). Use for speech-to-text, podcast transcription, or multilingual audio processing. Best for robust, multilingual ASR.

Orchestra Research

4

Long Context

v1.0.0

Extend context windows of transformer models using RoPE, YaRN, ALiBi, and position interpolation techniques. Use when processing long documents (32k-128k+ tokens), extending pre-trained models beyond original context limits, or implementing efficient positional encodings. Covers rotary embeddings, attention biases, interpolation methods, and extrapolation strategies for LLMs.

Orchestra Research

7

Miles Rl Training

v1.0.0

Provides guidance for enterprise-grade RL training using miles, a production-ready fork of slime. Use when training large MoE models with FP8/INT4, needing train-inference alignment, or requiring speculative RL for maximum throughput.

Orchestra Research

4

Model Merging

v1.0.0

Merge multiple fine-tuned models using mergekit to combine capabilities without retraining. Use when creating specialized models by blending domain-specific expertise (math + coding + chat), improving performance beyond single models, or experimenting rapidly with model variants. Covers SLERP, TIES-Merging, DARE, Task Arithmetic, linear merging, and production deployment strategies.

Orchestra Research

5

Browse Skills

Adapting Transfer Learning Models

Adk Agent Builder

Adk Deployment Specialist

Adk Engineer

Fine Tuning With Trl

Unsloth

Verl Rl Training

Serving Llms Vllm

Whisper

Long Context

Miles Rl Training

Model Merging