123 тысяч подписчиков
98 видео
LLM Security 101: Jailbreaks, Prompt Injection Attacks, and Building Guards
Fine tune and Serve Faster Whisper Turbo
Running Llama 2 on Windows or Mac with an Intel chip
Run Code Llama on a Mac with an M1 Chip
Run Speech-to-Speech Models on Mac or GPU
Train an LLM to Self-Correct with Verifiable Backtracking
Why use Keyword vs Vector Search?
Full Fine tuning with Fewer GPUs - Galore, Optimizer Tricks, Adafactor
Data Extraction with Large Language Models
Fine-tune Multi-modal Video + Text Models
Embeddings vs Fine Tuning - Part 3: Unsupervised Fine tuning
Mistral Large vs GPT4 - Practical Benchmarking!
How to use LLMs for Fact Checking
CONTEXT CACHING for Faster and Cheaper Inference
Fine-tuning Language Models for Structured Responses with QLoRa
Fine tuning LLMs for Memorization
Tiny Text + Vision Models - Fine tuning and API Setup
Fine-tune Text to Speech Models in 2025: CSM-1B and Orpheus TTS
How does MCP work? How to use MCP?
Reasoning Models and Chinese Models
Distillation of Transformer Models
The Best LLM? Google vs OpenAI, Anthropic and DeepSeek
What LLM should I use for my application?
Advanced Data Prep and Visualisation Techniques for Fine-tuning LLMs
ARC Prize: A Guide to DSL, LLM-Guided & Test-time Training Approaches
Qwen3 Inference and MCP Agents
A Simple Postgres Logger for OpenAI Endpoints - Open Source
How to Fine-tune Florence 2: The Best Small Vision Model
How to Build an Inference Service
Fine-tune Multi-modal LLaVA Vision and Language Models
TPU vs GPU
How to Serve a Text to Speech Model with vLLM
Create a Python Sandbox for Agents to Run Code
Fine tuning Whisper for Speech Transcription
Train an ACT Policy for the SO-101 Robot with LeRobot
Fine-tune GPT-3.5-TURBO - A Crash Course
Fine tuning with Custom Compute Metrics