0:54
YouTube
IndividualKex
Speculative Decoding explained
written version: https://www.adaptive-ml.com/post/speculative-decoding-visualized
5K views
3 months ago
7:40
Speculative Decoding: 3× Faster LLM Inference with Zero Quality Loss
YouTube
Tales Of Tensors
709 views
4 months ago
24:17
Fast Inference from Transformers via Speculative Decoding
YouTube
Arxiv Papers
1.3K views
Sep 12, 2023
0:18
Speculative Decoding for Faster LLMs
YouTube
Zaharah
151 views
4 months ago
Top videos
14:37
Understanding Speculative Decoding: Boosting LLM Efficiency and Speed
YouTube
MLWorks
469 views
Apr 6, 2025
12:46
Speculative Decoding: When Two LLMs are Faster than One
YouTube
Efficient NLP
32.9K views
Oct 12, 2023
16:58
[IDSL Seminar'26] EdgeSD: Efficient Speculative Decoding with Vision-Decoding Disaggregation
YouTube
IDSL
1 day ago
19:54
Behind the Stack, Ep. 13 - Faster Inference: Speculative Decoding for Batched Workloads
YouTube
Doubleword
81 views
5 months ago
12:18
This Simple Trick Made ALL LLMs 2x Faster
YouTube
bycloud
41K views
1 month ago
6:18
What is Speculative Sampling? | Boosting LLM inference speed
YouTube
AssemblyAI
4K views
Nov 20, 2024
2:42
AI Explained: Speculative decoding with vLLM
1.1K views
2 months ago
YouTube
Red Hat
1:05
What is Speculative decoding - Speculative decoding Explained #generativeai #RAG #ai #llm
309 views
1 month ago
YouTube
Med Bou | AI Tutorials
7:06
The Secret to Faster LLMs: How Speculative Decoding Works
7 views
5 months ago
YouTube
Zaharah
2:48
Speculative Decoding in 2026: What Changed
2 days ago
YouTube
Standarity
12:42
[Introduction to Generative AI 2024] Lecture 16: A Magic Plug-in That Can Speed Up Generation for Any Language Model: Speculative Decoding
39.5K views
May 18, 2024
YouTube
Hung-yi Lee
9:39
Faster LLMs: Accelerate Inference with Speculative Decoding
22.1K views
11 months ago
YouTube
IBM Technology
22:36
MASSIVELY speed up local AI models with Speculative Decoding in LM Studio
19.8K views
Mar 5, 2025
YouTube
GosuCoder
0:54
speculative decoding explained
10.4K views
3 months ago
YouTube
IndividualKex
8:44
How to PROPERLY Use Speculative Decoding in LM Studio to DOUBLE Your AI Speed
1.9K views
3 months ago
YouTube
AsapGuide
7:08
Speculative Decoding at Scale: Architecture and Orchestration Explained | Uplatz
13 views
2 months ago
YouTube
Uplatz
40:19
Speculation is all you need: Intro to Speculative Decoding for High Performance Inference
1 view
2 months ago
YouTube
Modal
1:50
Unleashing DFlash A Game Changer in Speculative Decoding! Full Review
3 views
3 days ago
YouTube
Simple Tech Lab
17:56
Behind the Stack, Ep 11 - Speculative Decoding
70 views
6 months ago
YouTube
Doubleword
13:21
LM Studio up to 300% faster thanks to speculative decoding!
2.5K views
9 months ago
YouTube
CodeRocks & Apprendre
7:00
Speculative Decoding with OpenVINO | Intel Software
197K views
10 months ago
YouTube
Intel Devs
12:45
Speculative Decoding & Inference Speed — 2-3x Faster LLMs With Zero Quality Loss
4 days ago
YouTube
Jeff Heidelberger
19:08
Speculative Speculative Decoding (Mar 2026)
43 views
2 months ago
YouTube
AI Paper Slop
7:09
Don't use speculative decoding until you watch this
7 views
2 weeks ago
YouTube
DigitalOcean
1:23
Speculative Speculative Decoding for Faster LLM Inference
2.1K views
2 months ago
YouTube
Rajistics - data science, AI, and machine learning
0:46
Speculative Decoding Turbocharge Your LLM Inference! #ai, #llm, #inference, #optimization
67 views
3 months ago
YouTube
The Code Architect
15:15
How to make LLMs fast: KV Caching, Speculative Decoding, and Multi-Query Attention | Cursor Team
13.1K views
Oct 9, 2024
YouTube
Lex Clips
1:03:22
BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms
137 views
8 months ago
YouTube
Centre for Networked Intelligence, IISc