Robust Direct Preference Optimization - Search Videos

Direct Preference Optimization (DPO) explained: Bradley-Terry model, log probabilities, math

Direct Preference Optimization (DPO) explained: Bradley-Terry model, log p…

36K viewsApr 14, 2024

YouTubeUmar Jamil

Direct Preference Optimization (DPO): Your Language Model is Secretly a Reward Model Explained

Direct Preference Optimization (DPO): Your Language Model is Secretly a R…

19.4K viewsAug 10, 2023

YouTubeGabriel Mongaras

Direct Preference Optimization (DPO) - How to fine-tune LLMs directly without reinforcement learning

Direct Preference Optimization (DPO) - How to fine-tune LLMs directly witho…

33.4K viewsJun 21, 2024

YouTubeLuis Serrano Academy

W12L53: Direct Preference Optimization (DPO)

W12L53: Direct Preference Optimization (DPO)

1.3K views9 months ago

YouTubeIIT Madras - B.S. Degree Programme

Direct Preference Optimization (DPO) | Paper Explained

Direct Preference Optimization (DPO) | Paper Explained

2.1K views5 months ago

Direct Preference Optimization (DPO) Explained | Train AI with Human Feedback

Direct Preference Optimization (DPO) Explained | Train AI with Human Feed…

4 views1 month ago

YouTubeTech Pulse Labs

Direct Preference Optimization Math

Direct Preference Optimization Math

74 views1 month ago

YouTubeLEARNSECTOR

Find in video from 06:49Conclusion and Future Directions

Direct Preference Optimization: Your Language Model is Secretly a Rewar…

40.5K viewsDec 22, 2023

YouTubeAI Coffee Break with Letitia

Direct Preference Optimization (DPO) in 1 hour

2.8K views8 months ago

YouTubeZachary Huang

Hands-on 10: Large Language Model Alignment with Direct Preference Opt…

3.8K views10 months ago

YouTubeBrainOmega

Direct Preference Optimization (DPO) explained + OpenAI Fine-tuning exam…

831 viewsDec 26, 2024

YouTubeSimeon Emanuilov

Lecture 40 : Aligning to User Preferences via Direct Preference Op…

467 views8 months ago

YouTubeNPTEL IIT Kharagpur

nlPUG Reading Group (April 2025) - Direct Preference Optimization

14 views2 months ago

Direct Preference Optimization (DPO) Explained: AI Alignment

13 views5 months ago

YouTubeVLR Software Training

Why Direct Preference Optimization ! Your LLM is Secretly a Reward Model…

857 views1 month ago

YouTubeTamil AI Hub

Direct Preference Optimization

820 viewsApr 9, 2024

YouTubeData Science Gems

Aligning to User Preferences via Direct Preference Optimization #swayampr…

YouTubeCH 19: IIT BOMBAY 03: Electrical Engineering

Aligning LLMs with Human Preferences

9 views3 months ago

YouTubeThe AI Opus

LLMs | Alignment of Language Models: Contrastive Learning | Lec 1…

1.7K viewsSep 26, 2024

AI Model Secrets: DPO, RLHF, and Model Merging Explained! #shorts

67 views6 months ago

YouTubeFranksWorld of AI

Deep Learning in Robust Optimization

980 viewsSep 20, 2024

YouTubeMixed Integer Programming

Rubrics as Rewards: A Technical Guide to DPO, RaR, RLVR, GPRO and …

148 views2 months ago

YouTubeByte Goose AI.

Fine-tuning and distillation with Azure AI Foundry | BRK150

6K views1 year ago

YouTubeMicrosoft Developer

Lec 10 | Reinforcement Learning from Human Feedback: Part 04

363 views7 months ago

Find in video from 07:02Direct Preference Optimization (DPO)

RLHF Explained (and DPO!)

18K viewsJun 12, 2024

YouTubeMark Hennings

Optimization Masterclass - Robust Approximation (Stochastic vs Worst-…

518 viewsMay 14, 2025

YouTubeProf Gio | Giordano Scarciotti

Holistic Robust Distributionally Robust Optimization - MIT ORC seminar 2023

1.2K viewsJun 21, 2023

YouTubeAmine Bennouna

How does DPO improve the LLM's performance? | Simple Explanation

213 viewsJan 29, 2025

Robust LLM Fine-Tuning: Rethinking Bias in LLM Alignment

180 viewsNov 13, 2024

YouTubeMachine Learning and AI Academy

6기 논문 리뷰 📎 DPO(2024.06) Direct Preference Optimization: Your Langu…

60 views7 months ago

YouTubeKMU X:AI

See more videos