All
Search
Images
Videos
Shorts
Maps
News
More
Shopping
Flights
Travel
Notebook
Report an inappropriate content
Please select one of the options below.
Not Relevant
Offensive
Adult
Child Sexual Abuse
Length
All
Short (less than 5 minutes)
Medium (5-20 minutes)
Long (more than 20 minutes)
Date
All
Past 24 hours
Past week
Past month
Past year
Resolution
All
Lower than 360p
360p or higher
480p or higher
720p or higher
1080p or higher
Source
All
Dailymotion
Vimeo
Metacafe
Hulu
VEVO
Myspace
MTV
CBS
Fox
CNN
MSN
Price
All
Free
Paid
Clear filters
SafeSearch:
Moderate
Strict
Moderate (default)
Off
Filter
theaisummer.com
Vision Language models: towards multi-modal deep learning | AI Summer
A review of state of the art vision-language models such as CLIP, DALLE, ALIGN and SimVL
Mar 3, 2022
Vision-Language Models for Vision Tasks: A Survey Vision-Language Models Tutorial
0:51
PTE January Prediction File is LIVE now! 🙌 Let's start the new year with motivation to crack your PTE exam in 2026! 🎯 Refer to VLE's PTE prediction file to practice the questions, which have the most chance of appearing in the exam! 💁♀️ Sign up and send us your email address to get the VIP access now! ✅🔓 #pte #ptepreparation #ptespeaking #ptewriting #ptetipsandtricks #ptetraining #vle #englishtest #studyinaustralia #pteaustralia #studyabroad #ptetest #ptemock #successstories #pteresult
TikTok
visionlanguageexperts
4.8K views
2 weeks ago
STOP Using Vision Language Models Until You Watch This | Community of Research and Development CRD
linkedin.com
3 months ago
0:16
LLMs are AI models, but not all AI models are LLMs 👀 Here are 8 specialized architectures pushing AI beyond text: 1️⃣ LCMs – concept-level (Meta SONAR) 2️⃣ VLMs – vision language 3️⃣ SLMs – small, fast edge models 4️⃣ MoE – efficient mixture of experts 5️⃣ MLMs – the OG masked models 6️⃣ LAMs – action-taking models (do tasks) 7️⃣ SAMs – pixel-level segmentation 8️⃣ LLMs – text reasoning Each is built for a purpose: speed, size, or multimodality. | Lead Gen Man
Facebook
Lead Gen Man
74.2K views
2 months ago
Top videos
What Are Vision Language Models (VLMs)? | IBM
ibm.com
10 months ago
0:30
30K views · 458 reactions | We just released 3 million samples of high quality vision language model training dataset for use cases such as: optical character recognition (OCR), visual question answering (VQA) captioning 珞 Learn more: https://nvda.ws/45NWlxm Download: https://nvda.ws/4oyle7y | NVIDIA AI | Facebook
Facebook
NVIDIA AI
8.6K views
2 weeks ago
Tackling multiple tasks with a single visual language model
deepmind.google
Apr 28, 2022
Vision-Language Models for Vision Tasks: A Survey Vision-Language Pretraining Methods
1:03:33
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
Microsoft
May 4, 2020
1:20
Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation
Microsoft
Nov 27, 2018
0:12
In vision-and-language pretraining (VLP), objects can be used as anchor points to make aligning semantics between image-text pairs easier. Learn how Oscar, a novel VLP framework utilizing objects, sets new state of the art on six vision-and-language tasks: https://aka.ms/AA8flix | Microsoft Research
Facebook
Microsoft Research
22.5K views
May 15, 2020
What Are Vision Language Models (VLMs)? | IBM
10 months ago
ibm.com
0:30
30K views · 458 reactions | We just released 3 million samples of hig
…
8.6K views
2 weeks ago
Facebook
NVIDIA AI
Tackling multiple tasks with a single visual language model
Apr 28, 2022
deepmind.google
Vision-Language-Action Models and the Search for a Generalist Robot
…
1K views
4 months ago
substack.com
0:22
35K views · 931 reactions | Microsoft researchers have created VinVL—
…
35.8K views
3 weeks ago
Facebook
Microsoft Research
1:20
Reinforced Cross-Modal Matching and Self-Supervised Imitation Lear
…
Nov 27, 2018
Microsoft
Visual Language Intelligence and Edge AI 2.0 with NVIDIA Cosmos
…
May 3, 2024
nvidia.com
2:22
Introducing Vision Language World Model (VLWM): A foundational AI
…
33 views
4 months ago
linkedin.com
How do LLMs work with Vision AI? | OCR, Image & Video Analysis
Jun 2, 2023
Microsoft Blogs
Zachary-Cavanell
10:43
Mantis: A Versatile Vision-Language-Action Model with Dise
…
24 views
1 month ago
YouTube
AI Papers Podcast Daily
1:06
Large Language Models to Vision Language Models #artificialintellig
…
1.2K views
1 month ago
YouTube
yesotech
3:05:25
Build NanoVLM from scratch
4.7K views
1 month ago
YouTube
Vizuara
3:40
VisPlay: Self-Training Vision-Language Models
21 views
1 month ago
YouTube
AI Research Roundup
12:45
GLM-4.6V Is the Most Important Open-Source AI Release of 2025
333 views
1 month ago
YouTube
Arrotix
0:42
How Robots Really See? Introducing Open-Source, On-Devi
…
816 views
2 months ago
YouTube
Wish Lab
1:54
VLM AI Model Explained | Vision-Language Models Simplified for B
…
1 month ago
YouTube
Professor Rahul Jain
1:16:52
[AI프로그래밍 20강] Vision Language Models (VLMs) (시각-언어 모델)
139 views
2 months ago
YouTube
최용훈의 AI 강의실 (Prof. Choi’s AI Classroom)
4:31
OmniVLA: Omni-Modal Model for Robot Navigation
14 views
3 months ago
YouTube
AI Research Roundup
12:49
RynnVLA-002: A Unified Vision-Language-Action and World Mode
…
1 views
1 month ago
YouTube
AI Papers Slop
Use vision-language models to optimize object classification
10 months ago
esri.com
S1 E1: Approaching Visual Question Answering (VQA) - Vision Langua
…
10.6K views
Jul 22, 2022
YouTube
Donkey Stereotype by PrithiviDa
BRAVE: Broadening the visual encoding of vision-language mod
…
229 views
Sep 23, 2024
YouTube
Oğuzhan Fatih Kar
Learning to Prompt for Vision Language Models (Eng)
1.3K views
Aug 18, 2023
YouTube
UVLL : UNIST Vision&Learning Lab
Train a Small Language Model for Disease Symptoms | Step-by-Step
…
32.7K views
Dec 26, 2023
YouTube
AI Anytime
3:06
Visualizing and Verbalizing for Language Comprehension and Thi
…
61.6K views
Sep 23, 2014
YouTube
Gander Publishing
18:30
Cognition 2 5 Neuropsychology of Visual Perception
31.3K views
Mar 16, 2018
YouTube
Paul Merritt
14:13
How Language Shapes the Way We Think | Lera Boroditsky | TED
15.2M views
May 2, 2018
YouTube
TED
21:26
Vision Language Model VLM
20 views
1 month ago
YouTube
Reema Khan
15:08
Vision Language Models: Introduction and History
74 views
Dec 30, 2024
YouTube
IIT Madras - B.S. Degree Programme
See more videos
More like this
Feedback