
Share
Ego-Exo4D v2 from Meta AI expands its video learning capabilities with enhanced annotations, adding extensive manual labels and auto-generated ground truth data for superior multimodal perception research.
Meta AI, in collaboration with the Ego4D consortium, has released an updated version of their foundational dataset, Ego-Exo4D. This new release, Ego-Exo4D v2, is a significant leap forward in video learning and multimodal perception research. The dataset now includes nearly 1,300 hours of video capture across 5,035 videos, with 221 hours of egocentric footage.
Enhanced Annotations:
Segmentation Masks:
Expert Commentary Annotations:
For researchers and practitioners in computer vision and machine learning, Ego-Exo4D v2 offers several advantages:

Data Collection:
Annotation Process:
Dataset Structure:
Benchmarks:
Ego-Exo4D v2 can be applied to various research areas, including:
The release of Ego-Exo4D v2 marks a significant milestone in the field of video learning and multimodal perception. With its rich and diverse dataset, detailed annotations, and expert commentary, this resource is poised to drive innovation and advance research in computer vision and machine learning.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
18 December 2023
133 articles
Related Articles
Related Articles
More Stories