Zero-Shot Auto-Labeling: The End of Annotation for Computer Vision with Jason Corso - #735 1q1q1o

10/06/2025

Today, we're ed by Jason Corso, co-founder of Voxel51 and professor at the University of...

Today, we're ed by Jason Corso, co-founder of Voxel51 and professor at the University of Michigan, to explore automated labeling in computer vision. Jason introduces FiftyOne, an open-source platform for visualizing datasets, analyzing models, and improving data quality. We focus on Voxel51’s recent research report, “Zero-shot auto-labeling rivals human performance,” which demonstrates how zero-shot auto-labeling with foundation models can yield to significant cost and time savings compared to traditional human annotation. Jason explains how auto-labels, despite being "noisier" at lower confidence thresholds, can lead to better downstream model performance. We also cover Voxel51's "verified auto-labeling" approach, which utilizes a "stoplight" QA workflow (green, yellow, red light) to minimize human review. Finally, we discuss the challenges of handling decision boundary uncertainty and out-of-domain classes, the differences between synthetic data generation in vision and language domains, and the potential of agentic labeling.
The complete show notes for this episode can be found at https://twimlai.com/go/735.

Grokking, Generalization Collapse, and the Dynamics of Training Deep Neural Networks with Charles Martin - #734 9 días 01:25:37 Google I/O 2025 Special Edition - #733 16 días 26:37 RAG Risks: Why Retrieval-Augmented LLMs are Not Safer with Sebastian Gehrmann - #732 24 días 57:37 From Prompts to Policies: How RL Builds Better AI Agents with Mahesh Sathiamoorthy - #731 1 mes 01:01:53 How OpenAI Builds AI Agents That Think and Act with Josh Tobin - #730 1 mes 01:07:42 Ver más en APP Comentarios del episodio 626m1c