Listen to a podcast, please open Podcast Republic app. Available on Google Play Store and Apple App Store.
| Episode | Date |
|---|---|
|
MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos
|
Dec 07, 2024 |
|
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
|
Nov 12, 2024 |
|
D3RoMa: Disparity Diffusion-based Depth Sensing for Material-Agnostic Robotic Manipulation
|
Nov 11, 2024 |
|
Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers
|
Nov 09, 2024 |
|
HOVER: Versatile Neural Whole-Body Controller for Humanoid Robots
|
Nov 04, 2024 |
|
Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning
|
Nov 03, 2024 |
|
Local Policies Enable Zero-shot Long Horizon Manipulation
|
Nov 02, 2024 |
|
MENTOR: Mixture-of-Experts Network with Task-Oriented Perturbation for Visual Reinforcement Learning
|
Oct 30, 2024 |
|
SkillMimicGen: Automated Demonstration Generation for Efficient Skill Learning and Deployment
|
Oct 29, 2024 |
|
LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias
|
Oct 28, 2024 |
|
Dynamic 3D Gaussian Tracking for Graph-Based Neural Dynamics Modeling
|
Oct 27, 2024 |
|
SPIRE: Synergistic Planning, Imitation, and Reinforcement Learning for Long-Horizon Manipulation
|
Oct 25, 2024 |
|
VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
|
Oct 24, 2024 |
|
CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics
|
Oct 23, 2024 |
|
SynFlowNet: Design of Diverse and Novel Molecules with Synthesis Constraints
|
Oct 22, 2024 |
|
L3DG: Latent 3D Gaussian Diffusion
|
Oct 21, 2024 |
|
The Ingredients for Robotic Diffusion Transformers
|
Oct 20, 2024 |
|
Estimating Body and Hand Motion in an Ego-sensed World
|
Oct 19, 2024 |
|
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation
|
Oct 19, 2024 |
|
One Step Diffusion via Shortcut Models
|
Oct 19, 2024 |
|
Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning
|
Oct 18, 2024 |
|
CoTracker3: Simpler and Better Point Tracking by Pseudo-Labelling Real Videos
|
Oct 18, 2024 |
|
nGPT: Normalized Transformer with Representation Learning on the Hypersphere
|
Oct 18, 2024 |
|
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
|
Oct 18, 2024 |
|
Simplifying, Stabilizing and Scaling Continuous-Time Consistency Models
|
Oct 18, 2024 |