2024.12.18 - #20 - MASt3R-SLAM, DiTER++, CAT4D, pixelSplat, LiveScene, Talking to DINO, MV-DUSt3R, Diorama, MegaSaM, NaVILA #22

changh95 · 2024-12-12T06:42:06Z

Interesting papers

CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models

https://arxiv.org/pdf/2411.18613
https://cat-4d.github.io/
4D scene generation, multi-view video diffusion model, deformable 3D Gaussian

pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction

LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control

https://openreview.net/pdf/db46ca38beed8e31670315500fdc6d0bf0bf5757.pdf

Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation

Navigation World Models

https://www.amirbar.net/nwm/index.html

MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds

MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors

Diorama: Unleashing Zero-shot Single-view 3D Scene Modeling

MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos

https://arxiv.org/pdf/2412.04463
https://mega-sam.github.io/
deep visual SLAM framework
Check out the website for better examples

U-AMC · 2024-12-18T10:13:30Z

Interesting Paper

NaVILA
2-level navigation foundation model (mid-level action VLA + locomotion skills) leveraging massive offline datasets, including not only sim envs but also human touring videos

GeJosKXWgAABhf3.mp4

yudqIzpMedfYgYIZ.mp4

Industry (?)

Figure Robot, deliveries have started

n3Am6yBhShEqDfih.mp4

Chatter

About DiTer++

https://youtu.be/RJ_netgAOT8

changh95 changed the title ~~2024.12.18 - #20 -~~ 2024.12.18 - #20 - CAT4D, pixelSplat, LiveScene, Talking to DINO, Navigation World Models, MV-DUSt3R, Diorama, MegaSaM Dec 12, 2024

changh95 changed the title ~~2024.12.18 - #20 - CAT4D, pixelSplat, LiveScene, Talking to DINO, Navigation World Models, MV-DUSt3R, Diorama, MegaSaM~~ 2024.12.18 - #20 - CAT4D, pixelSplat, LiveScene, Talking to DINO, MASt3r-SLAM, MV-DUSt3R, Diorama, MegaSaM, NaVILA Dec 18, 2024

changh95 changed the title ~~2024.12.18 - #20 - CAT4D, pixelSplat, LiveScene, Talking to DINO, MASt3r-SLAM, MV-DUSt3R, Diorama, MegaSaM, NaVILA~~ 2024.12.18 - #20 - MASt3R-SLAM, DiTER++, CAT4D, pixelSplat, LiveScene, Talking to DINO, MV-DUSt3R, Diorama, MegaSaM, NaVILA Dec 18, 2024
