Vision Language Action Model

Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight

Mantis is a versatile vision-language-action model that empowers robots to perform complex manipulation tasks through innovative disentangled visual foresight, progressive training, and adaptive temporal integration mechanisms. The key features of Mantis i

1 PowerPoint Slides Skill: Create visually rich PowerPoint (.pptx) presentations with native OMML math, LaTeX formulas, and Graphviz/Mermaid/TikZ diagrams

2 FinClaw: The First Open-Source Finance-Dedicated Lobster with 1000+ Financial Skills Fully Free

3 PPT Agent: A code-driven presentation generation framework

4 NanoResearch: Autonomous AI Research Assistant

5 CitationClaw: A Lightweight Engine for Discovering Scientific Impact through Citations

6 pi-autoresearch: Autonomous experiment loop extension for pi

7 OpenClaw Control Center: safety-first local control center for OpenClaw

8 OpenClaw Dashboard: A lightweight web dashboard for viewing all your OpenClaw Bots/Agents/Models/Sessions status at a glance

9 PaperBanana-CN: an academic paper illustration generation tool

10 996.ICU: Work by '996', sick in ICU

Favorites

1 Agent Tools: CLI tools for coding agents