Embodied intelligence, world models, and scientific clarity

I study how agents form internal worlds before they act in the physical one.

I'm Junhua Yao, a second-year computer science undergraduate exploring embodied intelligence, world models, spatial reasoning, and robust reinforcement learning for robotics.

学术精神的本质恰恰是恢复人类的天真。

For me, science begins with recovering childlike wonder.

  • Intern Researcher, TEA Lab, Tsinghua University
  • Building AI101, an open-access AI curriculum
  • Based in Guangzhou, collaborating through Beijing
  • Scholar citations: loading

Intro

Research that tries to see structure before scale

Questions that draw me are those where prediction alone feels insufficient. If an agent is going to act in the physical world, I want to know what structure it has learned, what assumptions it relies on, and how those assumptions survive contact with new bodies, tools, and environments.

Hence my focus on causality, world models, spatial intelligence, and data-efficient reinforcement learning. I care about systems that can imagine before acting, explain before overclaiming, and transfer across settings without treating every new task as a fresh start.

I also care deeply about communicating science in public. Teaching, writing, and building open materials are part of the research process for me, because clarity is not decoration. It is how we test whether an idea actually holds together.

Research

Four threads I keep returning to

Each thread starts from a practical robotics question, then moves downward into representation, abstraction, and transfer.

01. Active question

Causal structure for embodied agents

I want agents to distinguish mere correlation from genuine causal influence on outcomes.

causal representation learning, intervention-driven reasoning, physical understanding

  • Why it matters: better robustness, sharper explanations, and fewer brittle shortcuts.
  • Representative signals: Causal-PIK and recent causal world-model work aimed at physical reasoning.

02. Reading + prototyping

World models for decisions, not just next-step prediction

Prediction becomes useful to me when a model helps choose, evaluate, and revise actions under uncertainty.

decision-grounded world models, latent imagination, counterfactual planning

  • Why it matters: an agent should imagine alternatives before paying the price in the real world.
  • Representative signals: Dreamer-style imagination, JEPA-style abstraction, and evaluation work such as WorldGym.

03. Long-term direction

Spatial intelligence across bodies and scenes

I am interested in how geometric understanding and scene reasoning support manipulation, navigation, and transfer across embodiments.

3D reasoning, scene representations, spatial cognition

  • Why it matters: a capable robot should reason over space, not only over pixels or language tokens.
  • Representative signals: recent work on 4D spatial intelligence and multi-image spatial reasoning benchmarks.

04. Benchmarking focus

Data-efficient and safe RL for robotics

I care about policies that learn with fewer real-world trials and fail more gracefully when the world shifts.

model-based RL, safe exploration, sim-to-real reliability

  • Why it matters: real systems are expensive, risky, and too slow for wasteful training loops.
  • Representative directions: learned environments for control, action-constrained RL, and policy evaluation before deployment.

News

Recent signals

  1. 2025.06

    Joined TEA Lab at Tsinghua University as an intern researcher.

  2. 2025.03

    Launched AI101, an open-access AI curriculum for secondary education.

  3. 2025.02

    Completed my first surfing experience and earned the L2 certification.

  4. 2024.12

    Attended the AI Computing Technical Committee Forum at HKUST(GZ).

  5. 2024.11

    Spent the month around AI4S and embodied-intelligence forums in Shenzhen.

Outputs

Publications & Work in Progress

I do not have formal papers yet. Instead of hiding an empty list, I show the questions, prototypes, and public-facing artifacts that already shape the work.

Right now the site is a record of problem selection, reading, prototyping, and teaching. The publication list will come later. The intellectual trail should not have to wait.

Research agenda

Problem framing

Causal world models for embodied interaction

Mapping what an agent should preserve when it moves from passive prediction to intervention, planning, and explanation.

Surveying + prototyping

Spatial reasoning for manipulation and navigation

Studying how geometric representations, scene memory, and body-dependent constraints combine in long-horizon tasks.

Implementation track

Data-efficient RL for real systems

Looking for training and evaluation setups that respect safety, sample cost, and sim-to-real mismatch from the start.

Open curriculum

Launched March 2025

AI101: Pioneering AI Education for Teens

An open-access AI curriculum for secondary education, designed to make frontier ideas legible without flattening them into slogans.

Talk and slides

Public teaching material

World Models: When Machines Begin to Imagine

An accessible slide deck on world models, written for students and curious readers who want both intuition and technical grounding.

Resources

Selected materials for students and curious readers

I use public materials to translate research questions into something teachable. The homepage keeps the highlights; the resources page collects the fuller set.

Jottings

Notes, essays, and research sketches

I use short essays to think in public and keep a record of questions before they harden into positions.

2026-03-12

Diffusion Policy + RL: An Underrated Insight

Something that shouldn’t have worked… actually did? DPPO shows dramatically better sample efficiency across multiple benchmarks, and the training is stable. That’s surprising. Diffusion likelihoods are intractable; standard Policy Gradient shouldn’t work...

Experience

Where I am learning in public

TEA Lab logo

Jun. 2025 - Present, Beijing, China

Intern Researcher

TEA Lab, Tsinghua University

Working with Prof. Huazhe Xu.

Working close to embodied-intelligence research and learning how ambitious ideas survive contact with real systems, real constraints, and honest evaluation.

I

2024 - Present, Online

Writer and curriculum builder

Independent writing and teaching

Building notes, talks, and public learning materials that make AI more legible without diluting its depth.