Embodied intelligence, world models, and scientific clarity

I study how agents form internal worlds before they act in the physical one.

I'm Junhua Yao, a second-year computer science undergraduate exploring embodied intelligence, world models, spatial reasoning, and robust reinforcement learning for robotics.

学术精神的本质恰恰是恢复人类的天真。

For me, science begins with recovering childlike wonder.

Email me Google Scholar Browse resources

Intern Researcher, TEA Lab, Tsinghua University
Building AI101, an open-access AI curriculum
Based in Guangzhou, collaborating through Beijing
Scholar citations: loading

Current mode

Surfing, sometimes literally.

Intro

Research that tries to see structure before scale

Questions that draw me are those where prediction alone feels insufficient. If an agent is going to act in the physical world, I want to know what structure it has learned, what assumptions it relies on, and how those assumptions survive contact with new bodies, tools, and environments.

Hence my focus on causality, world models, spatial intelligence, and data-efficient reinforcement learning. I care about systems that can imagine before acting, explain before overclaiming, and transfer across settings without treating every new task as a fresh start.

I also care deeply about communicating science in public. Teaching, writing, and building open materials are part of the research process for me, because clarity is not decoration. It is how we test whether an idea actually holds together.

Research

Four threads I keep returning to

Each thread starts from a practical robotics question, then moves downward into representation, abstraction, and transfer.

01. Active question

Causal structure for embodied agents

I want agents to distinguish mere correlation from genuine causal influence on outcomes.

causal representation learning, intervention-driven reasoning, physical understanding

Why it matters: better robustness, sharper explanations, and fewer brittle shortcuts.
Representative signals: Causal-PIK and recent causal world-model work aimed at physical reasoning.

02. Reading + prototyping

World models for decisions, not just next-step prediction

Prediction becomes useful to me when a model helps choose, evaluate, and revise actions under uncertainty.

decision-grounded world models, latent imagination, counterfactual planning

Why it matters: an agent should imagine alternatives before paying the price in the real world.
Representative signals: Dreamer-style imagination, JEPA-style abstraction, and evaluation work such as WorldGym.

03. Long-term direction

Spatial intelligence across bodies and scenes

I am interested in how geometric understanding and scene reasoning support manipulation, navigation, and transfer across embodiments.

3D reasoning, scene representations, spatial cognition

Why it matters: a capable robot should reason over space, not only over pixels or language tokens.
Representative signals: recent work on 4D spatial intelligence and multi-image spatial reasoning benchmarks.

04. Benchmarking focus

Data-efficient and safe RL for robotics

I care about policies that learn with fewer real-world trials and fail more gracefully when the world shifts.

model-based RL, safe exploration, sim-to-real reliability

Why it matters: real systems are expensive, risky, and too slow for wasteful training loops.
Representative directions: learned environments for control, action-constrained RL, and policy evaluation before deployment.

News

Recent signals

2025.06

Joined TEA Lab at Tsinghua University as an intern researcher.
2025.03

Launched AI101, an open-access AI curriculum for secondary education.
2025.02

Completed my first surfing experience and earned the L2 certification.
2024.12

Attended the AI Computing Technical Committee Forum at HKUST(GZ).
2024.11

Spent the month around AI4S and embodied-intelligence forums in Shenzhen.

Outputs

Publications & Work in Progress

I do not have formal papers yet. Instead of hiding an empty list, I show the questions, prototypes, and public-facing artifacts that already shape the work.

Right now the site is a record of problem selection, reading, prototyping, and teaching. The publication list will come later. The intellectual trail should not have to wait.

Research agenda

Problem framing

Causal world models for embodied interaction

Mapping what an agent should preserve when it moves from passive prediction to intervention, planning, and explanation.

Surveying + prototyping

Spatial reasoning for manipulation and navigation

Studying how geometric representations, scene memory, and body-dependent constraints combine in long-horizon tasks.

Implementation track

Data-efficient RL for real systems

Looking for training and evaluation setups that respect safety, sample cost, and sim-to-real mismatch from the start.

Open curriculum

Launched March 2025

AI101: Pioneering AI Education for Teens

An open-access AI curriculum for secondary education, designed to make frontier ideas legible without flattening them into slogans.

Project repository

Talk and slides

Public teaching material

World Models: When Machines Begin to Imagine

An accessible slide deck on world models, written for students and curious readers who want both intuition and technical grounding.

View slides See resources page

Writing series

Ongoing

World Model Musings

A series of essays where I test ideas in public before they harden into claims.

Read the latest jotting Browse all jottings

Resources

Selected materials for students and curious readers

I use public materials to translate research questions into something teachable. The homepage keeps the highlights; the resources page collects the fuller set.

Open-access curriculum

Secondary students and educators

Course project

AI101 curriculum

A structured AI curriculum built to give teenage learners a serious but welcoming entry point into the field.

Open repository

Teaching deck

Undergraduate students and AI enthusiasts

Slides

World models slide deck

A public deck that links the philosophical motivation of world models to their modern technical forms.

View slides

Research notes

Readers who like ideas in progress

Essays

Jottings archive

Short essays, reading notes, and argument sketches written while questions are still alive.

Browse archive

Jottings

Notes, essays, and research sketches

I use short essays to think in public and keep a record of questions before they harden into positions.

2026-03-12

Diffusion Policy + RL: An Underrated Insight

Something that shouldn’t have worked… actually did? DPPO shows dramatically better sample efficiency across multiple benchmarks, and the training is stable. That’s surprising. Diffusion likelihoods are intractable; standard Policy Gradient shouldn’t work...

2025-09-13

World Models Musing 1: On the Origins of Thought and the Three Great Schools

World Models: On the Origins of Thought and the Three Great Schools The Primal Impulse of Intelligence When we speak of “intelligence,” our minds often leap to images of Einstein scribbling on...

2025-05-08

My CS Learning and Workflow: Harnessing LLMs, Obsidian, and Zotero

In the fast-evolving field of Computer Science, continuous learning is the norm. Daily work and study expose us to information from diverse, fragmented sources: hundreds of pages of official documentation, concise technical...

View all jottings

Experience

Where I am learning in public

Jun. 2025 - Present, Beijing, China

Intern Researcher

TEA Lab, Tsinghua University

Working with Prof. Huazhe Xu.

Working close to embodied-intelligence research and learning how ambitious ideas survive contact with real systems, real constraints, and honest evaluation.

I

2024 - Present, Online

Writer and curriculum builder

Independent writing and teaching

Building notes, talks, and public learning materials that make AI more legible without diluting its depth.