Publications
World Model on Million-Length Video And Language With Blockwise RingAttention
Hao Liu*, Wilson Yan*, Matei Zaharia, Pieter Abbeel
International Conference on Learning Representations (ICLR), 2025
[paper'25, code, project, tl;dr]
ElasticTok: Adaptive Tokenization for Image and Video
Wilson Yan, Volodymyr Mnih, Aleksandra Faust, Matei Zaharia, Pieter Abbeel, Hao Liu
International Conference on Learning Representations (ICLR), 2025
[paper'25, code, project, tl;dr]
Self-Questioning Language Models
Lili Chen, Mihir Prabhudesai, Katerina Fragkiadaki, Hao Liu, Deepak Pathak
arXiv preprint, 2025
[paper'25, code, tl;dr]
Maximizing Confidence Alone Improves Reasoning
Mihir Prabhudesai, Lili Chen, Alex Ippoliti, Katerina Fragkiadaki, Hao Liu, Deepak Pathak
arXiv preprint, 2025
[paper'25, code, tl;dr]
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities
Gemini Team, Google
[technical report'25]
Ring Attention with Blockwise Transformers for Near-Infinite Context
Hao Liu, Matei Zaharia, Pieter Abbeel
International Conference on Learning Representations (ICLR), 2024
[paper'24, code, media, tl;dr]
Blockwise Parallel Transformer for Large Context Models
Hao Liu, Pieter Abbeel
Advances in Neural Information Processing Systems (NeurIPS) (Spotlight Presentation), 2023
[paper'23, code, tl;dr]
Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment
Hao Liu, Wilson Yan, Pieter Abbeel
Advances in Neural Information Processing Systems (NeurIPS), 2023
[paper'23, code, tl;dr]
Chain of Hindsight Aligns Language Models with Feedback
Hao Liu, Carmelo Sferrazza, Pieter Abbeel
International Conference on Learning Representations (ICLR), 2024
[paper'24, code, tl;dr]
Emergent Agentic Transformer from Chain of Hindsight Experience
Hao Liu, Pieter Abbeel
International Conference on Machine Learning (ICML), 2023
[paper'23, tl;dr]
Koala: A Dialogue Model for Academic Research
Xinyang Geng*, Arnav Gudibande*, Hao Liu*, Eric Wallace*, Pieter Abbeel†, Sergey Levine†, Dawn Song†
Blog, 2023
[blog'23]
OpenLLaMA: An Open Reproduction of LLaMA
Xinyang Geng*, Hao Liu*
GitHub, 2023
[project'23, tl;dr]
Masked Autoencoding for Scalable and Generalizable Decision Making
Fangchen Liu*, Hao Liu*, Aditya Grover, Pieter Abbeel
Advances in Neural Information Processing Systems (NeurIPS), 2022
[paper'22, code, tl;dr]
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning
Denis Yarats*, David Brandfonbrener*, Hao Liu, Michael Laskin, Pieter Abbeel, Alessandro Lazaric, Lerrel Pinto
arXiv preprint, 2022
[paper'22, code, tl;dr]
Multimodal Masked Autoencoders Learn Transferable Representations
Xinyang Geng*, Hao Liu*, Lisa Lee, Dale Schuurmans, Sergey Levine, Pieter Abbeel
ICML Pre-training Workshop (Oral Presentation), 2022
[paper'22, code, tl;dr]
Palm up: Playing in the Latent Manifold for Unsupervised Pretraining
Hao Liu, Tom Zahavy, Volodymyr Mnih, Satinder Singh
Advances in Neural Information Processing Systems (NeurIPS), 2022
[paper'22, tl;dr]
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
Michael Laskin, Hao Liu, Xue Bin Peng, Denis Yarats, Aravind Rajeswaran, Pieter Abbeel
Advances in Neural Information Processing Systems (NeurIPS), 2022
[paper'22, project]
URLB: Unsupervised Reinforcement Learning Benchmark
Michael Laskin, Denis Yarats, Hao Liu, Kimin Lee, Albert Zhan, Kevin Lu, Catherine Cang, Lerrel Pinto, Pieter Abbeel
NeurIPS Datasets and Benchmarks Track, 2021
[paper'21, code, tl;dr]
APS: Active Pre-Training with Successor Features
Hao Liu, Pieter Abbeel
International Conference on Machine Learning (ICML) (Long Oral Presentation), 2021
[paper'21, code]
Behavior From the Void: Unsupervised Active Pre-Training
Hao Liu, Pieter Abbeel
Advances in Neural Information Processing Systems (NeurIPS) (Spotlight Presentation), 2021
[paper'21, code, tl;dr]