'HY-World 2.0' has been released, which can generate 3D worlds for games, allowing users to create 3DCG worlds instead of videos and output them to Unity or UE5.



Tencent, a major Chinese technology company, has open-sourced 'HY-World 2.0,' a multimodal world model that can generate, reconstruct, and simulate interactive 3D worlds from text, images, and videos, as part of its '

Tencent HY ' family of general-purpose multimodal large- scale language models ( LLMs).

HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds
(PDF file) https://3d-models.hunyuan.tencent.com/world/world2_0/HY_World_2_0.pdf

HY-World 2.0 is a multimodal world model that can generate, reconstruct, and simulate interactive 3D worlds from text, images, and videos. Its output can be integrated into game engines and embedded simulation pipelines.

With a single click, you can automatically convert text and images into interactive 3D worlds. It can also export editable 3D worlds for game engines such as Unity and Unreal Engine. The exported 3D worlds include standard 3D export options (mesh, 3DGS, point cloud). An interactive character mode is supported, allowing you to explore the generated 3D world in real time.




The 3D world generated by inputting the image in the lower right corner of the screen is shown below.



The following is the 3D world generated by the prompt 'Generate a retro voxel-style room with a fireplace'.



Input three images to generate a bedroom.



It's also possible to change the atmosphere of the generated 3D world with the press of a button.



HY-World 2.0 includes 'HY-Pano 2.0,' which scales up the generation of panoramas for high-fidelity 3D worlds from a single image; 'WorldNav,' which enables consistent exploration while avoiding collisions by performing semantic understanding-based trajectory planning using VLM and navigation meshes ; 'WorldStereo 2.0,' which enables stable generation of new viewpoints while maintaining spatially consistent memory through keyframe-based world expansion in latent space; 'WorldMirror 2.0,' an integrated 3D reconstruction method that can generate accurate and navigable 3DGS assets by integrating predictions from multiple viewpoints; and 'WorldLens,' a high-performance engine-independent 3DGS renderer for interactive exploration with lighting and collision handling capabilities.




HY-World 2.0 is available for download from Hugging Face.

tencent/HY-World-2.0 · Hugging Face
https://huggingface.co/tencent/HY-World-2.0



It's also available on GitHub.

GitHub - Tencent-Hunyuan/HY-World-2.0: HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds · GitHub
https://github.com/Tencent-Hunyuan/HY-World-2.0



in AI,   Video,   Software,   Game, Posted by logu_ii