Sirui Xie1* | Huanang-Gao
Weihao Gu2 | Yuejia Xu4 | Jingwei Zhao3 | Kangliang Chen2 | Xiang He2 | Hao Zhao
1: Institute for AI Industry Research, Tsinghua University
2: Haomo.ai
3: Xiaomi
4 Beijing University of Civil Engineering and Architecture
StreetDreamer a pipeline facilitates the generation of infinite 3D autonomous driving scenes conditioned on a single image of a driving log sequence. Contrary to prior methodologies that learn to generate the distribution of 3D latent representations which are subse-quently decoded or rendered—approaches that incur significant computational expense—our method strategically divides the ill-posed problem of unbounded 3D street scene generation into two distinct phases: 3D-aware novel view generation and score-guided 3D reconstruction.
- Realistic street scene generation
- High diversity in generated environments
- Integration with simulation platforms for autonomous driving
- Customizable parameters for scene variations
To install the dependencies and set up the project, please follow the instructions below:
# Clone the repository
git clone https://github.com/your-username/StreetDreamer.git
cd StreetDreamer
# Install required Python packages
pip install -r requirements.txt
We follow LucidDreamer and borrowed some of their code. Follow the instruction here to install the requirements of LucidDreamer
Our Driving scene generation pipeline consists of two stages, tuning the ControlNet model and Scene Dreaming with a conditioning image.
For the first stage, we run the first script:
bash scripts/fine_tune_controlnet.sh
Remember to modify the parameters in this bash script, such as dataset directory and checkpointing directory.
Then we personalize our trained ControlNet model to a single sequence:
bash scripts/personalize-controlnet.sh
Still, don't forget to modify the data sequence directory and other parameters
After we get the personailzed checkpoints, we use them to generate scene with the following command:
bash scripts/dream.sh
We downloaded some sequences from Waymo-Open-Dataset to train the network. Follow the instruction below to download and pre-process the dataset
bash scripts/download_waymo.sh scripts/waymo_static_32.lst ./data/waymo_tfrecords/1.4.2
This will automatically make a folder that contains the dataset in .tfrecord
format. However, our dataset needs to be pre-processed to train our model:
bash scipts/pre_process.sh
This should get all preprocessing steps done and the dataset should be ready for training
We would like to thank Haomo for their support during the development.