Futuristic Car Prototype Image Generation

Overview

Our approach is based on the idea that upcoming new car designs will be similar to existing car designs in their basic structures so that designs don't lose their feasibility of implementation.

Firstly we have trained a SN-GAN on a simple car dataset with differential augmentation and LeCam regularizer. This is followed by fine-tuning of the trained GAN on a futuristic cars dataset using FreezeD technique. Further we have used Res-Net18 to extract latent codes of common and futuristic cars which are then combined using weighted average and fed into SN-GAN.

Weighted average gives us the flexibility to decide the intensity of futuristic flavor which can be added to existing designs to obtain novel ones.

Dataset Used

We use 2 datasets to train our models:

Simple Car Image Dataset: This dataset contains around 11,000 images of existing cars with a simple design, all the cars are facing left, front-left or front-right. This dataset was a subset of another dataset that was scraped from The Car Connection website and then further classified according to their view. The code to obtain this dataset was taken from following github repository: https://github.com/nicolas-gervais/predicting-car-price-from-scraped-data/tree/master/picture-scraper

Futuristic Concept Car Image Dataset: This dataset contains around 2000 images of concept cars with a futuristic design. All the images in this dataset were scraped from Google Images search results. The code for scraping was written completely by us.

SN-GAN

For steady and diverse image generation, we needed a GAN which could effectively absorb the design features of existing car models and successfully reproduce them in subsequent generations. Popular GANS architectures like DC-GAN are plagued by their training instability and inaccurate density ratio estimation by the discriminators. Therefore for our-case we used SN-GAN, a type of General Adversarial Network which uses a weight normalization method called spectral normalization to stabilize the training of discriminator networks. In image generation tasks, it has been verified that generated examples using SN-GAN are more diverse than the conventional weight normalization and achieve better or comparative inception scores relative to previous studies.

Also used LeCam Regularizer and Differential Augmentation technique to improve performance and stabilize learning dynamics.

We used StudioGAN 0.2.0 release to train the SNGAN model.
StudioGAN Repo: https://github.com/POSTECH-CVLab/PyTorch-StudioGAN

FreezeD Technique

FeezeD stands for “Freeze the Discriminator” which is a simple baseline for fine-tuning GANs. It's simply freezing the lower layers of the discriminator and only fine-tuning the upper layers performs surprisingly well. FreezeD splits the discriminator into a feature extractor and a classifier and then fine-tunes the classifier only. It’s proven to significantly outperform previous techniques used in GANs. The pre-trained part is frozen and only last layers are trained and how big the change is on the weights in a layer is governed by the learning rate.

Generating Latent Code

The train a ResNet model that can output the latent code given the desired output image. The training is done by producing a dataset using our trained GAN model, we input as many random latent codes as we want and store them along with their outputs ,then the inputs are used as outputs of our ResNet model and vice versa.

The output images of the latent codes predicted from such a ResNet are not accurate as these latent codes would only be an approximation of the real latent codes corresponding to the car images. To obtain the real latent codes corresponding to our desired car images, we pass the approximate latent codes into our GAN generator and obtain the inaccurate image and then take perceptual loss between this inaccurate image and desired image, we apply backpropagation on this perceptual loss with only latent code elements as trainable parameters.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
Car_Masking.ipynb		Car_Masking.ipynb
GAN_model.png		GAN_model.png
Generated_result.png		Generated_result.png
Kshitij-Datathon 22KTJDATV396127.pdf		Kshitij-Datathon 22KTJDATV396127.pdf
LatentCode_backPropagation.png		LatentCode_backPropagation.png
Latent_Code_ResNet.ipynb		Latent_Code_ResNet.ipynb
README.md		README.md
SNGAN_cars-train-2022_03_01_19_04_26.log		SNGAN_cars-train-2022_03_01_19_04_26.log
SNGAN_cars.json		SNGAN_cars.json
generated_canvas_None.png		generated_canvas_None.png
generated_canvas_best_fid.png		generated_canvas_best_fid.png
generated_canvas_current_fid.png		generated_canvas_current_fid.png
generated_canvas_latent_None.png		generated_canvas_latent_None.png
generated_canvas_latent_future.png		generated_canvas_latent_future.png
pipeline.png		pipeline.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Futuristic Car Prototype Image Generation

Overview

Dataset Used

SN-GAN

FreezeD Technique

Generating Latent Code

Results

About

Releases

Packages

Languages

vivekagarwal2349/KTJ-Datathon

Folders and files

Latest commit

History

Repository files navigation

Futuristic Car Prototype Image Generation

Overview

Dataset Used

SN-GAN

FreezeD Technique

Generating Latent Code

Results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages