Student at Telecom Paris - First-year final project
This project was developed for the LOW-POWER COMPUTER VISION CHALLENGE 2023, a competition focused on efficient and accurate understanding of disaster scenes using low-power edge devices. The main objective of the competition is to improve semantic segmentation on an embedded device (NVIDIA Jetson Nano 2GB Developer Kit) using PyTorch models. Participants develop models capable of automatically analyzing images captured by unmanned aerial vehicles (UAVs) in disaster-stricken areas.
UAVs equipped with inexpensive sensors provide imagery of disaster areas that are difficult for humans to access. However, their processing capability is limited by energy-constrained, low-compute hardware, which delays analysis and lengthens response times for relief operations. The competition aims to promote on-device computer vision on UAVs, addressing challenges related to power consumption and latency.
The main objective of this project is to develop efficient semantic segmentation models for automatically analyzing disaster scenes from UAV-captured images. We will use the NVIDIA Jetson Nano 2GB Developer Kit as the embedded device to run the developed models.
- Python 3.6
- PyTorch 1.11.1
- NVIDIA Jetson Nano 2GB Developer Kit
- JetPack SDK 4.6.3
- Clone this GitHub repository to your local machine:

```bash
git clone https://github.com/SunJacques/LPCVC-2023.git
```

- Install the necessary dependencies:

```bash
pip install -r requirements.txt
```

- Install JetPack on your NVIDIA Jetson Nano device to ensure you have TensorRT installed. You can follow the official NVIDIA JetPack installation guide for detailed instructions on how to install JetPack on your Jetson Nano. A quick environment check is sketched below.
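The snippet below is not part of the repository; it is only a quick sanity-check sketch to confirm that the installed PyTorch build can see the GPU (on a workstation or on the Jetson Nano).

```python
# Quick sanity check (not part of the repository): confirm PyTorch sees a CUDA device.
import torch

print("PyTorch version:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    # On the Jetson Nano 2GB this should report the integrated Maxwell GPU.
    print("Device:", torch.cuda.get_device_name(0))
```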
The dataset used in this competition consists of 1,700 samples of images collected by UAVs in disaster-affected areas. The dataset will be provided to the participants and should be placed in the dataset/ directory at the root of the project.
```
├── dataset
│   ├── train
│   │   ├── IMG
│   │   │   ├── train_0000.png
│   │   │   ├── train_0001.png
│   │   │   └── ...
│   │   └── GT
│   │       ├── train_0000.png
│   │       ├── train_0001.png
│   │       └── ...
│   └── val
│       ├── IMG
│       │   ├── val_0000.png
│       │   ├── val_0001.png
│       │   └── ...
│       └── GT
│           ├── val_0000.png
│           ├── val_0001.png
│           └── ...
├── README.md
├── train.py
└── ...
```

The training and validation data can be downloaded from here.
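The repository's actual data-loading code lives in train.py; the following is only a minimal sketch of how image/mask pairs laid out as above could be loaded with a PyTorch Dataset. The class name and the 128x128 resize are assumptions, not the project's exact implementation.

```python
import os

import numpy as np
import torch
from PIL import Image
from torch.utils.data import Dataset


class DisasterSegDataset(Dataset):
    """Hypothetical loader for the IMG/GT folder layout shown above."""

    def __init__(self, root, split="train", size=128):
        self.img_dir = os.path.join(root, split, "IMG")
        self.gt_dir = os.path.join(root, split, "GT")
        self.names = sorted(os.listdir(self.img_dir))
        self.size = size

    def __len__(self):
        return len(self.names)

    def __getitem__(self, idx):
        name = self.names[idx]
        img = Image.open(os.path.join(self.img_dir, name)).convert("RGB")
        mask = Image.open(os.path.join(self.gt_dir, name))  # class indices stored as pixel values
        img = img.resize((self.size, self.size), Image.BILINEAR)
        mask = mask.resize((self.size, self.size), Image.NEAREST)
        img = torch.from_numpy(np.array(img)).permute(2, 0, 1).float() / 255.0
        mask = torch.from_numpy(np.array(mask)).long()
        return img, mask
```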
To train the model, run the training script train.py:
```bash
python train.py --datapath dataset/ --epochs 100
```

The trained model will be saved in the checkpoints/ directory.
To evaluate the model, run the evaluation script evaluate.py:
```bash
python evaluate.py --modelpath checkpoint/model.pth --datapath dataset/
```

The evaluation results will be displayed on the terminal.
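evaluate.py reports a Dice score (also used in the results table further down). Its exact implementation is not reproduced here; the snippet below is only a rough sketch of a standard multi-class Dice computation, where the class count and smoothing constant are assumptions.

```python
import torch


def dice_score(pred_logits, target, num_classes=14, eps=1e-6):
    """Rough multi-class Dice sketch (num_classes=14 is an assumption, not the official count)."""
    pred = pred_logits.argmax(dim=1)  # (N, H, W) predicted class indices
    dices = []
    for c in range(num_classes):
        pred_c = (pred == c).float()
        target_c = (target == c).float()
        inter = (pred_c * target_c).sum()
        union = pred_c.sum() + target_c.sum()
        dices.append((2 * inter + eps) / (union + eps))
    return torch.stack(dices).mean()
```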
You can also evaluate the model on the NVIDIA Jetson Nano by using the evaluate_nano.py script:
```bash
python evaluate_nano.py --modelpath checkpoint/model.pth --datapath dataset/
```

Make sure you have installed JetPack on your Jetson Nano to ensure compatibility with TensorRT.
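evaluate_nano.py handles the Jetson-side inference; the exact conversion path is not shown in this README. One common way to obtain a TensorRT-optimized module from a PyTorch model on JetPack is NVIDIA's torch2trt package, sketched below. The checkpoint format, input resolution, and fp16 flag are assumptions.

```python
import torch
from torch2trt import torch2trt  # requires torch2trt installed on the JetPack system

# Assumption: the checkpoint stores a full model object; adapt if it is a state_dict.
model = torch.load("checkpoint/model.pth", map_location="cuda").eval().cuda()

# Dummy input matching the training resolution (256x256 here is an assumption).
x = torch.randn(1, 3, 256, 256).cuda()

# Convert to a TensorRT-optimized module; fp16_mode trades a little accuracy for speed.
model_trt = torch2trt(model, [x], fp16_mode=True)

with torch.no_grad():
    out = model_trt(x)  # same call signature as the original module
```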
- To submit your solution, you first have to package it:
  - solution.pyz: the zipped package of solution/. zipapp should be used to compress the package.
  - Recommended command, where solution is the name of your directory:

```bash
python3.6 -m zipapp solution -p='/usr/bin/env python3.6'
```
| Architecture | Backbone | Dice | Inference time on A100 (ms) | Inference time on Jetson Nano (ms) | Score |
|---|---|---|---|---|---|
| UNet (128, no augmentation) | / | 0.43 | 0.87 | / | / |
| UNet (128) | / | 0.52 | 0.85 | / | / |
| UNet (128) | ImageNet | 0.55 | 0.55 | / | / |
| UNet (256) | ImageNet | 0.6 | 0.65 | / | / |
| UNet (256) | MobileNetV2 | 0.65 | 0.94 | / | / |
| UNet (128) | EfficientNetB4 | 0.6 | 1.4 | / | / |
| FPN (128) | EfficientNetB3 | 0.63 | 0.8 | 36 | 18 |
| FPN (256) | EfficientNetB3 | 0.65 | 1.7 | / | / |
| FPN (256) | EfficientNetB0 | 0.63 | 1.1 | / | / |
| FPN (256) | MobileNetV2 | 0.65 | 0.8 | 50 | 13 |
| FastCNN (256) | / | 0.4 | 0.37 | / | / |
| PSPNet (256) | MobileNetV2 | 0.55 | 0.28 | 14 | 39 |
| PSPNet (256) | MobileNetV3 | 0.54 | 1 | / | / |
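The decoder/backbone combinations above (UNet, FPN, PSPNet with MobileNetV2 or EfficientNet encoders) match what the segmentation_models_pytorch library provides. Whether the project actually uses that library is an assumption, but a model such as the FPN + MobileNetV2 row could be built roughly like this (the class count is also an assumption):

```python
import torch
import segmentation_models_pytorch as smp

# Assumption: 14 output classes; replace with the dataset's actual class count.
NUM_CLASSES = 14

model = smp.FPN(
    encoder_name="mobilenet_v2",   # backbone from the table
    encoder_weights="imagenet",    # ImageNet-pretrained encoder weights
    in_channels=3,
    classes=NUM_CLASSES,
)

x = torch.randn(1, 3, 256, 256)    # "(256)" in the table read as the input resolution
with torch.no_grad():
    logits = model(x)              # shape: (1, NUM_CLASSES, 256, 256)
```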
Daily Report
We researched existing architectures and decided to start with the UNet architecture. We also began familiarizing ourselves with PyTorch.
We coded the UNet architecture from scratch and trained the model for the first time. We then improved performance by resizing the dataset images to 128x128 and using fewer layers (4) in the UNet. We tested the model with another optimizer, SGD, with a different learning rate, but the results were not promising. We also researched existing backbones to improve the model's performance.
We implemented the following backbones: ResNet, EfficientNet, MobileNet, and ImageNet. We also tried to code an AutoEncoder to improve our current model. Then we tested a new architecture (FPN) with different learning rates. We also added a One Cycle scheduler (sketched below) and augmented the images in the dataset. We achieved a satisfactory dice score with FPN, but the FastCNN architecture allows lower inference time and uses less memory (700 MB). Unfortunately, its dice score is not satisfactory (0.38). The new objective is therefore to start from the FastCNN model and try to improve its dice score.
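The One Cycle scheduler mentioned above is available in PyTorch as torch.optim.lr_scheduler.OneCycleLR. The sketch below shows how it is typically wired into a training loop; the model, max_lr, batch count, and epoch count are placeholder assumptions, not the project's actual values.

```python
import torch

# Hypothetical placeholders standing in for the real segmentation model and data loader.
model = torch.nn.Conv2d(3, 14, kernel_size=1)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

epochs, steps_per_epoch = 100, 50  # assumptions for illustration
scheduler = torch.optim.lr_scheduler.OneCycleLR(
    optimizer, max_lr=1e-3, epochs=epochs, steps_per_epoch=steps_per_epoch
)

for epoch in range(epochs):
    for step in range(steps_per_epoch):
        optimizer.zero_grad()
        # forward pass and loss.backward() would go here in the real training loop
        optimizer.step()
        scheduler.step()  # OneCycleLR is stepped once per batch, not per epoch
```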
We improved the inference time by implementing quantization and a dice loss for FastCNN, and retrained the model on the dataset resized to 256x256, but the results were not satisfying. We then turned to another promising model, PSPNet. We started using the Jetson Nano and tried to load our model onto the board, but encountered numerous errors.
We optimized the PSPNet by testing different backbones (MobileNet, ResNet).
We worked on fixing the Jetson Nano issues in our code and loaded the models we had trained. We obtained an inference time of 50 ms with PSPNet. We also noticed that the first inference was much slower, but subsequent inferences were faster.
We searched for ways to optimize our model on the Jetson Nano device and used TensorRT to improve the inference time. As a result, the PSPNet model achieved an inference time of 16 ms.
To further improve this, we used quantization and thus achieved an inference time of 14 ms.
We looked for new models, read the corresponding papers, and focused on a paper published last May on a new architecture, JetSeg, adapting its code to our use case. We did the same for the ESPNet architecture.
This project is distributed under the MIT License. Please see the LICENSE file for more information.