  • Save path on server No. 4: /home/whut-4/Desktop/HXB/05-04/poly-yolo-master

  • conda environment: poly_yolo

  • The original poly_yolo project is on GitHub, but the GitLab repository is the most complete and ships a ready-made .h5 model; using the GitLab repository is strongly recommended

  • Training commands:

    cd /home/whut-4/Desktop/HXB/05-04/poly-yolo-master/poly_yolo
    python poly_yolo.py
    
    • If a cuDNN-related error is reported, reboot the machine and train again; this resolves it
    • File paths that must be changed for training:
    def get_random_data(line, input_shape, random=True, max_boxes=80, hue_alter=20, sat_alter=30, val_alter=30, proc_img=True):
      # load data
      # the color conversion happens later; there is no need to do bgr->rgb->hsv->rgb
      image = cv.imread('/home/whut-4/Desktop/HXB/official_poly_YOLO/poly-yolo/simulator_dataset/imgs/'+line[0])#HXB
      
    def _main():
          annotation_path = r'/home/whut-4/Desktop/HXB/05-04/poly-yolo-master/simulator_dataset/simulator-train.txt' #'train.txt'
          validation_path = r'/home/whut-4/Desktop/HXB/05-04/poly-yolo-master/simulator_dataset/simulator-val.txt' #'val.txt'
          log_dir = r'/home/whut-4/Desktop/HXB/05-04/poly-yolo-master/poly_yolo/models' #'models/'# not created automatically; create this directory manually
          classes_path = r'/home/whut-4/Desktop/HXB/05-04/poly-yolo-master/poly_yolo/yolo_classes.txt' #'yolo_classes.txt'
          anchors_path = r'/home/whut-4/Desktop/HXB/05-04/poly-yolo-master/poly_yolo/yolo_anchors.txt' #'yolo_anchors.txt'
    
  • Commands to run detection:

    cd poly_yolo
    python3 hxb_infer.py
    
  • If running hxb_infer.py fails with a cuDNN error, rebooting the machine resolves it

  • The detection dataset path and the save paths for the result images and the txt output are all changed inside hxb_infer.py

  • 05-22: ran the official pretrained model from GitLab

    • 'models/poly_yolo.h5'
    • Save path on server No. 4: /home/whut-4/Desktop/HXB/gitlab_official_poly_YOLO/poly-yolo-master
    • Detection results: /home/whut-4/Desktop/HXB/gitlab_official_poly_YOLO/poly-yolo-master/poly_yolo/poly_kitti_result 72 73 79 10
  • To output a format similar to Zhang Chang's new.txt, the script poly-yolo-master/poly_yolo/hxb_infer_write-txt.py was added

The solution based on Poly-YOLO was the runner-up in the Signate 3rd AI Edge Competition on object detection and tracking.

This is the reference implementation of our arXiv paper.

What is inside?

The repository includes four folders with the following content:

  • poly_yolo: the reference implementation of Poly-YOLO

  • simulator_dataset: our own synthetic dataset consisting of 700 training, 90 validation, and 100 test images with a resolution of 600x800px. The dataset is useful for fast prototyping, hyperparameter searching, or as a starting point for transfer learning because the low number of images allows fast training, and the captured scenes are trivial. It includes only a single class (a car), whose particular instances are rendered using a single 3D model. On the other hand, the scene is illuminated by physically-precise lights.

  • synthetic_dataset: a script for generating an unlimited number of images, where the following parameters can be configured: the resolution of the images, the number of geometric primitives per image, the type of geometric primitives, and the range of their sizes. It is also possible to add a random background (a minimal sketch of the idea follows after this list).

  • tpami_paper: Our messy scripts used for writing the paper
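
The generator in the synthetic_dataset folder is the authoritative version; purely as an illustration of the idea, here is a minimal sketch (the function and parameter names are assumptions, not the script's API) that draws random filled primitives with OpenCV:

    import cv2 as cv
    import numpy as np

    def generate_image(width=800, height=600, n_primitives=5, max_size=120, background=None):
        # Hypothetical sketch of a primitive generator; see the synthetic_dataset
        # folder for the real, configurable script.
        img = background.copy() if background is not None else np.full((height, width, 3), 255, np.uint8)
        for _ in range(n_primitives):
            color = tuple(int(c) for c in np.random.randint(0, 256, 3))
            x, y = int(np.random.randint(0, width)), int(np.random.randint(0, height))
            size = int(np.random.randint(10, max_size))
            if np.random.rand() < 0.5:
                cv.circle(img, (x, y), size // 2, color, -1)                # filled circle
            else:
                cv.rectangle(img, (x, y), (x + size, y + size), color, -1)  # filled square
        return img

    if __name__ == '__main__':
        cv.imwrite('synthetic_000.png', generate_image())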

Inference on Cityscapes images

We present a new version of YOLO extended with instance segmentation, called Poly-YOLO. Poly-YOLO builds on the original ideas of YOLOv3 and removes two of its weaknesses: a large amount of rewritten labels and an inefficient distribution of anchors. Poly-YOLO reduces the amount of rewritten labels by aggregating features from a light SE-Darknet-53 backbone with a hypercolumn technique, using stairstep upsampling, and produces a single-scale output with high resolution. In comparison with YOLOv3, Poly-YOLO has only 60% of its trainable parameters but improves mAP by a relative 40%. We also present Poly-YOLO lite with fewer parameters and a lower output resolution. It has the same precision as YOLOv3, but it is three times smaller and twice as fast, and thus suitable for embedded devices. Finally, Poly-YOLO performs instance segmentation using bounding polygons. The network is trained to detect size-independent polygons defined on a polar grid. The vertices of each polygon are predicted together with their confidence, and therefore Poly-YOLO produces polygons with a varying number of vertices.
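
To make the polar-grid idea concrete, here is a rough sketch of the encoding as we understand it (an assumption about the representation, not the exact code from poly_yolo.py): each vertex is expressed relative to the bounding-box centre as an angle plus a distance normalised by the box size, which makes the polygon description size independent.

    import numpy as np

    def vertices_to_polar(vertices, box):
        # Hedged sketch: encode polygon vertices on a polar grid relative to the
        # bounding-box centre so the representation is size independent.
        x1, y1, x2, y2 = box
        cx, cy = (x1 + x2) / 2.0, (y1 + y2) / 2.0
        diag = np.hypot(x2 - x1, y2 - y1)
        encoded = []
        for vx, vy in vertices:
            angle = np.arctan2(vy - cy, vx - cx)       # selects the angular cell of the polar grid
            dist = np.hypot(vx - cx, vy - cy) / diag   # radius normalised by the box diagonal
            encoded.append((angle, dist))
        return encoded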

Poly-YOLO is based on the following repositories:

Requirements

  • Python 3.5.x
  • Keras 2.3.0
  • Tensorflow 1.15.0
  • Numpy, PIL, OpenCV
  • cudatoolkit 10.0
  • cuDNN compatible with cudatoolkit 10.0
  • numpy 1.16.4
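
A quick way to verify that an environment (for example the poly_yolo conda environment mentioned in the notes above) matches these versions:

    # Minimal sanity check for the versions listed above.
    import sys
    import numpy
    import keras
    import tensorflow as tf

    print('python    ', sys.version.split()[0])       # expected 3.5.x
    print('numpy     ', numpy.__version__)            # expected 1.16.4
    print('keras     ', keras.__version__)            # expected 2.3.0
    print('tensorflow', tf.__version__)               # expected 1.15.0
    print('GPU found :', tf.test.is_gpu_available())  # requires cudatoolkit 10.0 and a matching cuDNN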

Issues and TODOs

  • Rewrite PIL data augmentation into OpenCV version
  • Turn off augmentation when validation data are processed during training (a possible fix is sketched after this list)
  • Add TF2.0 compatibility
  • Make script converting semantic segmentation labels into the Poly-YOLO format
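
For the second TODO, one possible direction is to thread an augment flag through the data generator and pass augment=False for the validation split. This is a hypothetical sketch, not the repository's actual generator; it only relies on the random flag of get_random_data shown in the notes above.

    # Hypothetical sketch for the "turn off augmentation for validation data" TODO.
    # It assumes the training loop builds batches via get_random_data(), whose
    # signature is quoted in the notes above; the real generator in poly_yolo.py may differ.
    def data_generator(annotation_lines, input_shape, batch_size, augment=True):
        i = 0
        while True:
            batch = []
            for _ in range(batch_size):
                line = annotation_lines[i].split()
                # random=False disables the hue/saturation/value jitter and random geometry
                batch.append(get_random_data(line, input_shape, random=augment))
                i = (i + 1) % len(annotation_lines)
            yield batch

    # train_gen = data_generator(train_lines, input_shape, batch_size, augment=True)
    # val_gen   = data_generator(val_lines,   input_shape, batch_size, augment=False)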

A pipeline for running it from scratch

  1. Take a dataset and set paths to it inside poly-yolo.py
  2. (Optional) Generate adjusted anchors with kmeans.py (a sketch of the idea follows after this list)
  3. (Optional) define classes in yolo_classes.txt
  4. Run poly-yolo.py and train it. Optionally set the optimizer, batch size, model resolution, number of epochs, etc.
  5. Run predictions for the best trained model
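
For step 2, kmeans.py in the repository is the script to use; the following is only a minimal sketch of the underlying k-means anchor clustering (plain Euclidean distance here, whereas anchor clustering is often done with an IoU-based distance):

    import numpy as np

    def kmeans_anchors(wh, k=9, n_iter=100):
        # Hedged sketch of k-means anchor clustering; the repository's kmeans.py is
        # the authoritative implementation and may use an IoU-based distance instead.
        # wh: float array of shape (N, 2) with box widths and heights from the annotations.
        centers = wh[np.random.choice(len(wh), k, replace=False)]
        for _ in range(n_iter):
            # assign every box to its nearest anchor
            dist = np.linalg.norm(wh[:, None, :] - centers[None, :, :], axis=2)
            assign = dist.argmin(axis=1)
            # move every anchor to the mean of the boxes assigned to it
            centers = np.array([wh[assign == i].mean(axis=0) if np.any(assign == i) else centers[i]
                                for i in range(k)])
        # anchors are conventionally listed from smallest to largest, as in yolo_anchors.txt
        return centers[np.argsort(centers.prod(axis=1))]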

Format of data for training

Generally, YOLO uses a notation of one image per line. A line includes all the boxes inside that image.

path_to\image1.jpg x1,y1,x2,y2,class,p1x,p1y,pnx,pny x1,y1,x2,y2,class,p1x,p1y,pnx,pny
path_to\image2.jpg x1,y1,x2,y2,class,p1x,p1y,pnx,pny

Here x1,y1 denotes the top-left corner of a bounding box and x2,y2 its bottom-right corner. p1x,p1y ... pnx,pny are the coordinates of the bounding polygon vertices. The script labels_to_yolo_format.py converts IDD and Cityscapes dataset annotations into this YOLO format. The generated annotation file is placed in the provided image folder. Use '--help' for a description of the script parameters.
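
As an illustration of the format, a minimal parsing sketch (the helper name is ours, not from the repository):

    def parse_annotation_line(line):
        # Split one annotation line from the format above into the image path and a
        # list of objects; each object is (x1, y1, x2, y2, class_id, polygon_vertices).
        parts = line.strip().split()
        image_path, objects = parts[0], []
        for obj in parts[1:]:
            values = [float(v) for v in obj.split(',')]
            x1, y1, x2, y2, cls = values[:5]
            polygon = list(zip(values[5::2], values[6::2]))  # (p1x, p1y), ..., (pnx, pny)
            objects.append((x1, y1, x2, y2, int(cls), polygon))
        return image_path, objects

    # example: path, objs = parse_annotation_line(open('simulator-train.txt').readline())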

Example of inference

See https://gitlab.com/irafm-ai/poly-yolo/-/blob/master/poly_yolo/poly_yolo_inference.ipynb
For an illustration of the performance and a comparison between the versions, see our YouTube video.