(1) Synthetic Scene Text Dataset from 3D World

| Languages | Num of Images | Num of Text | Baidu Drive | Google Drive |
|---|---|---|---|---|
| English/Latin | 728K | ~20M | Link (password: 2h8d) | Link |
| Multilingual | 674K | ~18M | Link (password: tddl) | Link |

The multilingual version consists of the following 10 languages: Arabic, English, French, Chinese, German, Korean, Japanese, Italian, Bangla, Hindi

Both datasets are very large (~150GB), so I split them into about 130 files. They are organized as follows:

./
+---sub_0
    +---imgs
    |   0.jpg
    |   1.jpg
    |   ...
    |
    +---labels
    |   0.json
    |   1.json
    |   ...
    |
+---sub_1
+---sub_2
+---sub_3
...
+---sub_100
...
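
The layout above can be walked with a short script. This is a minimal sketch, not part of the release: the function name `iter_samples` is hypothetical, and it assumes that `imgs/N.jpg` and `labels/N.json` in the same `sub_*` directory describe the same sample.

```python
from pathlib import Path
from typing import Iterator, Tuple


def iter_samples(root: str) -> Iterator[Tuple[Path, Path]]:
    """Yield (image_path, label_path) pairs from every sub_* directory under root.

    Assumption: imgs/N.jpg and labels/N.json share the same index N.
    """
    root_dir = Path(root)
    # Sort sub-directories numerically so that sub_2 comes before sub_10.
    subs = sorted(
        (d for d in root_dir.glob("sub_*") if d.is_dir() and d.name[4:].isdigit()),
        key=lambda d: int(d.name[4:]),
    )
    for sub in subs:
        # Sort label files numerically for the same reason.
        labels = sorted(
            (p for p in (sub / "labels").glob("*.json") if p.stem.isdigit()),
            key=lambda p: int(p.stem),
        )
        for label_path in labels:
            image_path = sub / "imgs" / f"{label_path.stem}.jpg"
            if image_path.exists():  # skip labels whose image is missing
                yield image_path, label_path
```

Since the dataset is split across many downloads, the function only visits the `sub_*` directories that are actually present, so it also works on a partial download.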

The labels are stored in the following format:

{
    "imgfile": str, path to the corresponding image file, e.g. "imgs/0.jpg",
    "bbox": List[
                word_i (8 floats): [x0, y0, x1, y1, x2, y2, x3, y3]
                (from upper left corner, clockwise),
            ],
    "cbox": List[
                char_i (8 floats): [x0, y0, x1, y1, x2, y2, x3, y3]
                (from upper left corner, clockwise),
            ],
    "text": List[str]
}
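
To make the format concrete, here is a minimal sketch of reading one label file and regrouping each flat list of 8 floats into four (x, y) corner points. The names `load_label` and `to_corners` are hypothetical, and the code assumes that "imgfile" is relative to the `sub_*` directory that contains `labels/`.

```python
import json
from pathlib import Path
from typing import Any, Dict, List, Tuple

Point = Tuple[float, float]


def to_corners(box: List[float]) -> List[Point]:
    """Regroup 8 flat coordinates into 4 (x, y) points, upper left first, clockwise."""
    if len(box) != 8:
        raise ValueError(f"expected 8 coordinates, got {len(box)}")
    return [(box[i], box[i + 1]) for i in range(0, 8, 2)]


def load_label(label_path: str) -> Dict[str, Any]:
    """Load one label JSON file and convert its boxes to corner points."""
    label_file = Path(label_path)
    with label_file.open(encoding="utf-8") as f:
        raw = json.load(f)
    return {
        # Assumption: "imgfile" is relative to the sub_* directory,
        # i.e. the parent of the labels/ directory.
        "image": label_file.parent.parent / raw["imgfile"],
        "words": [to_corners(b) for b in raw["bbox"]],  # word-level boxes
        "chars": [to_corners(b) for b in raw["cbox"]],  # character-level boxes
        "text": raw["text"],
    }
```

For example, `load_label("sub_0/labels/0.json")["words"][0]` would give the four corners of the first word box in that image.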

(2) Demo UE Project(s)

| Scene Name | Baidu Drive | Google Drive |
|---|---|---|
| Realistic Rendering | Link (password: wgja) | Link |

How-to:

  1. Download and uncompress the project.
  2. In UE4.22, load the following file: Demo/Demo.uproject

(3) UnrealText resources

| Resources | Baidu Drive | Google Drive |
|---|---|---|
| background images | Link (password: 3x3r) | Link |
| fonts & corpus | Link (password: ip8w) | Link |

(4) Packaged Scene Executables

| Scenes | Baidu Drive | Google Drive |
|---|---|---|
| All 30 scene executables | Link (password: z3af) | Link |

How-to:

  1. Download and uncompress the archive.
  2. Go to $Name/$Name/Binaries/Linux/ and double-click the executable Demo.
  3. Alternatively, launch it from a terminal: ./$Name/$Name/Binaries/Linux/Demo