Replies: 1 comment 1 reply
-
你的训练集准确率已经达到 0.89,但如果希望进一步提升,可以尝试以下方法: 1. 数据增强(Data Augmentation)你的训练配置中已经包含了一些数据增强(如
2. 训练超参数调整
3. 模型结构优化
4. 训练数据优化
5. 计算资源优化
6. 调整
|
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
训练参数
Global:
debug: false
use_gpu: true
epoch_num: 50
log_smooth_window: 20
print_batch_step: 20
save_model_dir: ./output/rec_ppocr_v4_hgnet
save_epoch_step: 3
eval_batch_step: [0, 500]
cal_metric_during_train: true
pretrained_model: https://paddleocr.bj.bcebos.com/pretrained/ch_PP-OCRv4_rec_server_trained.pdparams
checkpoints:
save_inference_dir:
use_visualdl: true
infer_img: doc/imgs_words/ch/word_1.jpg
character_dict_path: ppocr/utils/ppocr_keys_v1.txt
max_text_length: &max_text_length 25
infer_mode: false
use_space_char: true
distributed: true
save_res_path: ./output/rec/predicts_ppocrv3.txt
Optimizer:
name: AdamW
beta1: 0.9
beta2: 0.999
lr:
name: Cosine
learning_rate: 0.0005
warmup_epoch: 5
warmup_start_lr: 1e-6
regularizer:
name: L2
factor: 1e-05
Architecture:
model_type: rec
algorithm: SVTR_HGNet
Transform:
Backbone:
name: PPHGNet_small
Head:
name: MultiHead
head_list:
- CTCHead:
Neck:
name: svtr
dims: 128
depth: 3
hidden_dims: 128
kernel_size: [3, 3]
use_guide: True
Head:
fc_decay: 0.00001
- NRTRHead:
nrtr_dim: 256
max_text_length: *max_text_length
Loss:
name: MultiLoss
loss_config_list:
- CTCLoss:
weight: 0.7
- NRTRLoss:
weight: 0.3
PostProcess:
name: CTCLabelDecode
Metric:
name: RecMetric
main_indicator: acc
Train:
dataset:
name: MSTextRecDataset
ds_width: false
data_dir: ./train_data/
ext_op_transform_idx: 1
label_file_list:
- ./train_data/train_list.txt
transforms:
- DecodeImage:
img_mode: BGR
channel_first: false
- RecConAug:
prob: 0.5
ext_data_num: 2
image_shape: [48, 320, 3]
max_text_length: *max_text_length
- RecAug:
- MultiLabelEncode:
gtc_encode: NRTRLabelEncode
- KeepKeys:
keep_keys:
- image
- label_ctc
- label_gtc
- length
- valid_ratio
sampler:
name: MultiScaleSampler
scales: [[320, 32], [320, 48], [320, 64]]
first_bs: 16
fix_bs: false
divided_factor: [16, 16] # w, h
is_training: True
loader:
shuffle: true
batch_size_per_card: 24
drop_last: true
num_workers: 8
Eval:
![5_e0hmabkz0](https://private-user-images.githubusercontent.com/939362/412715617-4b61c980-338d-4989-a9f2-18911fcebd50.jpg?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2NTQzODIsIm5iZiI6MTczOTY1NDA4MiwicGF0aCI6Ii85MzkzNjIvNDEyNzE1NjE3LTRiNjFjOTgwLTMzOGQtNDk4OS1hOWYyLTE4OTExZmNlYmQ1MC5qcGc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQyMTE0NDJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT01OTc1MjhkODI5MjgwNDY4MDIyN2JjNTJkOGZjNzAxYzI0NjNmOTUxNmZkOWU5ZTg0Y2ZlYjQzNDM4MTY0NTZlJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.RLw48MtAYGbaqia_RorCq9eVc75f7QsVb-9vVUPmR0M)
![6_2i1p704pd](https://private-user-images.githubusercontent.com/939362/412715632-8c96a7dd-c60e-4426-b7bd-c82403c8a67c.jpg?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2NTQzODIsIm5iZiI6MTczOTY1NDA4MiwicGF0aCI6Ii85MzkzNjIvNDEyNzE1NjMyLThjOTZhN2RkLWM2MGUtNDQyNi1iN2JkLWM4MjQwM2M4YTY3Yy5qcGc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQyMTE0NDJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1lNTM4ZGFkYjk5NDE4YmM4MmQ5ZTc2N2M2OTQzMTAyMTc4OTJhNjY4NjlmMDRmNjllNTg2ZjUwOWJmZDgyYjZiJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.MVW9-3QNwYCkL36dvSjazR4hUAFOC6u95THwYV9oV9s)
dataset:
name: TextRecDataset
data_dir: ./train_data
label_file_list:
- ./train_data/val_list.txt
transforms:
- DecodeImage:
img_mode: BGR
channel_first: false
- MultiLabelEncode:
gtc_encode: NRTRLabelEncode
- RecResizeImg:
image_shape: [3, 48, 320]
- KeepKeys:
keep_keys:
- image
- label_ctc
- label_gtc
- length
- valid_ratio
loader:
shuffle: false
drop_last: false
batch_size_per_card: 12
8
num_workers: 4
训练样本示例
Beta Was this translation helpful? Give feedback.
All reactions