Character recognition in Japanese historical documents via adaptive multi-region model

Yueyu Wang, Sei Ichiro Kamata

研究成果: Conference contribution

1 被引用数 (Scopus)

抄録

In this work, we introduce a novel model with an adaptive multi-region extraction network to grasp multi-aspect of discriminative features, because feature inside bounding box is insufficient for classification, and normal models are sensitive to inaccuracy of predicted bounding boxes. We use the new model to recognize Japanese from historical documents. This model can be trained end-to-end without any extra supervision. The resulting CNN-based representation has abundant of features, containing the contextual information together with center part information. These features are helpful and crucial for classification. Based on this model, we also propose a data augmentation method using both local and global data distortion to generate diversified samples in order to solve the problem of data imbalance. Experiments show that with the usage of our model, we get a better result in ancient Japanese dataset.

本文言語English
ホスト出版物のタイトル2018 Joint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018
出版社Institute of Electrical and Electronics Engineers Inc.
ページ404-409
ページ数6
ISBN(電子版)9781538651612
DOI
出版ステータスPublished - 2019 2月 12
イベントJoint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018 - Kitakyushu, Japan
継続期間: 2018 6月 252018 6月 28

出版物シリーズ

名前2018 Joint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018

Conference

ConferenceJoint 7th International Conference on Informatics, Electronics and Vision and 2nd International Conference on Imaging, Vision and Pattern Recognition, ICIEV-IVPR 2018
国/地域Japan
CityKitakyushu
Period18/6/2518/6/28

ASJC Scopus subject areas

  • 信号処理
  • 制御と最適化
  • 人工知能
  • コンピュータ ビジョンおよびパターン認識
  • 情報システム

フィンガープリント

「Character recognition in Japanese historical documents via adaptive multi-region model」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル