PaddleOCR2.6训练ICDAR2015数据集图片找不到路径问题，相对路径转绝对路径

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

项目地址：https://gitcode.com/gh_mirrors/pa/PaddleOCR

免费下载资源

天地立心i

583人浏览 · 2023-07-22 14:47:30

天地立心i · 2023-07-22 14:47:30 发布

在使用百度飞桨下PaddleOCR2.6训练识别部分时，官方教程在项目下./doc/doc_ch/recognition.md,详细介绍了识别训练，具体过程可以参照此md文件

说一下遇到的问题，ICDAR2015数据格式如下：

两个out.txt文件是输出的，原有的.txt文件虽然集合了图片路径和标注信息，但给的路径是相对路径，在训练识别部分时会报错，imgs does not exist 很让人崩溃是不是。

在官方转换数据格式代码的基础上做出修改，路径在./ppocr/utils/gen_label.py,官方代码：

import os
import argparse
import json


def gen_rec_label(input_path, out_label):
    with open(out_label, 'w') as out_file:
        with open(input_path, 'r') as f:
            for line in f.readlines():
                tmp = line.strip('\n').replace(" ", "").split(',')
                img_path, label = tmp[0], tmp[1]
                label = label.replace("\"", "")
                out_file.write(img_path + '\t' + label + '\n')


def gen_det_label(root_path, input_dir, out_label):
    with open(out_label, 'w') as out_file:
        for label_file in os.listdir(input_dir):
            img_path = root_path + label_file[3:-4] + ".jpg"
            label = []
            with open(
                    os.path.join(input_dir, label_file), 'r',
                    encoding='utf-8-sig') as f:
                for line in f.readlines():
                    tmp = line.strip("\n\r").replace("\xef\xbb\xbf",
                                                     "").split(',')
                    points = tmp[:8]
                    s = []
                    for i in range(0, len(points), 2):
                        b = points[i:i + 2]
                        b = [int(t) for t in b]
                        s.append(b)
                    result = {"transcription": tmp[8], "points": s}
                    label.append(result)

            out_file.write(img_path + '\t' + json.dumps(
                label, ensure_ascii=False) + '\n')


if __name__ == "__main__":
    parser = argparse.ArgumentParser()
    parser.add_argument(
        '--mode',
        type=str,
        default="rec",
        help='Generate rec_label or det_label, can be set rec or det')
    parser.add_argument(
        '--root_path',
        type=str,
        default=".",
        help='The root directory of images.Only takes effect when mode=det ')
    parser.add_argument(
        '--input_path',
        type=str,
        default="D:/Learn/paddleocr2.6/train_data/icdar2015/text_localization/test_icdar2015_label.txt",
        help='Input_label or input path to be converted')
    parser.add_argument(
        '--output_label',
        type=str,
        default="out_rec_test_label.txt",
        help='Output file name')

    args = parser.parse_args()
    if args.mode == "rec":
        print("Generate rec label")
        gen_rec_label(args.input_path, args.output_label)
    elif args.mode == "det":
        gen_det_label(args.root_path, args.input_path, args.output_label)

修改之后的代码：

import os
import argparse
import json


def collect_paths_and_annotations(input_path, output_label, mode):
    with open(output_label, 'w') as out_file:
        with open(input_path, 'r') as f:
            for line in f.readlines():
                tmp = line.strip().split('\t')
                img_path, annotations = tmp[0], tmp[1]
                img_path = os.path.abspath(img_path)  # Convert to absolute path
                img_path = img_path.replace("\\", "/")  # Convert to Unix-style path

                if mode == "rec":
                    # In rec mode, annotations are the labels directly
                    out_file.write(f"{img_path}\t{annotations}\n")
                elif mode == "det":
                    # In det mode, annotations are in JSON format, so we parse them
                    annotations = json.loads(annotations)
                    annotations_str = json.dumps(annotations, ensure_ascii=False)
                    out_file.write(f"{img_path}\t{annotations_str}\n")


if __name__ == "__main__":
    parser = argparse.ArgumentParser()
    parser.add_argument(
        '--mode',
        type=str,
        default="rec",
        help='Generate rec_label or det_label, can be set rec or det')
    parser.add_argument(
        '--input_path',
        type=str,
        default="D:/Learn/paddleocr2.6/train_data/icdar2015/text_localization/train_icdar2015_label.txt",
        help='Input_label or input path to be converted')
    parser.add_argument(
        '--output_label',
        type=str,
        default="D:/Learn/paddleocr2.6/train_data/out_rec_train_label.txt",
        help='Output file name')

    args = parser.parse_args()
    collect_paths_and_annotations(args.input_path, args.output_label, args.mode)

这样把原有的相对路径.txt文件转换成绝对路径

问题解决！

顺便一提opencv-python版本兼容的问题，最新版本opencv会和项目有一定冲突，经过实验，发现以下版本最好

pip install opencv-python==4.6.0.66

顺利训练！

GitHub 加速计划 / pa / PaddleOCR

41.51 K

7.59 K

下载

最近提交(Master分支：1 个月前 )

ac5313d0 3 天前

284a20bf 3 天前

GitCode 开源社区

旨在为数千万中国开发者提供一个无缝且高效的云端环境，以支持学习、使用和贡献开源项目。

更多推荐

[转载]在Windows环境下安装GNU Radio

转自：在Windows环境下安装GNURadio_恐弱智_新浪博客GNU Radio是用Python开发的，大部分开源的工程能够在Linux环境下运行良好，而Windows下却运行的很勉强，而且安装配置都很复杂。GNU Radio算是个例外了，不光提供了Windows的二进制安装，还有比较详细的说明。我是Python小白，所以折腾了好久才弄好，特意记录下来，免得以后再装还折腾。GNU Radio的

GitCode 开源社区

centOS 8 使用dnf安装Docker

DNF是什么？CentOS 8使用YUM软件包管理器版本v4.0.4。现在，该版本使用DNF(已删除YUM)。DNF是软件包管理器。它会在Linux发行版上安装，执行更新并删除软件包。使用DNF安装Docker跳过具有损坏依赖性的程序包一个有效的解决方案是使您的CentOS 8系统使用以下--nobest命令安装最符合条件的版本：sudo dnf install docker...

GitCode 开源社区

定时同步数据库表(mysql+linux+crontab)

sync.sh里面的参数需要改变，ip/username/password/database/tablesync.sh#!/bin/sh# Please change the IP and password of the data source db.# Then change the table name.filename=/home/nington/db/$(date +%Y-%m