别再手动转了！用Python脚本一键搞定COCO转YOLO格式（附完整代码与避坑点）

张

张建站

2026/5/5 12:31:28

10分钟阅读

别再手动转了！用Python脚本一键搞定COCO转YOLO格式（附完整代码与避坑点）

从COCO到YOLO高效数据格式转换实战指南每次开始新的目标检测项目时数据准备总是最耗时的环节之一。特别是当你的模型需要使用YOLO架构而手头只有COCO格式的标注数据时那种面对数百个JSON键值对的无力感相信很多开发者都深有体会。本文将带你彻底解决这个痛点不仅提供经过生产环境验证的完整Python脚本还会深入解析转换过程中的技术细节和常见陷阱。1. 理解两种标注格式的本质差异在开始编写转换脚本前我们需要清楚COCO和YOLO格式在数据结构上的根本区别。这种理解能帮助我们在遇到转换异常时快速定位问题。COCO格式使用JSON文件存储标注信息其边界框表示为[x, y, width, height]其中(x, y)表示边界框中心点的坐标width和height表示边界框的宽度和高度坐标值是绝对像素值基于原始图像尺寸而YOLO格式要求TXT文件每行表示一个对象的标注格式为class x_center y_center width height其中class是类别索引从0开始中心坐标和宽高都是相对于图像宽高的归一化值0-1之间每行对应图像中的一个对象实例关键转换公式如下def convert(size, box): # size是图像的(width, height) # box是COCO格式的[x, y, width, height] dw 1.0 / size[0] dh 1.0 / size[1] x_center (box[0] box[2]/2) * dw y_center (box[1] box[3]/2) * dh width box[2] * dw height box[3] * dh return (x_center, y_center, width, height)2. 完整转换脚本解析下面这个脚本经过了多个项目的实战检验包含了错误处理和性能优化。建议保存为coco2yolo.py#!/usr/bin/env python3 import os import json from tqdm import tqdm import argparse def parse_args(): parser argparse.ArgumentParser( descriptionConvert COCO format annotations to YOLO format) parser.add_argument(--json_path, requiredTrue, helpPath to COCO format JSON file) parser.add_argument(--save_dir, defaultlabels, helpDirectory to save YOLO format labels) parser.add_argument(--image_dir, defaultimages, helpDirectory where images are stored) return parser.parse_args() def convert_bbox(size, box): Convert COCO bbox to YOLO format dw 1.0 / size[0] dh 1.0 / size[1] x box[0] box[2] / 2.0 y box[1] box[3] / 2.0 w box[2] h box[3] x round(x * dw, 6) w round(w * dw, 6) y round(y * dh, 6) h round(h * dh, 6) return (x, y, w, h) def main(): args parse_args() # 创建保存目录 os.makedirs(args.save_dir, exist_okTrue) # 加载COCO标注 with open(args.json_path) as f: data json.load(f) # 创建类别映射文件 id_map {cat[id]: idx for idx, cat in enumerate(data[categories])} with open(os.path.join(args.save_dir, classes.txt), w) as f: f.writelines(f{cat[name]}\n for cat in data[categories]) # 创建图像路径列表文件 image_list_path os.path.join(args.save_dir, image_list.txt) # 处理每张图像的标注 with open(image_list_path, w) as list_file: for img in tqdm(data[images], descProcessing images): img_id img[id] file_name img[file_name] img_size (img[width], img[height]) # 为每张图像创建标注文件 label_name os.path.splitext(file_name)[0] .txt label_path os.path.join(args.save_dir, label_name) with open(label_path, w) as label_file: # 查找该图像的所有标注 annotations [ann for ann in data[annotations] if ann[image_id] img_id] for ann in annotations: category_id id_map[ann[category_id]] bbox convert_bbox(img_size, ann[bbox]) label_file.write( f{category_id} {bbox[0]} {bbox[1]} {bbox[2]} {bbox[3]}\n) # 写入图像路径到列表文件 rel_image_path os.path.join(args.image_dir, file_name) list_file.write(f{rel_image_path}\n) if __name__ __main__: main()脚本使用方式python coco2yolo.py --json_path /path/to/instances_train2017.json \ --save_dir ./labels \ --image_dir ./images3. 实战中的常见问题与解决方案3.1 路径问题问题现象脚本运行时报错FileNotFoundError或生成的TXT文件为空。解决方案使用绝对路径而非相对路径确保--image_dir参数与实际的图像存储位置一致检查JSON文件中file_name字段的路径格式# 路径处理最佳实践 image_path os.path.normpath(os.path.join(args.image_dir, file_name))3.2 类别ID不连续问题COCO数据集的类别ID通常不连续如1,2,3,5...直接使用会导致YOLO训练出错。解决方法建立新的连续ID映射id_map {} for idx, cat in enumerate(data[categories]): id_map[cat[id]] idx # 映射到0开始的连续ID3.3 归一化精度问题YOLO要求坐标值保留6位小数但直接计算可能导致精度损失。优化方案使用Python的decimal模块进行高精度计算from decimal import Decimal, getcontext getcontext().prec 8 # 设置足够高的精度 def convert_bbox(size, box): dw Decimal(1) / Decimal(size[0]) dh Decimal(1) / Decimal(size[1]) # 其余计算保持不变...4. 性能优化技巧当处理大型COCO数据集如train2017时原始脚本可能运行缓慢。以下是几个优化点4.1 使用字典加速标注查找# 建立image_id到annotations的映射 from collections import defaultdict img_ann_map defaultdict(list) for ann in data[annotations]: img_ann_map[ann[image_id]].append(ann) # 使用时直接查询 annotations img_ann_map.get(img_id, [])4.2 多进程处理对于超大数据集可以使用Python的multiprocessing模块from multiprocessing import Pool def process_image(args): img, img_ann_map, id_map args # 处理单张图像的逻辑... if __name__ __main__: with Pool(processes4) as pool: # 使用4个进程 args_list [(img, img_ann_map, id_map) for img in data[images]] pool.map(process_image, args_list)4.3 内存优化对于特别大的JSON文件可以使用ijson库进行流式解析import ijson def parse_large_json(json_path): with open(json_path, rb) as f: # 流式处理images和annotations images ijson.items(f, images.item) for img in images: # 处理每张图像...5. 与训练流程的集成转换后的数据需要正确集成到YOLO训练流程中。关键步骤包括目录结构准备dataset/ ├── images/ │ ├── train/ │ └── val/ └── labels/ ├── train/ └── val/创建YOLO格式的配置文件# coco.yaml train: ../dataset/images/train val: ../dataset/images/val nc: 80 # 类别数 names: [person, bicycle, car, ..., toothbrush] # 类别名称验证数据加载import cv2 import random def visualize_label(image_path, label_path): img cv2.imread(image_path) dh, dw img.shape[:2] with open(label_path) as f: for line in f.readlines(): class_id, x, y, w, h map(float, line.split()) # 转换回像素坐标 x int((x - w/2) * dw) y int((y - h/2) * dh) w int(w * dw) h int(h * dh) # 绘制边界框...在实际项目中我发现将转换脚本集成到数据预处理流水线中能显著提高效率。可以添加--dry_run参数来验证转换逻辑而不实际写入文件这在调试阶段特别有用。另一个实用技巧是在脚本中添加验证模式自动检查转换后的标注是否与原始图像匹配

企业如何利用Taotoken的访问控制与审计日志管理内部AI资源使用

企业如何利用Taotoken的访问控制与审计日志管理内部AI资源使用 1. 企业AI资源管理的核心挑战在企业内部部署大模型API时，资源分配与权限管理往往面临多重挑战。不同部门或项目组对模型能力的需求差异较大，开发团队可能同时测试多个模型版本&#xff0…...

2026/5/5 12:30:38 阅读更多 →

OpenIM Server企业级部署终极指南：从零构建高可用即时通讯平台

OpenIM Server企业级部署终极指南：从零构建高可用即时通讯平台【免费下载链接】open-im-server IM Chat OpenClaw 项目地址: https://gitcode.com/gh_mirrors/op/open-im-server OpenIM Server是一个开源的企业级即时通讯解决方案，支持千万级并发…...

2026/5/5 12:30:01 阅读更多 →

FanControl终极指南：5分钟搞定Windows风扇调速与散热控制

FanControl终极指南：5分钟搞定Windows风扇调速与散热控制【免费下载链接】FanControl.Releases This is the release repository for Fan Control, a highly customizable fan controlling software for Windows. 项目地址: https://gitcode.com/GitHub_Trending…...

2026/5/5 12:27:33 阅读更多 →

Autovisor：终极自动化学习助手 - 5分钟快速上手智慧树刷课教程

Autovisor：终极自动化学习助手 - 5分钟快速上手智慧树刷课教程【免费下载链接】Autovisor 2025智慧树刷课脚本基于Python Playwright的自动化程序 [有免安装版] 项目地址: https://gitcode.com/gh_mirrors/au/Autovisor 你是否厌倦了每天手动点击播放、等待…...

2026/5/5 12:09:26 阅读更多 →

ModelScope Auto Proxy：智能路由网关，零成本统一调用免费大模型API

1. 项目概述与核心价值如果你和我一样，是个重度依赖 AI 编程工具（比如 Cursor、Cline）的开发者，那你肯定对 OpenAI 的 API 调用成本又爱又恨。爱的是它强大的能力，恨的是账单上的数字。最近，国内的开源社…...

2026/5/5 13:13:36 阅读更多 →

从零到一：手把手教你用BetaFlight CLI命令配置AOCODARC H7DUAL飞控板（保姆级教程）

从零到一：手把手教你用BetaFlight CLI命令配置AOCODARC H7DUAL飞控板（保姆级教程） 当你第一次拿到AOCODARC H7DUAL这块飞控板时，可能会被密密麻麻的引脚和复杂的配置选项吓到。别担心，这篇教程将带你从零开始&#xff…...

2026/5/5 3:26:31 阅读更多 →

League Akari：你的英雄联盟游戏体验进化指南

League Akari：你的英雄联盟游戏体验进化指南【免费下载链接】League-Toolkit An all-in-one toolkit for LeagueClient. Gathering power 🚀. 项目地址: https://gitcode.com/gh_mirrors/le/League-Toolkit 想象一下这样的场景：你正在…...

2026/5/5 9:51:58 阅读更多 →