告别繁琐配置！Spring Batch注解式开发入门：5分钟搭建你的第一个文件批处理Job

张

张建站

2026/5/1 3:04:24

10分钟阅读

告别繁琐配置Spring Batch注解式开发入门5分钟搭建你的第一个文件批处理Job批处理任务在企业级应用中无处不在——从每日的报表生成、数据清洗到大规模日志分析。传统Spring Batch开发中XML配置的冗长常让开发者望而却步。现在借助Spring Boot的自动化配置和现代注解体系我们能用极简代码实现专业级批处理能力。1. 环境准备与项目初始化首先通过Spring Initializr创建项目骨架只需勾选两个核心依赖dependencies dependency groupIdorg.springframework.boot/groupId artifactIdspring-boot-starter-batch/artifactId /dependency dependency groupIdorg.springframework.boot/groupId artifactIdspring-boot-starter-test/artifactId scopetest/scope /dependency /dependencies注意Spring Batch 5.x版本需要JDK 17支持若使用JDK 8可选择2.7.x版本创建基础启动类时关键是要排除数据源自动配置除非需要数据库持久化任务状态SpringBootApplication(exclude {DataSourceAutoConfiguration.class}) public class BatchApplication { public static void main(String[] args) { SpringApplication.run(BatchApplication.class, args); } }2. 注解驱动的批处理配置核心配置类只需两个注解即可激活批处理环境Configuration EnableBatchProcessing public class FileBatchConfig { Autowired private JobBuilderFactory jobBuilderFactory; Autowired private StepBuilderFactory stepBuilderFactory; }与传统XML配置相比注解方式有三大优势类型安全编译器可检查Bean类型匹配代码导航IDE支持直接跳转到实现配置集中所有组件定义在同一文件3. 构建文件处理流水线假设我们要处理学生成绩单CSV文件计算每个学生的总分。先定义领域模型Data AllArgsConstructor NoArgsConstructor public class StudentRecord { private String studentId; private int math; private int physics; private int chemistry; } Data AllArgsConstructor NoArgsConstructor public class StudentSummary { private String studentId; private int totalScore; }3.1 配置读写组件使用FlatFileItemReader构建CSV读取器Bean public FlatFileItemReaderStudentRecord csvReader() { return new FlatFileItemReaderBuilderStudentRecord() .name(studentReader) .resource(new ClassPathResource(scores.csv)) .delimited() .names(studentId, math, physics, chemistry) .fieldSetMapper(new BeanWrapperFieldSetMapper() {{ setTargetType(StudentRecord.class); }}) .build(); }对应的文件写入器配置Bean public FlatFileItemWriterStudentSummary csvWriter() { return new FlatFileItemWriterBuilderStudentSummary() .name(summaryWriter) .resource(new FileSystemResource(output/summary.csv)) .lineAggregator(new DelimitedLineAggregator() {{ setDelimiter(|); setFieldExtractor(new BeanWrapperFieldExtractor() {{ setNames(new String[]{studentId, totalScore}); }}); }}) .build(); }3.2 实现处理逻辑创建处理器计算总分public class ScoreCalculator implements ItemProcessorStudentRecord, StudentSummary { Override public StudentSummary process(StudentRecord item) { int total item.getMath() item.getPhysics() item.getChemistry(); return new StudentSummary(item.getStudentId(), total); } }4. 组装批处理任务将组件组合成完整任务Bean public Job calculateTotalScoresJob() { return jobBuilderFactory.get(scoreCalculation) .start(processStep()) .build(); } Bean public Step processStep() { return stepBuilderFactory.get(calculateStep) .StudentRecord, StudentSummarychunk(100) .reader(csvReader()) .processor(new ScoreCalculator()) .writer(csvWriter()) .build(); }关键参数说明chunk(100)每处理100条记录后执行一次写入reader/processor/writer构成完整处理链5. 运行与验证准备测试文件scores.csvs1001,85,92,88 s1002,78,85,90 s1003,92,95,89启动应用后查看output/summary.csv将看到s1001|265 s1002|253 s1003|276控制台会输出处理日志Processing student: s1001 with total 265 Processing student: s1002 with total 253 Processing student: s1003 with total 276 Job completed in 450ms6. 高级配置技巧6.1 任务监听与监控添加任务生命周期监听Bean public JobExecutionListener jobListener() { return new JobExecutionListener() { Override public void beforeJob(JobExecution jobExecution) { System.out.println(Job starting: jobExecution.getJobInstance().getJobName()); } Override public void afterJob(JobExecution jobExecution) { System.out.println(Job completed with status: jobExecution.getStatus()); } }; }在Job配置中添加监听器Bean public Job calculateTotalScoresJob() { return jobBuilderFactory.get(scoreCalculation) .listener(jobListener()) .start(processStep()) .build(); }6.2 多步骤任务复杂任务可拆分为多个步骤Bean public Job multiStepJob() { return jobBuilderFactory.get(advancedJob) .start(prepareStep()) .next(calculateStep()) .next(exportStep()) .build(); }6.3 异常处理策略配置跳过规则和重试机制Bean public Step faultTolerantStep() { return stepBuilderFactory.get(safeStep) .StudentRecord, StudentSummarychunk(50) .reader(csvReader()) .processor(calculator()) .writer(csvWriter()) .faultTolerant() .skipLimit(10) .skip(NumberFormatException.class) .retryLimit(3) .retry(DeadlockLoserDataAccessException.class) .build(); }7. 性能调优建议合理设置chunk大小内存充足时增大chunk size500-1000大数据量时适当减小50-100并行处理配置Bean public Step parallelStep() { return stepBuilderFactory.get(parallelStep) .StudentRecord, StudentSummarychunk(100) .reader(csvReader()) .processor(calculator()) .writer(csvWriter()) .taskExecutor(new SimpleAsyncTaskExecutor()) .throttleLimit(4) .build(); }JVM参数优化-Xms512m -Xmx2G -XX:UseG1GC实际项目中我曾处理过包含200万条记录的成绩单文件通过调整chunk size为500并启用并行处理将运行时间从45分钟缩短到7分钟。关键是要在开发环境进行多轮性能测试找到最佳参数组合。

手把手教你用STM32F103C8T6+ESP-01s做个桌面天气站（附心知天气API申请避坑指南）

从零打造智能桌面天气站：STM32与ESP8266的完美协作 1. 项目概述与核心组件选型在物联网技术蓬勃发展的今天，DIY一个个性化的智能天气显示终端已成为创客们入门嵌入式开发的经典项目。这个看似简单的装置，实则融合了微控制器编程、无线通信协…...

2026/5/1 2:49:06 阅读更多 →

数字IC后端工程师的日常：一次搞懂PR流程中的那些‘黑话’与核心工具（Astro/Star-RCXT实战解析）

数字IC后端工程师的日常：解码P&R流程中的专业术语与工具实战第一次打开Astro界面时，那些密密麻麻的菜单栏和术语让我想起了刚学外语时的词典——每个单词都认识，但连成句子就完全看不懂。Floorplan难道是要我画建筑平面图？CT…...

2026/5/1 2:48:58 阅读更多 →

基于开源框架构建领域专家级AI代理：从原理到代码评审实战

1. 项目概述：一个“专家代理”的诞生与价值在AI应用开发领域，我们常常面临一个困境：通用大模型（LLM）虽然知识广博，但在特定垂直领域的深度、精准度和执行效率上，往往力有不逮。这就好比请一位博…...

2026/5/1 2:48:03 阅读更多 →

如何3步完成百度文库文档纯净提取：突破付费限制的实用解决方案

如何3步完成百度文库文档纯净提取：突破付费限制的实用解决方案【免费下载链接】baidu-wenku fetch the document for free 项目地址: https://gitcode.com/gh_mirrors/ba/baidu-wenku 在信息获取过程中，百度文库的付费门槛、广告干扰和内容加载限…...

2026/4/30 23:34:59 阅读更多 →

zmq源码分析之DEALER/ROUTER 路由机制的应用场景

文章目录 1. 服务集群与负载均衡 2. 消息代理与路由器 3. 异步 RPC 系统 4. 聊天服务器 5. 游戏服务器 6. 金融交易系统 7. 物联网系统 8. 微服务架构代码示例：服务集群负载均衡器 (ROUTER) 服务实例 (DEALER) 客户端总结 DEALER/ROUTER 模式凭借其强大的路由能力和异步特性…...

2026/5/1 1:09:00 阅读更多 →

3分钟恢复Windows 11任务栏拖放功能：简单高效的终极解决方案

3分钟恢复Windows 11任务栏拖放功能：简单高效的终极解决方案【免费下载链接】Windows11DragAndDropToTaskbarFix "Windows 11 Drag & Drop to the Taskbar (Fix)" fixes the missing "Drag & Drop to the Taskbar" support in Windows…...

2026/4/30 19:16:10 阅读更多 →