Staff Software Engineer - OCR / Text Extraction
NetDocuments
hybridleadpermanentfullstackbackend United States 3 days ago via LinkedIn
See how well this job matches your profile
Sign up to get an AI match score and generate a tailored application in seconds.
Get your match scoreTags
OCRText ExtractionAWSKafkaEvent-Driven ArchitectureFull StackCloud-NativeTesseractApryse OCRSystem Architecture
About the role
Role Overview
Staff Software Engineer focused on OCR / Text Extraction within NetDocuments’ document content extraction and transformation team. You will provide technical leadership, drive system architecture, and help deliver scalable, event-driven platforms powering mission-critical document solutions.
Responsibilities
- Set technical direction for document content extraction and transformation systems (scalable, secure, performant) on AWS.
- Lead architectural decisions using OCR technologies such as Tesseract and Apryse OCR.
- Improve scalability, performance, reliability, and manage cost controls without reducing customer satisfaction.
- Guide evolution of the content extraction/transformation technology stack as needs scale.
- Make technical decisions balancing UX, performance, security, and maintainability.
- Design and implement event-driven architectures using AWS services, Kafka, and modern data pipelines.
- Build production-grade applications across frontend and backend to support next-generation document management systems.
- Collaborate with product, design, and engineering leadership to define system direction.
- Mentor engineers and drive technical excellence.
- Integrate cutting-edge capabilities, including AI-driven services and event-based data pipelines.
Requirements
- Experience building and architecting full stack and/or backend systems at scale.
- Strong technical leadership capability (setting direction, guiding teams, making key architecture decisions).
- Hands-on experience with OCR approaches/tools (e.g., Tesseract, Apryse OCR).
- Experience designing event-driven systems and building production platforms.
Nice-to-Haves
- Cloud-native architecture expertise, particularly with AWS.
- Experience with Kafka and data pipeline technologies.
- Experience integrating AI-driven services into production systems.
About NetDocuments
NetDocuments is the world’s #1 cloud-based content management and productivity platform for legal professionals. It helps customers manage documents and collaboration workflows with a customer-centric product approach and a strong focus on employee growth and innovation in an inclusive environment.
Scraped 4/24/2026