We are seeking an AI Senior Engineer - Vision to work at the cutting edge of Computer Vision and Logic. You will be responsible for extracting complex data from visual documents and orchestrating how that data is used by Large Language Models.
Requirements
- Unlocking Visual Data: Building pipelines that can'read' complex documents, understanding layout, charts, and visual context using Vision-Language Models (GPT-4V, Claude 3.5) and Layout Analysis.
- Orchestrating Intelligence: Owning the application logic layer. You will use LangChain or LangGraph to build the agents and chains that query our data, reason about it, and generate responses.
- Native PDF Handling: Handling the messy reality of PDF processing (PyMuPDF, layout parsing) to preserve structure before the AI even sees it.
- Prompt Engineering & Logic: Crafting complex prompts and control flows to ensure models interpret financial charts and layouts accurately without hallucinating.
- Cost & Scale: Applying a cost-optimization mindset (batch processing, model selection) to ensure our vision and orchestration layers are economically viable.
Benefits
- Generous Paid Time Off
- 401k Matching
- Tuition Reimbursement