I design and ship production GenAI systems β the messy, end-to-end kind, not just demos. MSc Artificial Intelligence (Distinction) from Queen Mary University of London, 4+ years of engineering experience, and a track record of taking ambiguous problems through architecture, deployment, and adoption.
Most recently I co-founded Scintilink (Dec 2024 β Sept 2025), a multi-agent AI platform that served 100+ researchers in production. Before that, I spent three years at Lumen Technologies shipping AI-driven forecasting tools that informed $25M+ in annual network investment decisions.
I care about AI that's safe, reliable, and actually useful β not just impressive in a screenshot.
An 8-agent system orchestrated with LangGraph, deployed on GCP, helping researchers reason over scientific literature with citation-backed answers.
| Metric | Result |
|---|---|
| Processing latency | β40% via intelligent routing & state management |
| Extraction accuracy | +30% through context engineering |
| Documents analysed | 500+ automated runs in production |
| Retrieval architecture | Hybrid BM25 + OpenAI embeddings + Reciprocal Rank Fusion |
Fine-tuned LLM + Multi-Agent RAG + Graph Neural Networks
- Fine-tuned Microsoft Phi-4 (14.7B params) with LoRA on FinQA β +550% accuracy on financial QA
- 3-way LangGraph query router over a 68,989-embedding Qdrant pipeline
- Streaming ingestion of 18,000+ SEC EDGAR filings β knowledge graph of 17,552 nodes / 420,796 edges
- Four GNN architectures: GraphSAGE, GATv2, Temporal GNN, Global Graph RAG
Cost/latency-aware model routing with confidence escalation
A LangGraph pipeline that classifies intent, scores mission-criticality and latency-sensitivity, then picks the optimal model and deployment target β with 0.65 confidence-threshold escalation, intent-aware fallbacks, and full Pydantic pipeline traces for observability.
Hybrid retrieval over BMW, Tesla, and Ford annual reports
BM25 (30%) + semantic (70%) fused via Reciprocal Rank Fusion across 2,871 chunks β 85.9% keyword match, 100% pass rate on 13-case eval suite.
Sub-500ms search on 100k+ documents Β· BM25Okapi Β· multi-threaded.
Agentic & LLM β LangGraph Β· LangChain Β· LangSmith Β· HuggingFace Transformers Β· LoRA Β· PEFT Β· OpenAI Β· Anthropic Β· Groq Β· FlashAttention-2
RAG & Retrieval β Hybrid retrieval (BM25 + semantic + RRF) Β· Qdrant Β· ChromaDB Β· Pinecone Β· Multilingual-E5-Large Β· knowledge graphs
Evaluation & Observability β Custom eval harnesses Β· A/B testing Β· agent decision logs Β· telemetry dashboards Β· regression testing
| π | Co-founded and shipped a production 8-agent platform serving 100+ researchers |
| β‘ | Cut multi-agent processing latency by 40% through intelligent routing |
| π― | Achieved 550% accuracy improvement fine-tuning a 14.7B-parameter LLM |
| π° | Built forecasting tools supporting $25M+ in annual network investment decisions |
| β±οΈ | Delivered a critical product 7 months ahead of schedule (2 months vs. 9-month estimate) |
| ποΈ | Optimised an Oracle PL/SQL system handling 10M+ monthly queries β +60% SQL performance |
| π | MSc Artificial Intelligence β Distinction, Queen Mary University of London |
mindmap
root((What I'm digging into))
Agentic AI
Multi-agent coordination patterns
Tool-use planning and verification
Long-horizon task decomposition
Evaluation
Agent decision logging
Failure-case taxonomy
LLM-as-judge calibration
Retrieval
Hybrid search tuning
Query decomposition
Contextual compression
Safety and Reliability
Guardrails for production
Human-in-the-loop design
Regression testing for LLMs
MSc Artificial Intelligence β Queen Mary University of London (2024β2025) Β· Distinction
B.Tech Computer Science & Systems Engineering β KIIT University (2017β2021)
| Provider | Certification |
|---|---|
| DeepLearning.AI | Neural Networks & Deep Learning Β· Improving Deep Neural Networks Β· Structuring ML Projects |
| Google Cloud | Introduction to Generative AI Β· Google Cloud Essentials |
| Duke University | Business Metrics for Data-Driven Companies |
| Rice University | Hypothesis Testing & Confidence Intervals |
Open to: Applied AI / AI engineer roles focused on agentic systems, RAG, and production LLM work. Based in London, UK β happy to discuss remote, hybrid, or relocation.
- πΌ LinkedIn β neelay-choudhury-768537152
- π§ neelaychoudhury1999@gmail.com