Skip to content
View neelay123's full-sized avatar
πŸ’­
I may be slow to respond.
πŸ’­
I may be slow to respond.
  • London

Block or report neelay123

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
neelay123/README.md

Header

Typing SVG

Portfolio LinkedIn Email


About

I design and ship production GenAI systems β€” the messy, end-to-end kind, not just demos. MSc Artificial Intelligence (Distinction) from Queen Mary University of London, 4+ years of engineering experience, and a track record of taking ambiguous problems through architecture, deployment, and adoption.

Most recently I co-founded Scintilink (Dec 2024 – Sept 2025), a multi-agent AI platform that served 100+ researchers in production. Before that, I spent three years at Lumen Technologies shipping AI-driven forecasting tools that informed $25M+ in annual network investment decisions.

I care about AI that's safe, reliable, and actually useful β€” not just impressive in a screenshot.


πŸš€ Recent Work

Scintilink β€” Multi-Agent Research Platform Co-founder Β· Dec 2024 – Sept 2025

An 8-agent system orchestrated with LangGraph, deployed on GCP, helping researchers reason over scientific literature with citation-backed answers.

Metric Result
Processing latency βˆ’40% via intelligent routing & state management
Extraction accuracy +30% through context engineering
Documents analysed 500+ automated runs in production
Retrieval architecture Hybrid BM25 + OpenAI embeddings + Reciprocal Rank Fusion

πŸ† Featured Projects

🧠 Agentic Financial Intelligence System

Fine-tuned LLM + Multi-Agent RAG + Graph Neural Networks

  • Fine-tuned Microsoft Phi-4 (14.7B params) with LoRA on FinQA β†’ +550% accuracy on financial QA
  • 3-way LangGraph query router over a 68,989-embedding Qdrant pipeline
  • Streaming ingestion of 18,000+ SEC EDGAR filings β†’ knowledge graph of 17,552 nodes / 420,796 edges
  • Four GNN architectures: GraphSAGE, GATv2, Temporal GNN, Global Graph RAG

🧭 Agentic LLM Router

Cost/latency-aware model routing with confidence escalation

A LangGraph pipeline that classifies intent, scores mission-criticality and latency-sensitivity, then picks the optimal model and deployment target β€” with 0.65 confidence-threshold escalation, intent-aware fallbacks, and full Pydantic pipeline traces for observability.

πŸš— Automotive RAG Chatbot

Hybrid retrieval over BMW, Tesla, and Ford annual reports

BM25 (30%) + semantic (70%) fused via Reciprocal Rank Fusion across 2,871 chunks β†’ 85.9% keyword match, 100% pass rate on 13-case eval suite.

πŸ“š Legal Search Engine

Sub-500ms search on 100k+ documents Β· BM25Okapi Β· multi-threaded.


πŸ› οΈ Tech Stack

Core Languages & Frameworks

Skills

Cloud, Infra & Data

Skills

Agentic & LLM β€” LangGraph Β· LangChain Β· LangSmith Β· HuggingFace Transformers Β· LoRA Β· PEFT Β· OpenAI Β· Anthropic Β· Groq Β· FlashAttention-2

RAG & Retrieval β€” Hybrid retrieval (BM25 + semantic + RRF) Β· Qdrant Β· ChromaDB Β· Pinecone Β· Multilingual-E5-Large Β· knowledge graphs

Evaluation & Observability β€” Custom eval harnesses Β· A/B testing Β· agent decision logs Β· telemetry dashboards Β· regression testing


πŸ“ˆ Selected Impact

πŸš€ Co-founded and shipped a production 8-agent platform serving 100+ researchers
⚑ Cut multi-agent processing latency by 40% through intelligent routing
🎯 Achieved 550% accuracy improvement fine-tuning a 14.7B-parameter LLM
πŸ’° Built forecasting tools supporting $25M+ in annual network investment decisions
⏱️ Delivered a critical product 7 months ahead of schedule (2 months vs. 9-month estimate)
πŸ—„οΈ Optimised an Oracle PL/SQL system handling 10M+ monthly queries β†’ +60% SQL performance
πŸŽ“ MSc Artificial Intelligence β€” Distinction, Queen Mary University of London

πŸ“Š GitHub Stats




Trophies


πŸ”­ Currently Exploring

mindmap
  root((What I'm digging into))
    Agentic AI
      Multi-agent coordination patterns
      Tool-use planning and verification
      Long-horizon task decomposition
    Evaluation
      Agent decision logging
      Failure-case taxonomy
      LLM-as-judge calibration
    Retrieval
      Hybrid search tuning
      Query decomposition
      Contextual compression
    Safety and Reliability
      Guardrails for production
      Human-in-the-loop design
      Regression testing for LLMs
Loading

πŸŽ“ Education & Certifications

MSc Artificial Intelligence β€” Queen Mary University of London (2024–2025) Β· Distinction
B.Tech Computer Science & Systems Engineering β€” KIIT University (2017–2021)

Provider Certification
DeepLearning.AI Neural Networks & Deep Learning Β· Improving Deep Neural Networks Β· Structuring ML Projects
Google Cloud Introduction to Generative AI Β· Google Cloud Essentials
Duke University Business Metrics for Data-Driven Companies
Rice University Hypothesis Testing & Confidence Intervals

πŸ“« Let's Connect

Open to: Applied AI / AI engineer roles focused on agentic systems, RAG, and production LLM work. Based in London, UK β€” happy to discuss remote, hybrid, or relocation.

Profile Views

Footer

Pinned Loading

  1. Hybrid-Multi-Agent-Financial-Intelligence-System Hybrid-Multi-Agent-Financial-Intelligence-System Public

    This integrates three core components: a domain-specific fine-tuned Phi-4 language model, a financial knowledge graph from SEC EDGAR filings, and a multi-agent workflow using LangGraph.

    Jupyter Notebook

  2. Hybrid-Legal-Search-Engine Hybrid-Legal-Search-Engine Public

    The system employs BM25 for the initial retrieval of documents, followed by semantic re-ranking using a lightweight, domain-adapted version of BERT (LEGALBERTSMALL). By integrating these two approa…

    Python

  3. Smart-Surveillance-with-Face-Recognition Smart-Surveillance-with-Face-Recognition Public

    A smart surveillance system using facial recognition system made as a part of my academic minor project submitted in 6th semester.

    Python

  4. automotive-rag-chatbot automotive-rag-chatbot Public

    A production-grade Retrieval-Augmented Generation (RAG) system for analyzing automotive industry annual reports from BMW, Tesla, and Ford.

    Python

  5. creative-automation-pipeline creative-automation-pipeline Public

    A production-ready automation system that generates localized creative assets for social media campaigns using Google Gemini API. Built to solve the challenge of creating hundreds of campaign varia…

    Python 1

  6. Agentic-LLM-router Agentic-LLM-router Public

    LangGraph-based routing pipeline. Each query goes through 4 LLM calls to figure out the best model/deployment.

    Jupyter Notebook