AI Tooling & Data Infrastructure Engineer - building pipelines, tools, and products that actually ship.
6 years of code. 18 years old. Lagos, Nigeria. Open to remote worldwide.
Two tools I built that work together as a complete data pipeline:
PhantomCrawl - 4-layer web scraper
TLS fingerprinting, anti-bot evasion, XHR interception, headless browser fallback. Scraped Cloudflare.com across 100+ pages with zero blocks. Ships as a single binary under 20MB.
PhantomClean - data cleaning pipeline
Regex stripping, boilerplate frequency detection, quality scoring, and a multi-provider AI cascade (Groq, OpenAI, Anthropic). Watches the scraped folder in real time, processes in batches, exports clean datasets. Ships as a single binary under 20MB.
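A rough illustration of what boilerplate frequency detection can look like: lines that repeat across most scraped pages (nav bars, footers) are treated as boilerplate and dropped. This is a minimal sketch, not PhantomClean's actual code; the function names `find_boilerplate` and `strip_boilerplate`, the `min_share` threshold, and the sample pages are all hypothetical.

```python
from collections import Counter


def find_boilerplate(pages, min_share=0.6):
    """Return the set of lines that appear on at least min_share of the pages."""
    counts = Counter()
    for text in pages:
        # Count each distinct line once per page, so repeats inside a
        # single page do not inflate its score.
        lines = {line.strip() for line in text.splitlines() if line.strip()}
        counts.update(lines)
    threshold = min_share * len(pages)
    return {line for line, n in counts.items() if n >= threshold}


def strip_boilerplate(text, boilerplate):
    """Drop every line of text whose stripped form is in the boilerplate set."""
    kept = [line for line in text.splitlines() if line.strip() not in boilerplate]
    return "\n".join(kept)


# Hypothetical sample input: three pages sharing a nav bar and, mostly, a footer.
pages = [
    "Home | Docs | Login\nFirst article body.\n(c) Example Inc.",
    "Home | Docs | Login\nSecond article body.\n(c) Example Inc.",
    "Home | Docs | Login\nThird article body.\nContact sales",
]

boilerplate = find_boilerplate(pages, min_share=0.6)
cleaned = [strip_boilerplate(page, boilerplate) for page in pages]
```

Counting per page rather than per occurrence keeps a line that repeats many times inside one article from being mistaken for site-wide chrome.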
Cloudflare Dataset - scraped and cleaned overnight. Free.
phantomit - npm i -g phantomit-cli
Watches your code, diffs changes on save, generates professional git commit messages via Groq AI. Live on npm.
Privacy-first web analytics. No cookies, one script tag. 10+ active users in the US. Built with PHP + MySQL.
Assignment management platform for teachers and students. Real-time grading, file uploads, dashboards. Next.js + TypeScript + PostgreSQL.
High-performance rate limiting library. Token bucket and sliding window. 10k+ requests/second. Zero dependencies.
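For readers unfamiliar with the token bucket approach, here is a minimal sketch of the idea: tokens refill at a fixed rate up to a capacity, and each request spends one. It is an illustration only, not this library's API; the `TokenBucket` class, its parameters, and the injectable `clock` are assumptions made for the example.

```python
import time


class TokenBucket:
    """Illustrative token bucket: refills `rate` tokens per second up to `capacity`."""

    def __init__(self, rate, capacity, clock=time.monotonic):
        self.rate = rate
        self.capacity = capacity
        self.clock = clock  # injectable so the bucket can be tested without sleeping
        self.tokens = float(capacity)  # start full
        self.last = clock()

    def allow(self, cost=1.0):
        """Return True and spend `cost` tokens if enough are available."""
        now = self.clock()
        elapsed = now - self.last
        self.last = now
        # Refill for the time that has passed, never exceeding capacity.
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False


# Usage with a fake clock (hypothetical values): 2 tokens/second, burst of 2.
fake_now = [0.0]
bucket = TokenBucket(rate=2, capacity=2, clock=lambda: fake_now[0])

burst = [bucket.allow(), bucket.allow(), bucket.allow()]  # third call exceeds the burst
fake_now[0] = 0.5  # half a second later, one token has refilled
after_wait = [bucket.allow(), bucket.allow()]
```

The bucket permits short bursts up to `capacity` while holding the long-run rate to `rate` requests per second; a sliding window trades that burst tolerance for a stricter per-interval cap.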
Real-time 3D ring configurator for jewelry e-commerce. 360° rotation, live material swapping. Next.js + Three.js + WebGL.
Languages: Go · TypeScript · Python · PHP · JavaScript
Frontend: Next.js · React · Tailwind · Three.js · Framer Motion
Backend: Node.js · FastAPI · REST APIs · CLI tooling
Databases: PostgreSQL · MySQL · SQLite
Tooling: Git · Docker · Linux · npm
- Publishing free AI training datasets at Phantom Datasets
- Building in public - tools, pipelines, real products
- Open to remote fullstack or backend roles
| Portfolio | var-raphael.vercel.app |
| Email | samuelraphael925@gmail.com |
| samuel-raphael |
Available for work · Remote worldwide · Building in public