var-raphael/README.md

Raphael Samuel

AI Tooling & Data Infrastructure Engineer - building pipelines, tools, and products that actually ship.

6 years of code. 18 years old. Lagos, Nigeria. Open to remote worldwide.


Phantom Suite

Two tools I built that work together as a complete data pipeline:

PhantomCrawl - 4-layer web scraper

TLS fingerprinting, anti-bot evasion, XHR interception, and a headless browser fallback. Scraped 100+ pages of Cloudflare.com with zero blocks. Ships as a single binary under 20MB.

PhantomClean - data cleaning pipeline

Regex stripping, boilerplate frequency detection, quality scoring, and a multi-provider AI cascade (Groq, OpenAI, Anthropic). Watches the scraped folder in real time, processes in batches, exports clean datasets. Single binary under 20MB.
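As a rough illustration of the boilerplate frequency detection idea, here is a minimal Go sketch. It is not PhantomClean's actual code: the function names `boilerplateLines` and `stripBoilerplate`, the line-level granularity, and the `minShare` threshold are all assumptions made for this example. The idea is that a line repeated across most scraped pages (nav bars, footers) is probably boilerplate.

```go
package main

import "strings"

// boilerplateLines returns the trimmed lines that occur in at least
// minShare (0..1) of the given documents. Hypothetical helper, not
// taken from PhantomClean.
func boilerplateLines(docs []string, minShare float64) map[string]bool {
	counts := make(map[string]int)
	for _, doc := range docs {
		seen := make(map[string]bool) // count each line once per document
		for _, line := range strings.Split(doc, "\n") {
			line = strings.TrimSpace(line)
			if line == "" || seen[line] {
				continue
			}
			seen[line] = true
			counts[line]++
		}
	}
	flagged := make(map[string]bool)
	threshold := minShare * float64(len(docs))
	for line, n := range counts {
		if float64(n) >= threshold {
			flagged[line] = true
		}
	}
	return flagged
}

// stripBoilerplate drops every line of doc that was flagged as boilerplate.
func stripBoilerplate(doc string, flagged map[string]bool) string {
	var kept []string
	for _, line := range strings.Split(doc, "\n") {
		if !flagged[strings.TrimSpace(line)] {
			kept = append(kept, line)
		}
	}
	return strings.Join(kept, "\n")
}
```

To try it, call `boilerplateLines` on a batch of page texts from a `main` function, then pass each page through `stripBoilerplate`. A real pipeline would likely pick the threshold per site and combine this with the quality scoring step.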

Cloudflare Dataset - scraped and cleaned overnight. Free.


Other Projects

phantomit - npm i -g phantomit-cli

Watches your code, diffs changes on save, generates professional git commit messages via Groq AI. Live on npm.

Privacy-first web analytics. No cookies, one script tag. 10+ active users in the US. Built with PHP + MySQL.

Assignment management platform for teachers and students. Real-time grading, file uploads, dashboards. Next.js + TypeScript + PostgreSQL.

High-performance rate limiting library. Token bucket and sliding window. 10k+ requests/second. Zero dependencies.

Real-time 3D ring configurator for jewelry e-commerce. 360° rotation, live material swapping. Next.js + Three.js + WebGL.


Tech Stack

Languages: Go · TypeScript · Python · PHP · JavaScript

Frontend: Next.js · React · Tailwind · Three.js · Framer Motion

Backend: Node.js · FastAPI · REST APIs · CLI tooling

Databases: PostgreSQL · MySQL · SQLite

Tooling: Git · Docker · Linux · npm


Currently

  • Publishing free AI training datasets at Phantom Datasets
  • Building in public - tools, pipelines, real products
  • Open to remote fullstack or backend roles

Contact

Portfolio: var-raphael.vercel.app
Email: samuelraphael925@gmail.com
LinkedIn: samuel-raphael

Available for work · Remote worldwide · Building in public

Pinned

  1. Ratelimiter Public

    Go 1

  2. classflow Public

    Full stack assignment management platform for teachers and students. Built with Next.js 14, TypeScript, Supabase, and TailwindCSS. Features role-based auth, WYSIWYG assignment creation, file upload…

    TypeScript 1 2

  3. phantomit Public

    JavaScript

  4. PhantomCrawl Public

    A 4-layer web crawler with AI cleaning, TLS fingerprinting, and anti-bot evasion. Scraped Cloudflare.

    Go 3 1

  5. PhantomClean Public

    Cleans the mess PhantomCrawl makes. Multi-layer data cleaner with AI, boilerplate detection, and batch processing.

    Go 2

  6. vexaro Public

    GitHub for web data, APIs, and AI datasets

    Go 1