Perception
VLM inputs, RealSense color/depth streams, masks, boxes, and object-state output.
Field notes from AI application work
I build practical AI application work around Qwen VLM demos, FastAPI/vLLM APIs, batch detection, RealSense + SAM tracking, Jetson AGX Orin tests, and RAG/LoRA experiments. I keep inputs, logs, screenshots, and setup notes close to the work so another engineer can check it.
VLM inputs, RealSense color/depth streams, masks, boxes, and object-state output.
Prompt design, JSON parsing, RAG/LoRA experiments, and model API workflows.
2D/3D anchors, pose hints, logs, CSV summaries, and notes another engineer can inspect.
Jetson AGX Orin, JetPack 6.2, RK3588/RKNN, Docker/Linux, and deployment checks.
Working stack
project notes with source links or demo previews
AI workstreams: model APIs, multimodal detection, robotics perception
static Cloudflare Pages site, no embedded API keys, resume included
Run evidence
I keep the evidence visible because it is more useful than a polished claim. Some older projects run cleanly; some need dependency fixes or external services, which I note directly instead of hiding it.
Robotics AI Demo Lab
I kept this lab compact on purpose: each mode shows a runnable slice, a replay, or a constrained API-backed draft. Qwen uses DashScope through a server-side Pages Function; Property and Finance can use DeepSeek the same way, without exposing keys in browser code.
Reference-guided detection with recorded samples and an optional live Qwen3-VL call.
Endpoint status: recorded replay ready. Click Try live endpoint to call the server-side Qwen/VLM function.
Selected work
I kept these short on purpose. Each card gives the problem, what I built, and a piece of evidence. The demo buttons below update the interactive panel without leaving the page.
Showing 6 projects
GenAI + computer vision
Full-stack AI app
Robotics perception
Security + web
React + Firebase
AI workflow
No matching projects. Try a different keyword or switch back to All.
Interactive preview
These are static previews, not live model inference. I use them to show inputs, outputs, and the small checks around each project.
How I work
For AI demos, I care about files and screenshots that let someone else verify the result. I keep input pairs, rendered outputs, logs, CSV summaries, timing notes and setup steps close to the work.
My usual tools are Codex, Antigravity, Hermes OpenClaw and Coze. I use them for scaffolding, debugging, workflow drafts and documentation. The useful habit is checking assumptions, running the code, and trimming generated text until it sounds like something I would actually say.
My recent work often sits between model behavior, cameras, APIs, Linux boxes, and demo deadlines. I am comfortable making the first version work, then writing enough notes that the next person is not guessing.
Experience
Worked on robotics AI demos around Qwen-family models and model APIs. Day to day, I used FastAPI, Gradio, vLLM, RealSense/SAM tracking, batch detection scripts and edge AI tests on Jetson AGX Orin and RK3588/RKNN devices. The work covered setup, scripts, outputs, reports, timing checks and demo maintenance.
Handled client requirements and server setup for ESXi/RAID, Docker/Nginx, reverse proxies and tunneling. I also worked on BeamNG and Minecraft server deployment, mod testing, troubleshooting and handoff notes.
Supported LLM deployment experiments, cloud server maintenance, AI course materials and Jupyter labs. I also helped with exams, project-site maintenance and documentation updates.
Contact
Best fit: LLM/VLM application engineering, Python/FastAPI services, robotics demo tooling, multimodal workflows, and full-stack prototypes that need practical delivery.