Hi, I'm Pankaj.
Data & AI Engineer.
Building intelligent, production-ready systems across LLM/RAG, data pipelines, and AI workflow automation — with over 7 years of remote engineering experience.

A short version of the story.
Welcome to my professional space. I'm a Data & AI Engineer with 7+ years of remote experience, specializing in LLM/RAG systems, data pipeline architecture, and AI workflow automation using tools like LangChain and n8n.
I'm currently advancing my expertise through the B.Sc. in Data Science and Artificial Intelligence at IIT Guwahati, one of India's premier technology institutes, with a curriculum focused on practical, industry-aligned learning across data engineering, MLOps, and applied AI.
I design and develop intelligent, production-ready systems that merge data engineering principles with modern AI integration. From building reliable ETL pipelines to embedding LLMs into business workflows, I deliver end-to-end solutions that improve automation, efficiency, and decision-making.
My work focuses on Retrieval-Augmented Generation (RAG), LLM orchestration, API integration, and AI-driven automation across cloud environments like AWS and GCP. I'm always open to collaborating with forward-thinking, data-centric teams.
Services & capabilities
A focused set of skills across the modern AI & data stack — from foundational engineering to applied LLM systems.
Data Science & ML Engineering
Advanced data analysis, machine learning, and visualization — transforming complex datasets into actionable insights.
Vector Database Integration
Semantic search and context retrieval for LLMs with Pinecone, Weaviate, or FAISS. Boosting performance in chatbots, agents, and RAG apps.
Deep Learning
Designing, training, and optimizing neural networks. Solving complex real-world problems with precision.
Prompt Engineering
Optimized prompts for classification, extraction, and reasoning. Iteratively tuned for consistent, project-aligned performance.
LLM & RAG Systems
Context-aware applications using LLMs and retrieval-augmented generation. Prompt engineering, vector search, dynamic pipelines.
Problem Solving
Deep grasp of algorithms and data structures — breaking complex challenges into manageable, efficient solutions.
Natural Language Processing
Sophisticated algorithms that understand, process, and generate human language. Linguistics meets machine learning.
AI Voice Assistants
Voice agents with ElevenLabs, Twilio, and Retell AI. Dynamic conversations, call routing, and lead qualification.
LangChain Development
Intelligent agents and tool-using workflows with memory, APIs, and vector stores. Fallback logic and chaining for production.
AI-Powered Data Pipelines
Scalable ETL pipelines enriched with ML, real-time processing, robust validation, and seamless cloud integration.
MLOps
Faster time-to-market, improved model governance, and stronger collaboration between data science and IT teams.
Advanced Data Analytics
Python, Pandas, Plotly — clear, actionable insights and visualizations that support data-driven decision-making.
n8n Workflow Automation
Powerful automation workflows connecting APIs, databases, and AI services. Streamlined business logic, scalable and maintainable.
AI Chatbot & Agent Design
Intelligent chat and voice agents with memory, tool use, and custom actions. Tailored to specific workflows.
Data Engineering
Apache Spark, Hadoop, SQL/NoSQL — scalable, fault-tolerant data architectures for modern analytics.
Cloud & DevOps for AI
AWS, GCP, DigitalOcean. Containerization and automation for scalable AI deployments and continuous integration.
What people say
Pankaj is a good communicator, understands requirements, and delivers on time. He has shown great proficiency with React, TypeScript, and Node.js — and mastered SVG and WYSIWYG work with us. He would be a great asset to any web design or web-based design application project. Funds permitting, I will definitely hire him in the future.
My work process
A simple, transparent six-step process from first conversation to deployment.
Discuss
Understanding your goals, constraints, and what success looks like for this project.
Ideate
Mapping the problem to the right architecture, tools, and approach, with trade-offs upfront.
Design
Drafting the system: data flow, model choices, integrations, and rollout plan.
Develop
Building iteratively with clean code, tests, and frequent check-ins so nothing surprises you.
Test
Validation across edge cases, performance, security, and the user experience.
Launch
Production deployment with monitoring, documentation, and a handoff that lasts.
Have an idea worth building?
Let's talk about AI engineering, agentic systems, RAG applications, or anything in the modern data stack. I'm always open to a good conversation.
