Home About Experience Projects Skills Certifications Contact
Available for opportunities

Hello, I'm Parth Panchal AI/ML Engineer

Building production-grade LLM, RAG, and Voice AI systems. Specialized in ML/GenAI pipelines, low-latency inference, and cloud deployment with hands-on ownership from experimentation to production.

0 + Years Experience
0 + AI/ML Projects
0 + Certifications
ai_engineer.py
class AIEngineer:
    def __init__(self):
        self.name = "Parth Panchal"
        self.skills = [
            "LLM", "RAG", "Voice AI",
            "PyTorch", "LangChain",
            "Azure", "MLOps"
        ]
        self.analytical_thinker = True
        self.startup_ready = True
    
    def build_ai_systems(self):
        return "Production Ready"
Scroll to explore

About Me

Parth Panchal - AI Engineer
3+ Years of
Experience

Engineering the Next Generation of Intelligent Systems

AI/ML Engineer with around 3+ years of experience building production-grade LLM, RAG, and AI/ML systems. Specialized in ML/GenAI pipelines, low-latency inference, and cloud deployment, with hands-on ownership from experimentation to production in startup environments.

I enjoy working in a role that offers diverse challenges, fosters innovation, and allows me to collaborate on cutting-edge AI/ML projects.I thrive in fast-paced environments where I can build and ship AI products that make a real impact.

Voice AI Systems

STT, TTS, Real-time LLM Integration

RAG & LLMs

LangChain, LangGraph, Prompt Engineering

Cloud & MLOps

Azure, AWS, GCP, Docker, MLflow

MLOps & Infra

Docker & Low-latency inference serving.

Work Experience

Machine Learning Engineer

Rootle AI — Voice AI Product Startup

Aug 2025 - Present
  • Built and optimized real-time voice AI platform integrating STT, TTS, LLMs, and retrieval systems
  • Designed automated multi-document ingestion and RAG pipeline supporting PDFs, PPTs, and web URLs.
  • Implemented NVIDIA NV-Ingest for large-scale, heterogeneous knowledge sources
  • Optimized PyTorch-based inference pipelines, reducing end-to-end latency by ~25%
  • Fine-tuned and deployed BERT-based intent classifiers for low-latency real-time voice calls
Voice AI LLM RAG PyTorch NV-Ingest BERT

Machine Learning Engineer

Cilans System

May 2023 - Aug 2025
  • Researched, modified, and deployed ML prototypes across multiple client projects
  • Designed and deployed end-to-end ML and GenAI pipelines, reducing development turnaround time by ~30-40%
  • Built LLM-powered document understanding and search workflows using open-source models
  • Collaborated directly with clients to deliver customized ML and GenAI solutions
  • Performed exploratory data analysis and feature engineering to improve model performance
Azure AI LangChain TensorFlow fine-tuning GenAI

Data Science Intern

Kron Labs

Nov 2022 - May 2023
  • Supported data scientists, BI developers, and analysts across data analysis and ML initiatives
  • Developed computer vision model for deployment on NVIDIA Jetson Nano devices
  • Collaborated using Git-based workflows for version control and task management
Computer Vision NVIDIA Jetson Python Machine Learning Git

Featured Projects

Production-grade AI/ML projects showcasing expertise in LLMs, RAG, and cloud deployment

Skills & Technologies

Languages

Python SQL MySQL PostgreSQL

Machine Learning & Deep Learning

PyTorch TensorFlow Keras Scikit-learn XGBoost

GenAI & LLMs

LangChain LangGraph RAG Prompt Engineering Tool Calling NV-Ingest

MLOps & Orchestration

MLflow DVC ZenML GitHub Actions Docker n8n

Cloud & AI Services

Azure AI Search Azure ML Azure Functions AWS GCP Vertex AI ElevenLabs

Certifications & Achievements

Latest Articles

Sharing knowledge on AI/ML engineering, best practices, and industry insights

Let's Work Together

Have an AI/ML project in mind? I'm always open to discussing innovative projects, startup opportunities, and tech collaborations.