Applied AI | Multi-agent systems | RAG | Document intelligence

Building production-grade AI systems that get past the demo.

I am Manish, a Bangalore-based Lead AI Engineer focused on agent orchestration, retrieval systems, evaluation workflows, multimodal evidence, and cloud deployments for enterprise AI products.

Open for contract work and full-time AI engineering roles
Manish Sharma

About

Hands-on AI engineering with a builder's bias.

I work on the practical layers that make GenAI useful: workflow builders, retrieval, tool calling, structured evaluation, infrastructure, and the human loops around quality. Recent work spans pharma and life-sciences workflows, course discovery, datasheet intelligence, document extraction, and medical knowledge systems.

I like projects where research, product thinking, and shipping all sit at the same table. The sweet spot: taking ambiguous AI ideas and turning them into reliable systems that teams can run, measure, and improve.

Credentials

Research, education, and engineering credibility.

2021 - 2022

Research Assistant - Speech and Vision

Indian Institute of Science, Bangalore - SpireLabs

  • Worked on OCR and speech-recognition research pipelines with Hindi text and audio datasets.
  • Used PyTesseract, EasyOCR, Librosa, MelSpectrograms, and word-level accuracy evaluation.
  • Built the foundation for later production work in OCR, document AI, and multimodal retrieval.
2018 - 2022

B.Tech - Electronics and Communication Engineering

Nitte Meenakshi Institute of Technology

  • Graduated with 8.98 GPA.
  • Built a strong base in signal processing, systems, algorithms, and applied ML.
  • Moved from speech and vision research into production AI engineering roles.

Experience

Selected work

Dec 2022 - Dec 2023 Docsumo AI

Machine Learning Scientist

  • Designed and deployed document KV and table extraction systems using LayoutLM, BROS, and YOLO for fixed and unstructured documents.
  • Integrated ML and deep learning models into 10+ client APIs supporting $80K-$100K MRR workflows.
  • Built Chat-AI with LangChain and Pinecone for product QA and support tasks.
  • Reduced annotation time from one day to about two hours using GPT-4-powered KV extraction.
2021 - 2022 Indian Institute of Science, Bangalore

Research Assistant - Speech and Vision, SpireLabs

  • Worked with the Speech and Vision research group on OCR and ASR data preparation, extraction, and evaluation workflows.
  • Collated and preprocessed large Hindi datasets for OCR and speech-recognition experiments.
  • Used PyTesseract and EasyOCR for text extraction and evaluated OCR output using word-level accuracy.
  • Applied Librosa and MelSpectrogram-based audio processing for ASR experimentation.
Aug 2021 - Nov 2022 Zealth-AI (YC W21)

Machine Learning Engineer

  • Built OCR extraction and medical mapping flows for reports and prescriptions using Amazon Textract, RxNORM, MedXN, and SciSpacy.
  • Created a Neo4j Aura medical knowledge graph with 11K+ relationships and 3K+ nodes.
  • Used Rasa NLU to build a medical AI chatbot for symptom-based care management queries.

Projects

Side builds and experiments

01

RAG - Video-Based RAG System

Query YouTube URLs or uploaded videos, index chunks in Qdrant, retrieve relevant frames and timestamps, and answer with a Streamlit QA interface.

02

AutoCommit Generator

A fully local commit-message generator using Ollama and Mistral, packaged as a fast terminal workflow in under 100 lines of bash.

03

Company Scraper - AI-Powered Agent

A lightweight Relevance AI agent that turns company URLs into clean markdown briefs covering overview, products, features, audience, and integrations.

Skills

Agentic AI, platform engineering, and production ML stack.

Agent engineering

AgentsMulti-agent collaborationAgent orchestrationTask routingTool callingReusable agent nodesWorkflow compositionNo-code agent builderAgent deployment

Agent evaluation

Agent tracingEngineering harnessEvaluation harnessExperiment trackingComparative testingLLM-as-judgeTool-call evaluationCompliance scoringResponse fidelityAccuracy checksTask completion

RAG and retrieval

RAGMulti-agent RAGMultimodal RAGVector searchEmbeddingsSimilar-event retrievalQdrantFAISSPinecone

Models and frameworks

GPT-4oGeminiLlama 3.3 70BPyTorchHugging FaceLangChainLangGraphLlamaIndexStrands Agent

Document AI and NLP

OCRLayoutLMBROSYOLOTable extractionKV extractionSpaCySciSpacyMedXNRxNORMLibrosa

Cloud and deployment

AWSEKSECSBedrockAPI GatewayDockerKubernetesProduction deploymentsStaging/prod partitions

Data pipelines and DP

DatabricksData pipelinesSmartsheet Data VaultGPLM architectureDynamoDBS3GCSMySQLNeo4jHasura

Backend and product systems

PythonFastAPIFlaskAPI designBackend architectureWorkflow buildersAnnotation workflowsUser/FAM/QE flowsLinux

Builder tools

CursorOllamaOpenRouterModal LabsGoogle ColabKaggleGitHubTableauAmplitudeLovable

Contact

Need someone to build, evaluate, or rescue an AI workflow?

I am open to contract projects and full-time positions in applied AI, GenAI platforms, document intelligence, RAG, and agentic systems.

Email Manish Connect on LinkedIn

Bangalore, India | manish.tinkering@gmail.com