dakshtrehan/README.md

Forward Deployed Engineer at Abacus.AI. Previously 5 years at kipi.ai shipping production AI for ConocoPhillips, Gameday Men's Health, Germania Insurance, and Blue Yonder.

I build AI systems that hold up under regulated-industry scrutiny: RAG pipelines that compliance teams can sign off on, document AI that scales to 2,000+ PDFs, fraud detection that runs on 10M+ claims with full SHAP explainability.

When the work is interesting and the constraints are real, I write about it. 50+ articles across Towards Data Science, TowardsAI, and DataDrivenInvestor with 100K+ collective reads.


What I'm shipping right now

🛡️ ragcompliance — Drop-in audit-trail middleware for LangChain and LlamaIndex RAG pipelines. SHA-256-signed audit logs, opt-in PII / PHI redaction, SOC 2 evidence export. MIT licensed. 171 tests passing. PyPI

🔬 PromptDrift — Side-by-side prompt versioning and regression tracking for LLM engineers. Token-level diff with output comparison across models.

🎵 SWAR — AI music composition studio with an Artist Skill Library. Bilingual lyric generation, Tone.js melodies, Suno integration.

📈 AI Paper Trading System — Automated daily signals across 162 stocks via the Anthropic API, virtual ₹10L portfolio, GitHub Actions running every trading day.


Stack and platforms

  • AI systems: LangChain · LangGraph · Snowflake Cortex AI · RAG · agentic workflows · Anthropic API · Abacus SDK · AbacusAI
  • ML and DS: PyTorch · TensorFlow · Scikit-learn · SHAP · ensemble models · NLP · LLM
  • Data: Snowflake (6× certified) · DuckDB · FAISS · Apache Iceberg · ETL pipelines · row-level security at scale · Databricks
  • Engineering: Python · SQL · GitHub Actions · Streamlit · FastAPI · Supabase

Recognition

  • 6× Snowflake certified (incl. SnowPro Advanced Data Scientist, Architect, Data Engineer, GenAI Specialty)
  • Technical reviewer, TinyML Cookbook and Machine Learning Engineering with Python (Packt Publishing)
  • M.Tech Data Science, BITS Pilani — thesis on Smart Search using SBERT and ANNOY


Open to interesting side projects, especially ones involving regulated-industry AI, compliance tooling, or anything where the constraints are real and the data is messy.

Pinned

  1. ragcompliance

    Audit trail middleware for RAG pipelines in regulated industries. Drop-in LangChain and LlamaIndex callback handler with SHA-256 chain signatures, Supabase row-level security, and SOC 2 evidence ex…

    Python 1

  2. Covid-19-Detection

    The aim of this project is to help medical practitioners in a practical way to combat the pandemic.

    Jupyter Notebook 11 16

  3. Machine-Learning-Bootcamp

    Study material for machine learning beginners: all of my articles, plus code for each algorithm, including from-scratch implementations.

    Jupyter Notebook 6 2

  4. White-box-Cartoonization

    Forked from SystemErrorWang/White-box-Cartoonization

    Official TensorFlow implementation for CVPR2020 paper “Learning to Cartoonize Using White-box Cartoon Representations”

    Jupyter Notebook 26 8

  5. Interactive-Covid-19-Dashboard

    The project aims at real-time COVID-19 data analysis, recovery-rate and death-rate prediction, and an AI-powered smart chatbot, all using machine learning techniques.

    Python 12 7

  6. AI-Music-Generation

    Predicting music pieces using LSTMs. MIDI files are used to generate and predict musical tones.

    Jupyter Notebook 8 2