← Back to portfolio

Ashesh Kaji

New York, NY · ashesh8500@gmail.com · ask9184@nyu.edu · +1 (858) 308-6586 · asheshkaji.com · github.com/ashesh8500
Education
MS in Computer Engineering
NYU Tandon School of Engineering
01/2026 – 12/2027 (expected)
Coursework: System Optimization Methods, Computer Systems Architecture, Special Topics in Computer Engineering. Research focus: portfolio optimization as layered control, convex optimization, reinforcement learning for financial decision-making.
BS with Honors in Cognitive Science
Specialization in Machine Learning and Neural Computation · UC San Diego
09/2021 – 06/2025
Honors thesis on statistical modeling of environmental exposures and neuroimaging data. Coursework: Deep Learning, Reinforcement Learning, Probability & Statistics, Linear Algebra, Algorithms.
Work Experience
Consulting AI Engineer
SageX Global · Remote
01/2026 – Present
  • Architect and deploy production AI infrastructure: LLM serving pipelines, MCP (Model Context Protocol) servers, and agent orchestration systems with LangChain and AI-SDK
  • Design custom data transformation and fine-tuning strategies across multi-modal data lifecycle requirements
  • Build and maintain ACP-compliant agent communication protocols for distributed AI systems
LangChainAI-SDKACPMCP ServersPythonRustDockerCloudflare WorkersAzureAWS
Artificial Intelligence Engineer
SageX Global · Remote
09/2024 – 01/2026 · 1 year full-time
  • Led development of multiple production AI products: LLMs, small language models (SLMs), statistical mapping models, and semantic database memory retrieval systems
  • Designed end-to-end data pipelines for fine-tuning and deploying models with rigorous evaluation methodology
  • Built RAG systems integrating vector databases with serverless deployment on Azure and AWS
PyTorchScikit-LearnAzureAWSMLOpsNLPRAGLLMsVector Databases
Machine Learning Intern
UniQreate · Remote
09/2023 – 07/2024
  • Built and deployed a production RAG system for document intelligence using LLMs, vector databases, and serverless cloud infrastructure on Azure
  • Designed data extraction pipelines with rapid versioning, architecture design, and production testing cycles
PythonPyTorchAzure (Blob + Serverless)RAGLLMsVector DBs
Undergraduate Research Assistant
UC San Diego · Dr. Mary Boyle's Lab · San Diego, CA
10/2022 – 06/2025
  • Applied statistical modeling (regression, causal inference) to analyze UK BioBank and ABCD Study datasets for relationships between biomarkers and neuroimaging outcomes
  • Developed reproducible analysis pipelines in Python using NumPy, Pandas, and Statsmodels for epidemiology research
PythonNumPyPandasStatsmodelsStatistical ModelingEpidemiology
Research & Projects
Portfolio Allocation as Layered System Optimization
End-to-end quantitative research pipeline for portfolio construction. Designed a walk-forward backtesting engine with convex quadratic programming (Markowitz with ℓ₁ turnover penalty), regime detection via statistical jump models, and a policy-gradient reinforcement learning router for dynamic strategy selection. Modeled multi-agent competition for liquidity as a Nash equilibrium problem using iterative best-response with crowding penalties. Validated across top-100, top-250, and top-500 equity universes with provenance-ledger tracking to eliminate look-ahead bias.
Tech: Python, cvxpy, NumPy/SciPy, PyTorch, PufferLib, Nautilus Trader, D3.js
Research presentation ↗  ·  Interactive D3 demo ↗  ·  github.com/ashesh8500/fractal ↗
zkPHIRE: Zero-Knowledge Proof Accelerator on FPGA
Implemented a paper-parity hardware accelerator for the zkPHIRE SumCheck protocol on Xilinx PYNQ-Z2 (Zynq-7020). Designed Montgomery multiplier units to reduce DSP utilization from 200+ to 8-12 per multiplier. Built a fused update+extension pipeline with banked scratchpads for memory-efficient MLE evaluation. Optimized resource allocation to fit within 220 DSP and 280 BRAM constraints using Vitis HLS and SystemVerilog.
Tech: Vitis HLS, SystemVerilog, C++, Python, Vivado, FPGA (Xilinx Zynq-7020)
Independent research project — paper reference: zkPHIRE (HPCA 2026)
RL-Based Autonomous Driving
Deep reinforcement learning for autonomous driving using the CARLA simulator. Implemented and compared policy gradient methods, DQN variants, and sim-to-real transfer strategies with real-world sensor data integration.
Tech: Python, PyTorch, CARLA, OpenAI Gym
github.com/ashesh8500/fp185 ↗
MediaSync — High-Performance Media Pipeline
Local-first media library pipeline: content-aware deduplication with SHA-256, hardware-accelerated transcoding to MP4, and batched cloud upload. Built in Rust with a terminal UI for large-scale photo/video library management.
Tech: Rust, FFmpeg, rclone, terminal UI
github.com/ashesh8500/mediasync ↗
Personal Website with On-Device AI Companion
Portfolio site with a browser-native AI assistant running a 1-bit quantized Bonsai-1.7B ONNX model on WebGPU. Built a Cloudflare Worker proxy for DeepSeek API with rate limiting and SSE streaming. Inference toggle between local WebGPU and cloud models.
Tech: JavaScript, ONNX Runtime Web, WebGPU, Cloudflare Workers, Transformers.js
asheshkaji.com ↗
Technical Skills
Languages:
PythonRustC/C++SystemVerilogJavaScriptSQLBash
ML & Quantitative Research:
PyTorchScikit-LearnNumPy/SciPyPandascvxpyStatsmodels PufferLib (RL)Nautilus TraderReinforcement Learning Convex OptimizationWalk-Forward BacktestingD3.js
FPGA & Hardware:
Vitis HLSVivadoXilinx ZynqFPGA DesignONNX RuntimeWebGPU
Infrastructure & MLOps:
DockerGitAWSAzureCloudflare Workers LangChainAI-SDKMCP/ACPRAGVector Databases LLMsGitHub Actions
Languages
English — Native/Bilingual Gujarati — Native/Bilingual Hindi — Native/Bilingual