Skip to main content

Summary

Built production LLM systems for therapeutic chatbots, including prompt engineering, safety guardrails, and RAG pipelines using GPT-3.5/4 and Claude. Previously developed ML/NLP systems across fintech, healthcare, and quantitative research for over a decade.

Work primarily in Python with strong functional programming background (Clojure, Elixir). MSc Computational Linguistics, BSc Nuclear Physics.

Experience

Co-Founder & Technical Lead

founder
Fintilligence • Remote

Built fintech analytics platform for detecting asymmetric information flow using PIN/VPIN market microstructure measures

  • Implemented PIN/VPIN (Probability of Informed Trading/Volume-Synchronized PIN) measures for market microstructure analysis visualization →
  • Designed and deployed data acquisition pipelines processing Level 2 market data (order book, bid-ask spreads, trade flow) code ⟨/⟩
  • Managed PostgreSQL database architecture for high-frequency trade and quote data storage
  • Coordinated small team (3 people) across research, development, and UI design
  • Built investment advisory system highlighting potential information asymmetries and trading opportunities
Python PostgreSQL pandas Flask AWS EC2 AWS S3

CTO

contract
LevEhat NGO • Remote

Led technical operations for civic tech nonprofit, managing cloud migration and platform development

  • Managed migration from Google Cloud Platform to AWS infrastructure (cost optimization)
  • Coordinated UI designers and developers (team of 5) for volunteer management platform
  • Oversaw database architecture for volunteer tracking, task assignment, and activity logging
  • Established development priorities and technical roadmap for platform evolution
Python PostgreSQL AWS EC2 AWS S3 React

Independent Consultant

consulting
Various Clients • Remote

ML and data consulting: model development, pipeline architecture, statistical analysis

  • Projects: time-series forecasting, text classification, data pipeline optimization

Senior Researcher

full-time
Spring Research • Remote

ML models for trading signal generation using Level 2 market data and topology-inspired approaches

  • Developed ML models for trading signal generation using time-series analysis and statistical methods
  • Built data pipelines processing Level 2 market data (tick-by-tick, order book, market depth)
  • Researched topology-inspired approaches to market microstructure modeling
  • Implemented backtesting infrastructure for strategy evaluation
Python pandas numpy scikit-learn PostgreSQL AWS

Data Scientist

full-time
Nestlogic • Remote

Computer vision models for advertising optimization, A/B testing infrastructure, production ML deployment

  • Built computer vision models for advertising creative optimization (image feature extraction)
  • Implemented A/B testing infrastructure using statistical hypothesis testing (t-tests, chi-square)
  • Deployed ML models to production on Google Cloud Platform with Kubernetes
  • Developed analytics dashboards tracking model performance and business KPIs
Python PyTorch OpenCV PostgreSQL GCP Kubernetes

Data Scientist

full-time
Maverick Medical AI • Remote

Medical NLP system for clinical entity recognition, ontology frameworks, HIPAA-compliant data handling

  • Developed NLP system for medical named entity recognition in clinical text using spaCy and BiLSTM
  • Built medical ontology framework for standardizing terminology across different hospital systems
  • Created decision support tools for clinical workflows highlighting critical findings
  • Worked within HIPAA compliance requirements for healthcare data
Python spaCy PyTorch Flask PostgreSQL R Mathematica

Technical Skills

Programming

  • Python (10+ years production)
  • Clojure (10+ years)
  • Rust (5+ years)
  • Go (3+ years)
  • Elixir (2+ years)
  • Java, R, SQL (advanced), bash
  • Functional background: Common Lisp, Haskell

ML/AI

  • Frameworks: PyTorch, TensorFlow, Keras, scikit-learn, XGBoost, LightGBM, CatBoost
  • NLP: spaCy, NLTK, Hugging Face Transformers
  • Deep Learning: CNNs, RNNs, LSTMs, GRUs, Transformers, attention mechanisms, transfer learning, fine-tuning

Cloud & Infrastructure

  • AWS: EC2, S3, SageMaker, Lambda
  • GCP: GCE, GCS, GKE
  • Containers: Docker, Kubernetes

Databases

  • PostgreSQL (expert)
  • MongoDB, Redis, Neo4j, SQLite, MySQL

Data Engineering

  • Apache Spark, Apache Airflow, Kafka
  • ETL pipelines, data quality, orchestration

Web & APIs

  • Flask, FastAPI, Django
  • REST APIs, microservices architecture

DevOps

  • git, Linux, CI/CD (GitHub Actions, Jenkins)
  • Monitoring, logging

Domain Expertise

LLM Production Systems
3 years • Stamina AI
Therapeutic chatbots, safety systems, prompt engineering, RAG pipelines
Quantitative Finance
2+ years • Fintilligence, Spring Research
Level 2 market data, PIN/VPIN algorithms, order flow analysis, backtesting
Healthcare AI
1 year • Maverick Medical AI
Medical NER, clinical terminology, HIPAA compliance, decision support tools
Financial NLP
1 year • Athena Portfolio Solutions
SEC filings analysis, knowledge graphs, entity linking, sentiment analysis

Education

MSc Computational Linguistics

2016

Russian State University for the Humanities (RSUH) • Moscow, Russia

Statistical NLP, Machine Translation, Information Extraction

BSc Nuclear Physics

2011

Czech Technical University • Prague, Czech Republic

Mathematical Modeling, Statistical Analysis, Computational Physics

Languages

English (Fluent), Russian (Native), French (Conversational), German (Conversational), Czech (Conversational), Hebrew (Basic)