About Me
My Stats
Hey there! I'm Vineeth, a data scientist by trade, but my curiosity stretches far beyond the realm of code.
Ever since I was a kid, I've been fascinated by the universe's mysteries, absorbing everything I could about astrophysics and cosmology. This curiosity extends to the human experience, and philosophy has been a constant companion, helping me navigate life's big questions.
On a lighter note, I'm a huge sports fan (you can probably guess my favorite team by my terrible attempts at celebrating with air high-fives). And when I'm not pondering the cosmos or cheering on my team, you'll find me glued to the latest advancements in tech – that's where my passion for data science comes in.
Machine learning excites me because it lets me use data to build intelligent solutions for real-world problems. In particular, I'm interested in using AI to make a positive impact in the healthcare industry.
If you're interested in data science, AI, philosophy, the universe, or just want to chat about the latest tech breakthroughs (or even argue about sports!), feel free to reach out! I'm always up for a stimulating conversation.
40+ Projects Completed
2+ Years of Experience
30+ Happy Clients
15+ Technologies Proficient In
My Skills
Languages
Python, R, SQL, HTML/CSS, JavaScript, C, C++, Java
Frameworks & Libraries
PyTorch, TensorFlow, Keras, NumPy, Pandas, Matplotlib, Seaborn, Plotly, Tableau, MetaTF, spaCy, NLTK, OpenCV (cv2), PyRadiomics, scikit-learn, SciPy, Flask, LlamaIndex, Next.js, TypeScript, React, FastAPI, Streamlit
Tools & Platforms
Docker, Kubernetes, Tesseract, Textract, AWS, GCP, Azure, Google Speech-to-Text API, Heroku, Apache, QuantizeML, CNN2SNN, Chainlit, AWS Bedrock, AWS SageMaker, Vertex AI, Azure AI Studio, MLflow, Kubeflow, Hugging Face Transformers, Deepgram API, CrewAI, LlamaIndex, AstraDB, WordPress, Webflow, Zoho One, ClickUp, Miro, Git, GitHub, GitLab, Hadoop, Spark, Hive, NoSQL, MongoDB, ChromaDB, FAISS
Models, LLMs & Architectures
Mixtral 8x7B, Mistral 7B, Llama 3 8B, Llama 2, Phi-3, Gemma, Gemini, GPT-4, GPT-3.5, Grok, YOLOv5, YOLOv8, RoBERTa, LayoutLMv3, LiLT, BERT, XGBoost, Hough Lines Transform, T5, Longformer, ALBERT, VAEs, NCF, DPR, Llama 3.2 Vision, Whisper MLX, Kokoro-82M, Qwen 2.5
Concepts & Methodologies
RAG, Deep Learning, Natural Language Processing (NLP), LoRA (Low-Rank Adaptation), Spiking Neural Networks (SNNs), Generative Adversarial Networks (GANs), Transformer Architecture, Diffusion Models, Machine Learning, Computer Vision, Neural Networks, Large Language Models, Data Mining, Databases, SQL, Statistics, Graph Neural Networks (GNNs), Adversarial Machine Learning (PGD), Blockchain, Differential Privacy, Homomorphic Encryption, Digital Twins, ROS 2, Reinforcement Learning (RL), RLHF, Prompt Engineering, Bayesian Optimization, Gaussian Processes, Multimodal Models
My Timeline
Professional Experience
01/2025 - Present
Research Scientist | AI Engineer - Aion Labs
Fine-tuning diffusion models with art data/media, building RAG apps, and designing agentic workflows.
05/2024 - 08/2024
Data Scientist Intern - Nurjana Technologies
Developed real-time object detection models for space applications using spiking neural networks (SNNs), integrating QuantizeML and CNN2SNN.
02/2024 - 05/2024
Software Development Engineer | Data Scientist Intern - BambiHealth
Developed and deployed a speech-to-text solution using advanced speech-to-text (STT) APIs while enhancing backend stability through rigorous code reviews.
08/2022 - 05/2023
Associate Data Scientist - Foundation AI
Implemented document processing pipelines using Hough Lines Transform, YOLOv5, LiLT, BERT, RoBERTa, and LayoutLMv3.
05/2022 - 09/2022
Junior Data Scientist Intern - Zummit Infolabs
Applied CNNs and PyRadiomics for image segmentation and advanced feature extraction in medical imaging.
04/2021 - 07/2021
Entrepreneur In Residence - Stirring Minds
Led product development using WordPress, Webflow, Discord, and Notion; managed AWS EC2 and integrated marketing tools.
On-Campus Roles & Involvement
08/2024 - Present
Research Assistant - University of the Pacific
Conducting research in ML for cyber-physical security, release note classification, Virtual TA using RAG, exoplanet discovery, and multimodal RAG.
05/2024 - 07/2024
Graduate Teaching Assistant - Deep Learning with PyTorch, UOP
Mentored students in neural networks, model optimization, and deployment while leading interactive workshops.
06/2024 - 08/2024
Co-Lecturer - Summer Program, UOP
Taught Python, NumPy, Pandas, and various ML models through interactive, hands-on sessions.
08/2024 - 12/2024
Graduate Teaching Assistant - Socratic Lab, UOP
Facilitated seminars on Math for Data Science, ML, and Databases; provided mentoring and comprehensive grading.
Volunteering & Leadership
Mar 2024 - Present
President (Pacific Data Science & AI Club) - University of the Pacific
Organizes high-impact events including Data Science Connect, hands-on workshops on building agentic workflows, and engaging meetups with outstanding turnout.
01/2019 - 04/2021
Founder/Coordinator (CODE.EXE - Coding Club of GNIT) - Undergrad
Built and led a coding community by organizing seminars, workshops, and competitions focused on data structures and algorithms.
Academic Programs
08/2023 - 05/2025
Master of Science in Data Science - University of the Pacific
Focus on Advanced ML/Deep Learning, NLP, Data Engineering, and Statistics.
2018 - 2022
Bachelor of Technology in Computer Science & Engineering - JNTU
Covered Machine Learning, Data Structures, Data Mining, AI, Cyber Security, and more.
My Portfolio
My Work
Here is some of the work I've done across various programming languages.
Research Work

Diffusion-Based Model Fine-Tuning for Domain-Specific Art
Description: Using advanced diffusion models to generate art tailored to specific artistic styles, harnessing LoRA for efficient and memory-friendly fine-tuning.
Professor: Dr. Aurelia M. Davidson

Audit Logging for Cyber-Physical Systems with Machine Learning
Description: Mitigating adversarial effects in robot programming through audit logging and machine learning. The system safeguards against attacks, providing real-time feedback to ensure secure operations.
Professor: Dr. Sepehr Amir-Mohammadian

Release Notes Classification and Prioritization Using Deep Learning
Description: Classifying release notes based on keywords and context, prioritizing these updates, and using advanced deep learning models to build a recommendation engine for user upgrades.
Professor: Dr. Solomon Berhe

Virtual TA Using Retrieval-Augmented Generation
Description: Building an intelligent assistant that helps students with course material using RAG and fine-tuned open-source LLMs for security. The assistant answers students' questions in context, based on the professor's lecture materials.
Professor: Dr. Vivek Pallipuram

Exoplanet Discovery and Analysis
Description: Analyzing light-dimming data from stars to discover and characterize exoplanets, calculating planet size, gravity, and other characteristics from the stars' light fluctuations and rotational speed.
Professor: Dr. Daniel Jontof-Hutter
Favourite Books
My Favourites

The Fountainhead by Ayn Rand

Atlas Shrugged by Ayn Rand

Thus Spoke Zarathustra by Friedrich Nietzsche

Man's Search for Meaning by Viktor Frankl

Meditations by Marcus Aurelius

The Almanack of Naval Ravikant
Favourite Sports
My Favourites

Football

Cricket

Tennis

Basketball

Formula 1
Contact Me
Let's connect!
Hey there! This is Vineeth, and I'd love to hear from you.
Whether you have a question, a project in mind, or just want to connect, feel free to reach out.
I'm here to learn!
San Francisco, CA
vineethsai4444@gmail.com
University of the Pacific, SF
English, Hindi, Telugu, French