Welcome to Vineeth Sai's World of Data!

Explore my Portfolio Website.

Hey there! I'm Vineeth, a data scientist with a passion for exploring the mysteries of the universe and using AI to make a positive impact. Whether it's discussing philosophy, cheering on my favorite sports team or diving into the latest tech, I'm always up for an engaging conversation.

About memy stats

Hey there! I'm Vineeth, a data scientist by trade, but my curiosity stretches far beyond the realm of code.

Ever since I was a kid, I've been fascinated by the universe's mysteries, absorbing everything I could about astrophysics and cosmology. This curiosity extends to the human experience, and philosophy has been a constant companion, helping me navigate life's big questions.

On a lighter note, I'm a huge sports fan (you can probably guess my favorite team by my terrible attempts at celebrating with air high-fives). And when I'm not pondering the cosmos or cheering on my team, you'll find me glued to the latest advancements in tech – that's where my passion for data science comes in.

Machine learning excites me because it allows me to use data to create intelligent solutions for real-world problems. Particularly, I'm interested in using AI to make a positive impact in the healthcare industry.

If you're interested in data science, AI, philosophy, the universe, or just want to chat about the latest tech breakthroughs (or even argue about sports!), feel free to reach out! I'm always up for a stimulating conversation.

40+

Projects
Completed

2+

Years of
experience

30+

Happy
Clients

15+

Technologies
Proficient In

My Skills

Languages

Python, R, SQL, HTML/CSS, JavaScript, C, C++, Java

Frameworks & Libraries

PyTorch, TensorFlow, Keras, NumPy, Pandas, Matplotlib, Seaborn, Plotly, Tableau, MetaTF, spaCY, NLTK, cv2, PyRadiomics, Scikit-Learn, SciPy, Flask, Llama Index, NextJS, TypeScript, React, FastAPI, Streamlit

Tools & Platforms

Docker, Kubernetes, Tesseract, Textract, AWS, GCP, Azure, Google Speech-to-Text API, Heroku, Apache, QuantizeML, CNN2SNN, Chainlit, AWS Bedrock, AWS SageMaker, VertexAI, Azure AI Studio, MLflow, Kubeflow, Hugging Face Transformers, Deepgram API, CrewAI, LlamaIndex, AstraDB, WordPress, Webflow, ZOHO One, Click-Up, Miro, Git, GitHub, GitLab, Hadoop, Spark, Hive, NoSQL, MongoDB, ChromaDB, FAISS

Models, LLMs & Architectures

Mixtral 8x7B, Mistral 7B, Llama3 8B, LLaMA 2, Phi3, Gemma, Gemini, GPT 4, GPT 3.5, Grok, YOLO v5, YOLO v8, RoBERTa, LayoutLM v3, Lilt, BERT, XGBoost, Hough Lines Transform, T5, Longformer, ALBERT, VAEs, NCF, DPR, Llama 3.2 Vision, Whisper MLX, Kokoro-82M, Qwen 2.5

Concepts & Methodologies

RAG, Deep Learning, Natural Language Processing (NLP), LoRA (Low Rank Adaptation), Spiking Neural Network, Generative Adversarial Networks (GAN), Transformer Architecture, Diffusion Models, Machine Learning, Computer Vision, Neural Networks, Large Language Models, Data Mining, Database, SQL, Statistics, Graph Neural Networks (GNNs), Adversarial Machine Learning (PGD), Blockchain, Differential Privacy, Homomorphic Encryption, Digital Twin, ROS 2, Reinforcement Learning (RL), RLHF, Prompt Engineering, Bayesian Optimization, Gaussian Processes, Multimodal Models

My Timeline

Professional Experience

01/2025 - Present

Research Scientist | AI Engineer - Aion Labs

Fine-tuning diffusion models with art data/media, building RAG apps, and designing agentic workflows.

05/2024 - 08/2024

Data Scientist Intern - Nurjana Technologies

Developed real-time object detection models for space applications using SNN, and integrated QuantizeML & CNN2SNN.

02/2024 - 05/2024

Software Development Engineer | Data Scientist Intern - BambiHealth

Developed and deployed a speech-to-text solution using advanced TTS APIs while enhancing backend stability through rigorous code reviews.

08/2022 - 05/2023

Associate Data Scientist - Foundation AI

Implemented document processing pipelines using Hough Lines Transform, YOLO V5, Lilt, BERT, RoBERTa, and LayoutLM v3.

05/2022 - 09/2022

Junior Data Scientist Intern - Zummit Infolabs

Applied CNNs and PyRadiomics for image segmentation and advanced feature extraction in medical imaging.

04/2021 - 07/2021

Entrepreneur In Residence - Stirring Minds

Led product development using WordPress, Webflow, Discord, and Notion; managed AWS EC2 and integrated marketing tools.

On-Campus Roles & Involvement

08/2024 - Present

Research Assistant - University of the Pacific

Conducting research in ML for cyber-physical security, release note classification, Virtual TA using RAG, exoplanet discovery, and multimodal RAG.

05/2024 - 07/2024

Graduate Teaching Assistant - Deep Learning with PyTorch, UOP

Mentored students in neural networks, model optimization, and deployment while leading interactive workshops.

06/2024 - 08/2024

Co-Lecturer - Summer Program, UOP

Taught Python, NumPy, Pandas, and various ML models through interactive, hands-on sessions.

08/2024 - 12/2024

Graduate Teaching Assistant - Socratic Lab, UOP

Facilitated seminars on Math for Data Science, ML, and Databases; provided mentoring and comprehensive grading.

Volunteering & Leadership

Mar 2024 - Present

President (Pacific Data Science & AI Club) - University of the Pacific

Organizes high-impact events including Data Science Connect, hands-on workshops on building agentic workflows, and engaging meetups with outstanding turnout.

01/2019 - 04/2021

Founder/Coordinator (CODE.EXE - Coding Club of GNIT) - Undergrad

Built and led a coding community by organizing seminars, workshops, and competitions focused on data structures and algorithms.

Academic Programs

08/2023 - 05/2025

Master of Science in Data Science - University of the Pacific

Focus on Advanced ML/Deep Learning, NLP, Data Engineering, and Statistics.

2018 - 2022

Bachelor of Technology in Computer Science & Engineering - JNTU

Covered Machine Learning, Data Structures, Data Mining, AI, Cyber Security, and more.

My PortfolioMy Work

Here is some of my work that I've done in various programming languages.

AI Presentation Assistant

AI Presentation Assistant

AgenticRAG Research Assistant

AgenticRAG Research Assistant

AI Research Assistant

AI Research Assistant

AI-Powered Doc Crawler

AI-Powered Doc Crawler

Speech-to-Speech Chatbot

Speech-to-Speech Chatbot

Research Interpreter

Research Interpreter

Personalized Voice Assistant

Personalized Voice Assistant

Personalized Voice Assistant

Brain Tumor Detection

Research Interpreter

Text Abstractor

Text To Speech

Athletes Analysis

Encryption

Portfolio Website

My Blog

Research WorkResearch Work

Diffusion-Based Model Fine-Tuning for Art Media

Diffusion-Based Model Fine-Tuning for Domain-Specific Art

Description: Using advanced diffusion models to generate art tailored to specific artistic styles, harnessing LoRA for efficient and memory-friendly fine-tuning.

Professor: Dr. Aurelia M. Davidson

Audit Logging for Cyber-Physical Systems with Machine Learning

Description: Mitigating adversarial effects in robot programming through audit logging and machine learning. The system safeguards against attacks, providing real-time feedback to ensure secure operations.

Professor: Dr. Sepehr Amir-Mohammadian

Release Notes Classification and Prioritization Using Deep Learning

Description: Classifying release notes based on key words and context, prioritizing these updates, and using advanced deep learning models to build a recommendation engine for user upgrades.

Professor: Dr. Solomon Berhe

Virtual TA Using Retrieval-Augmented Generation

Description: Building an intelligent assistant that can help students with course material using RAG and fine-tuned open-source LLM models for security. This assistant answers students' questions contextually based on professor lecture materials.

Professor: Dr. Vivek Pallipuram

Exoplanet Discovery and Analysis

Description: Analyzing light dimming data from stars to discover and characterize exoplanets, calculating planet size, gravity, and other characteristics based on the light fluctuations and rotational speed of the stars.

Professor: Dr. Daniel Jontof-Hutter

Favourite BooksMy Favourites

The Fountainhead by Ayn Rand

Atlas Shrugged by Ayn Rand

Thus Spoke Zarathustra by Friedrich Nietzsche

Man's Search for Meaning by Victor Frankl

Meditations by Marcus Aurelius

The Almanack Of Naval Ravikant

Favourite SportsMy Favourites

Football

Cricket

Tennis

Basketball

Formula 1

Contact MeContact

Let's connect!

Hey there! This is Vineeth, and I'd love to hear from you.

Whether you have a question, a project in mind, or just want to connect, feel free to reach out.

I'm here to learn!

San Francisco, CA

vineethsai4444@gmail.com

University of the Pacific, SF

English, Hindi, Telugu, French