Welcome to Vineeth Sai's World of Data!

Explore my Portfolio Website.

Hey there! I'm Vineeth, a data scientist with a passion for exploring the mysteries of the universe and using AI to make a positive impact. Whether it's discussing philosophy, cheering on my favorite sports team or diving into the latest tech, I'm always up for an engaging conversation.

About memy stats

Hey there! I'm Vineeth — an ML/AI Engineer and Data Scientist with curiosity far beyond code.

I’ve always been fascinated by the universe, astrophysics, and cosmology, while philosophy helps me explore life’s bigger questions. I love using machine learning to build solutions for real-world problems, especially in science and space research.

Outside of work, you’ll find me following sports highlights or geeking out over tech breakthroughs.

If you’re into AI, data, the universe, or sports, let’s connect!

40+

Projects Completed

2+

Years of Experience

30+

Happy Clients

15+

Technologies Proficient In

My Skills

Languages

Python, R, SQL, HTML/CSS, JavaScript, C, C++, Java

Frameworks & Libraries

PyTorch, TensorFlow, Keras, NumPy, Pandas, Matplotlib, Seaborn, Plotly, Tableau, MetaTF, spaCY, NLTK, cv2, PyRadiomics, Scikit-Learn, SciPy, Flask, Llama Index, NextJS, TypeScript, React, FastAPI, Streamlit

Tools & Platforms

Docker, Kubernetes, Tesseract, Textract, AWS, GCP, Azure, Google Speech-to-Text API, Heroku, Apache, QuantizeML, CNN2SNN, Chainlit, AWS Bedrock, AWS SageMaker, VertexAI, Azure AI Studio, MLflow, Kubeflow, Hugging Face Transformers, Deepgram API, CrewAI, LlamaIndex, AstraDB, WordPress, Webflow, ZOHO One, Click-Up, Miro, Git, GitHub, GitLab, Hadoop, Spark, Hive, NoSQL, MongoDB, ChromaDB, FAISS

Models, LLMs & Architectures

Mixtral 8x7B, Mistral 7B, Llama3 8B, LLaMA 2, Phi3, Gemma, Gemini, GPT 4, GPT 3.5, Grok, YOLO v5, YOLO v8, RoBERTa, LayoutLM v3, Lilt, BERT, XGBoost, Hough Lines Transform, T5, Longformer, ALBERT, VAEs, NCF, DPR, Llama 3.2 Vision, Whisper MLX, Kokoro-82M, Qwen 2.5

Concepts & Methodologies

RAG, Deep Learning, Natural Language Processing (NLP), LoRA (Low Rank Adaptation), Spiking Neural Network, Generative Adversarial Networks (GAN), Transformer Architecture, Diffusion Models, Machine Learning, Computer Vision, Neural Networks, Large Language Models, Data Mining, Database, SQL, Statistics, Graph Neural Networks (GNNs), Adversarial Machine Learning (PGD), Blockchain, Differential Privacy, Homomorphic Encryption, Digital Twin, ROS 2, Reinforcement Learning (RL), RLHF, Prompt Engineering, Bayesian Optimization, Gaussian Processes, Multimodal Models

My Timeline

Professional Experience

05/2025 - Present

Data Scientist- Google

AgentSpace Product

01/2025 - 06/2025

Research Scientist | AI Engineer - Aion Labs

Fine-tuning diffusion models with art data/media, building RAG systems and AI agentic workflows.

05/2024 - 08/2024

Data Scientist Intern - Nurjana Technologies

Developed real-time object detection models for space applications using SNN, and integrated QuantizeML & CNN2SNN.

02/2024 - 05/2024

Software Development Engineer | Data Scientist Intern - BambiHealth

Developed and deployed a speech-to-text solution using advanced TTS APIs while enhancing backend stability through rigorous code reviews.

08/2022 - 05/2023

Associate Data Scientist - Foundation AI

Implemented document processing pipelines using Hough Lines Transform, YOLO V5, Lilt, BERT, RoBERTa, and LayoutLM v3.

05/2022 - 09/2022

Junior Data Scientist Intern - Zummit Infolabs

Applied CNNs and PyRadiomics for image segmentation and advanced feature extraction in medical imaging.

04/2021 - 07/2021

Entrepreneur In Residence - Stirring Minds

Led product development using WordPress, Webflow, Discord, and Notion; managed AWS EC2 and integrated marketing tools.

On-Campus Roles & Involvement

08/2024 - Present

Research Assistant - University of the Pacific

Conducting research in ML for cyber-physical security, release note classification, Virtual TA using RAG, exoplanet discovery, and multimodal RAG.

05/2024 - 07/2024

Graduate Teaching Assistant - Deep Learning with PyTorch, UOP

Mentored students in neural networks, model optimization, and deployment while leading interactive workshops.

06/2024 - 08/2024

Co-Lecturer - Summer Program, UOP

Taught Python, NumPy, Pandas, and various ML models through interactive, hands-on sessions.

08/2024 - 12/2024

Graduate Teaching Assistant - Socratic Lab, UOP

Facilitated seminars on Math for Data Science, ML, and Databases; provided mentoring and comprehensive grading.

Volunteering & Leadership

Mar 2024 - Present

President (Pacific Data Science & AI Club) - University of the Pacific

Organizes high-impact events including Data Science Connect, hands-on workshops on building agentic workflows, and engaging meetups with outstanding turnout.

01/2019 - 04/2021

Founder/Coordinator (CODE.EXE - Coding Club of GNIT) - Undergrad

Built and led a coding community by organizing seminars, workshops, and competitions focused on data structures and algorithms.

Academic Programs

08/2023 - 05/2025

Master of Science in Data Science - University of the Pacific

Focus on Advanced ML/Deep Learning, NLP, Data Engineering, and Statistics.

2018 - 2022

Bachelor of Technology in Computer Science & Engineering - JNTU

Covered Machine Learning, Data Structures, Data Mining, AI, Cyber Security, and more.

My PortfolioMy Work

Here is some of my work that I've done in various programming languages.

AI Presentation Assistant

AI Presentation Assistant

AgenticRAG Research Assistant

AgenticRAG Research Assistant

AI Research Assistant

AI Research Assistant

AI-Powered Doc Crawler

AI-Powered Doc Crawler

Speech-to-Speech Chatbot

Speech-to-Speech Chatbot

Research Interpreter

Research Interpreter

Personalized Voice Assistant

Personalized Voice Assistant

Personalized Voice Assistant

Brain Tumor Detection

Research Interpreter

Text Abstractor

Text To Speech

Athletes Analysis

Encryption

Portfolio Website

My Blog

Research WorkResearch Work

Diffusion-Based Model Fine-Tuning for Art Media

Diffusion-Based Model Fine-Tuning for Domain-Specific Art

Description: Using advanced diffusion models to generate art tailored to specific artistic styles, harnessing LoRA for efficient and memory-friendly fine-tuning.

Professor: Dr. Aurelia M. Davidson

Audit Logging for Cyber-Physical Systems with Machine Learning

Description: Mitigating adversarial effects in robot programming through audit logging and machine learning. The system safeguards against attacks, providing real-time feedback to ensure secure operations.

Professor: Dr. Sepehr Amir-Mohammadian

Release Notes Classification and Prioritization Using Deep Learning

Description: Classifying release notes based on key words and context, prioritizing these updates, and using advanced deep learning models to build a recommendation engine for user upgrades.

Professor: Dr. Solomon Berhe

Virtual TA Using Retrieval-Augmented Generation

Description: Building an intelligent assistant that can help students with course material using RAG and fine-tuned open-source LLM models for security. This assistant answers students' questions contextually based on professor lecture materials.

Professor: Dr. Vivek Pallipuram

Exoplanet Discovery and Analysis

Description: Analyzing light dimming data from stars to discover and characterize exoplanets, calculating planet size, gravity, and other characteristics based on the light fluctuations and rotational speed of the stars.

Professor: Dr. Daniel Jontof-Hutter

Favourite BooksMy Favourites

The Fountainhead by Ayn Rand

Atlas Shrugged by Ayn Rand

Thus Spoke Zarathustra by Friedrich Nietzsche

Man's Search for Meaning by Victor Frankl

Meditations by Marcus Aurelius

The Almanack Of Naval Ravikant

Favourite SportsMy Favourites

Football

Cricket

Tennis

Basketball

Formula 1

Contact MeContact

Let's connect!

Hey there! I’m Vineeth — Data Scientist at Google, passionate about solving problems, building things, and exploring ideas.

Have a project, question, or just want to chat tech? Fill out the form or connect with me below.

San Francisco, CA

Data Scientist @ Google

vineethsai4444@gmail.com

English, Hindi, Telugu, French

GitHub LinkedIn Twitter LeetCode Kaggle HackerRank GeeksforGeeks