Ritesh Ratti
AI & Data Science Director with PhD in AI & Data Science
About Me

AI & Data Science Leader with 15+ years of experience in building cutting-edge machine learning products at global technology companies. Expert in Large Language Models (LLMs), Generative AI, and Deep Learning. Experienced in managing cross-functional teams to deliver AI-based products across APAC and European markets.
Experience
15+ years across global tech companies
EY
Director - AI and Data Science
AI x Data Centre of Excellence
- Leading cross-functional teams with Data Scientists, AI Engineers, and Developers to build software solutions
- Driving strategic team performance by implementing quarterly OKRs, prioritizing sprints, and developing project roadmaps while maintaining stakeholder-aligned backlogs
- Co-Lead for EY AI Platform powering Agentic AI solutions in Chatbot, Voice communication, and Predictive analytics domain
- Leading AI & ML projects in Gen AI, Agentic AI and ML space with consistent feedback delivery for AI project execution
Temus
Temasek owned Digital Transformation Company
Manager - AI and Data Science
AI x Data Centre of Excellence
- Led cross-functional AI and Data engineering team providing technical architecture expertise, code review support, and development mentorship
- Delivered Voice-to-Voice Sales Representative Engine using Azure STT, TTS, and OpenAI-4o, reducing onboarding time by 20% for 50+ concurrent users on Kubernetes
- Spearheaded VLM models for defect analysis and website migration error detection, improving defect identification accuracy by 15%
- Led Document Content Extraction system using VLM models to extract text, images, charts, and tables with 95% accuracy
Click to expand
HelloFresh
Data Science Lead
Personalization Team
- Technical Leader for personalization team, directing data scientists and ML engineers for end-to-end delivery of ML-based personalization products
- Spearheaded Recipe Recommendation and Recipe Ranking solutions using Random Forest for US and Canadian markets achieving 40% average recall@k
- Engineered Cold-start solution using vector database for similarity search and LLM-based Recipe recommendation for new customers
- Pioneered Generative AI solutions using Stable Diffusion for personalized marketing assets including customized taglines and banner images
Click to expand
Delivery Hero
Senior Data Scientist
Food Data Science Team
- Directed team of 3 data scientists, overseeing project management for food data science division with close regional team collaboration
- Designed multilingual Distil BERT for Food item categorization across 9 EU and 5 APAC countries achieving 80% weighted F1-score
- Delivered Multimodal dish classification using BERT + MobileNet with early fusion techniques, achieving 4-5% performance improvement
- Engineered Community Detection using graph analytics for customer segmentation, resulting in 2% increase in net revenue across Singapore and Dubai
- Architected Food Image classification using MobileNetV2 and EfficientNet for 500 food classes with 75% accuracy, plus Siamese networks for placeholder detection
Click to expand
Grab
Senior Data Scientist
Data Science Team
- Lead for data science solution development, mentoring team members and collaborating with managers to establish roadmaps for data science initiatives
- Spearheaded computer vision projects including Image Correction and Mart Item Tagging, leading Eater-side recommendation initiatives on big data infrastructure
- Implemented Food Mart Item categorization using Distil BERT for multi-label, multi-class predictions across 6 countries achieving 70% accuracy
- Engineered Food-Nonfood detector (75% accuracy) using MobileNetV2 and Multi-watermark detection (70% accuracy) using ResNet50, enhanced with YOLO-v3
- Developed Up-selling recommendations using FP Growth on Spark MLLib improving AOV by 5%, and Learning to Rank model improving NDCG by 1-2%
- Built time-series models for Food preparation time estimation using statistical analysis of sequential button-press data
Click to expand
Pitney Bowes
Senior Advisory Software Engineer - ML
CIM Team
- Technical Lead for NLP initiatives, spearheading architecture development and transforming project requirements into actionable development stories
- Designed Smart Data Quality solution using active learning with clustering and classification for advanced deduplication
- Developed Address Parser using MLP + Word2Vec achieving 75% accuracy for UK and Germany, with LSTM POC improving metrics by 2-4%
- Architected Entity Extractor using CRF models for CoNLL-2003 and MASC datasets, plus Text Categorization using SVM and MaxEnt
- Published research at ICSC 2018 (Los Angeles) on address parsing and IJCAI SML 2017 (Melbourne) on bank wire entity extraction
Click to expand
Wynk Limited
Senior Software Engineer - ML
- Led ML development for recommendation systems, contributing to Songs recommendation engine using Apache Storm-Trident aggregation
- Designed dynamic My Favorites feature analyzing user activity to calculate interest scores across Hadoop with Hive-generated analytics
- Implemented Google Now integration for Context-specific cards based on user activity patterns with automated album release notifications
- Architected distributed asynchronous Event processing framework using Storm and Kafka, achieving 500% improvement in transaction processing speed
Click to expand
Samsung Electronics
Technical Lead - ML
Advanced Technology Group
- ML Lead for Advanced text analytics using tf-IDF, topic modeling, and entity extraction on Hadoop with Elasticsearch indexing
- Designed Topic Extraction using LDA modeling with Question/Answer scoring using extracted topic probability distributions
- Architected map-reduce workflow for Textual Feature Extraction with Apache Oozie automation at Samsung HQ, Korea
- Developed CaaS infrastructure and camera use cases for real-time Contextual configuration suggestions
- Built configuration suggestion system using K-Means clustering with POS Tagging for voice-based command processing
- Engineered optimized caching scheme using Memcached improving object access performance by 20%, authored patent for Cross Region Caching
Click to expand
Oracle
Member Technical Staff
Database Security Development Team
- Software Developer for database security products, creating Log collectors and CLI tools for Oracle Audit Vault
- Designed and developed OSAUD, DBAUD log collectors and CSDK with push-based event model for AV11g platform
- Implemented logging schemes and Information Lifecycle Management systems for collector metrics
- Developed AVCLI command line utility with console interface including Firewall Management and Administration capabilities
Click to expand
Technical Skills
Expertise across the AI/ML stack
Programming Languages
Generative AI Technologies
Machine Learning Libraries
Data Processing Tools
Data Storage Systems
Cloud Technologies
Monitoring & Scheduling
Publications
13+ research papers in ML & Network Security
MVNIDS: A multiview-based network intrusion detection system
Journal of Information Security and Applications
Novel multiview approach to network intrusion detection combining multiple perspectives for improved threat identification.
Prompt Engineering-Based Network Intrusion Detection System
IEEE Access
Leveraging prompt engineering techniques with LLMs for network intrusion detection and classification.
Protocol Aware Unsupervised Network Intrusion Detection System
IEEE TrustCom 2023
Unsupervised learning approach for network intrusion detection with protocol-specific feature engineering.
Network based Intrusion Detection using Time aware LSTM Autoencoder
IEEE TrustCom-EUC 2023
Deep learning architecture using LSTM autoencoders with temporal awareness for network anomaly detection.
Online Network Attack Detection using Statistical Features
IEEE ANTS 2021
Real-time network attack detection system utilizing statistical feature extraction and machine learning.
Towards implementing fast and scalable Network Intrusion Detection System using Entropy based Discretization Technique
IEEE ICCCNT 2020
Scalable intrusion detection approach using entropy-based discretization for efficient network traffic analysis.
Automated parsing of geographical addresses: A multilayer feedforward neural network based approach
IEEE ICSC 2018
Neural network architecture for parsing and understanding geographical address components across formats.
Semantic extraction of Named Entities from Bank Wire text
IJCAI SML Workshop 2017
Deep learning approach for extracting named entities and semantic information from financial wire transfer text.
An active detection mechanism for detecting ICMP based attacks
IEEE TrustCom 2012
Active monitoring system for real-time detection of ICMP-based network attacks and vulnerabilities.
An active host-based detection mechanism for ARP-related attacks
International Conference on Computer Science and Information Technology
Host-based detection mechanism for identifying and mitigating ARP spoofing and related LAN attacks.
LAN attack detection using discrete event systems
ISA Transactions
Discrete event systems approach for modeling and detecting various LAN-specific attack patterns.
An active intrusion detection system for LAN specific attacks
International Conference on Advanced Computer Science and Information Technology
Active intrusion detection framework specifically designed for detecting attacks in local area networks.
A DES approach to intrusion detection system for ARP spoofing attacks
IEEE Mediterranean Conference on Control and Automation (MED)
Discrete Event Systems methodology applied to ARP spoofing attack detection in network environments.
Blog
Technical insights on AI, ML, and Data Science
Building Chatbot using LLM based Retrieval Augmented Generation Method
Learn how to develop intelligent chatbots using Large Language Models and RAG architecture to utilize knowledge bases for answering complex queries.
An Introduction to Multimodal Machine Learning
Explore computer algorithms that learn from multiple data modalities including text, images, audio, and behavioral data for enhanced AI systems.
Learning to Rank Algorithms
Deep dive into machine learning methods that solve ranking problems using supervised learning to discover optimal ordering for item lists.
Evolution of Recommender Systems
Trace the journey from traditional to modern deep learning-based recommender systems using MLP, LSTM, and attention-based models.
Custom Text Data Augmentation using NLPAug, SentencePiece, and Tensorflow
Techniques for data augmentation to provide varied training data, reduce overfitting, and produce more generalized machine learning models.
Batch Normalization Demystified
Understanding Batch Normalization as a technique for training deep neural networks by standardizing inputs to each mini-batch layer.
Speaking & Conferences
17+ international presentations across the globe
Global IT Security Summit
Unsupervised Learning methods for NIDS
Indian Institute of Technology
Evolution of Recommender Systems
Hasso Plattner Institute
Deep learning based methods for Network Intrusion detection systems
Digital Brand Protection Forum, Brainlinx
Securing network using deep learning based detection systems
TrustCom-EUC 2023
Network based intrusion detection using Time aware LSTM Autoencoder
TrustCom 2023
Protocol aware unsupervised network intrusion detection system
Delivery Hero Global Tech Summit
Dish Catalog Optimization
Master Class IT Security - PGS
Deep Learning methods for Network Intrusion Detection
Big Data Innovation Conference - PGS
Developing Transformers-based models for Text Analytics
Global IT Security Summit - PGS
Network based attack detection using Machine Learning
IEEE ANTS 2021
Online Network Attack Detection using Statistical Features
AICTE Faculty Development Program, LBRCE
Data Science and its Applications
Faculty Development Program, LBRCE
Applied Data Science: Tools and Techniques
ICCCNT 2020, IIT Kharagpur
Towards implementing fast and scalable Network Intrusion Detection System using Entropy based Discretization Technique
Grab Hackathon
Feature Engineering and Model Building
IJCAI Workshop on Semantic Machine Learning
Semantic extraction of Named Entities from Bank Wire text
International Conference on Advanced Communication and Networking
An active intrusion detection system for LAN specific attacks
Awards & Education
Education
PhD (Part Time) in Computer Science and Engineering
Indian Institute of Technology, Guwahati
July 2015 - June 2024 | Guwahati, India
Application specific Network Intrusion Detection System (An unsupervised perspective)
M.Tech in Computer Science and Engineering
Indian Institute of Technology, Guwahati
July 2008 - June 2010 | Guwahati, India
Active Detection mechanisms for attacks in Autonomous Systems
CPI: 9.1/10Awards & Recognition
Samsung Employee of the Month
Samsung Research
Recognition for outstanding contributions to text analytics solutions
PB Star Award
Pitney Bowes
Excellence in NLP and machine learning initiatives
Spectrum CIM Hackathon Winner
Pitney Bowes
2017First place in company-wide hackathon for innovative ML solution
Patent
Method and system for updating a local cache database
Patent for innovative caching mechanism in distributed systems
Professional Activities
• IEEE Conference Reviewer (ANTS, GCON)
• Active Speaker at International Tech Conferences
Get In Touch
Discuss AI opportunities, mentorship, or collaboration
Connect With Me
Always interested in discussing AI/ML projects, research collaborations, or speaking opportunities.
Schedule a Meeting
Mentorship & Consultation
© 2026 Ritesh Ratti. Built with Next.js & Framer Motion.