Applied Artificial Intelligence and Machine Learning
June 8, 2025 · View on GitHub
Introduction
Applied AI is a curated knowledge repository of artificial intelligence & machine learning use cases, best practices, lessons learned, tools, and techniques, adopted by leading technology or tech-savvy organizations.
Numerous organizations frequently share their insights and expertise, encompassing best practices, tools, and techniques that shape their engineering culture. They do this through various public platforms such as engineering blogs, conferences, and meetups. This repository compiles and presents content gathered from these sources.
Topics
- Artificial Intelligence
- Machine Learning
- Deep Learning
- Generative AI
- Large Language Models
- Prompt Engineering
- Retrieval-Augmented Generation (RAG)
- MLOps
- Data Engineering
- Responsible AI
Organizations
Airbnb
Blog Posts
2024
- Airbnb’s AI-powered photo tour using Vision Transformer
- Automation Platform v2: Improving Conversational AI at Airbnb
- Transforming Location Retrieval at Airbnb: A Journey from Heuristics to Reinforcement Learning
- Chronon, Airbnb’s ML Feature Platform, Is Now Open Source
- Airbnb Brandometer: Powering Brand Perception Measurement on Social Media Data with AI
2023
- Prioritizing Home Attributes Based on Guest Interest
- Learning To Rank Diversely
- Building Airbnb Categories with ML & Human in the Loop
- Wisdom of Unstructured Data: Building Airbnb’s Listing Knowledge from Big Text Data
2022
- How AI Text Generation Models Are Reshaping Customer Support at Airbnb
- Intelligent Automation Platform: Empowering Conversational AI and Beyond at Airbnb
2021
2020
Algolia
Blog Posts
2024
- Introducing AI Personalization (𝛽)
- How large-language models are changing ecommerce
- What does it mean for AI search to “understand” customers?
- The future of B2B is powered by AI. Here’s how to get started.
- The definitive guide to semantic search engines
2023
Autotrader
Blog Posts
2024
- Demystifying Large Language Models (LLM101)
- To App or Not to App (Rapid Data App Prototyping in the ML Experimentation Cycle)
- So many labels, so little time; accelerating our image labelling process
2023
2022
BlaBlaCar
Blog Posts
2023
- How we used machine learning to fight fraud at BlaBlaCar — Part 1
- How we built our machine learning pipeline to fight fraud at BlaBlaCar — Part 2
- How BlaBlaCar leverages machine learning to match passengers and drivers - Part 2
- How BlaBlaCar leverages machine learning to match passengers and drivers - Part 1
Dropbox
Blog Posts
2025
2024
- How we use Lakera Guard to secure our LLMs
- Bye Bye Bye...: Evolution of repeated token attacks on ChatGPT models
- Bringing AI-powered answers and summaries to file previews on the web
- From AI to sustainability, why our latest data centers use 400G networking
2023
- Putting everything in its right place with ML-powered file organization
- Is this a date? Using ML to identify date formats in file names
- Dont you (forget NLP): Prompt injection with control characters in ChatGPT
- Accelerating our A/B experiments with machine learning
2021
eBay
Blog Posts
2024
- Background Enhancement Tool Turns Any Photo Into a Studio-Quality Product Image
- eBay's Responsible AI Principles
- Cutting Through the Noise: Three Things We've Learned About Generative AI and Developer Productivity
- Podcast: Nitzan Mekel-Bobrov on AI, the Future of Shopping Online, and the Value of Building In-House
2023
- New Social Caption Generator Uses AI to Help Sellers Post More Easily
- eBay Exec on How Artificial Intelligence Will Bring a ‘Paradigm Shift’ to Ecommerce
- ‘Magical’ Listing Tool Harnesses the Power of AI to Make Selling on eBay Faster, Easier, and More Accurate
- Evolving Recommendations: A Personalized User-Based Ranking Model
- Beyond Words: How Multimodal Embeddings Elevate eBay's Product Recommendations
- eBay Execs Talk Generative AI and Computer Vision at VentureBeat Transform Conference
- eBay’s Blazingly Fast Billion-Scale Vector Similarity Engine
- How eBay Created a Language Model With Three Billion Item Titles
2022
Etsy
Blog Posts
2024
2023
- How We Built a Multi-Task Canonical Ranker for Recommendations at Etsy
- Leveraging Real-Time User Actions to Personalize Etsy Ads - Research Paper
- From Image Classification to Multitask Modeling: Building Etsy’s Search by Image Feature
- Leveraging Real-Time User Actions to Personalize Etsy Ads
2022
Expedia
Blog Posts
2024
- Learning Embeddings for Lodging Travel Concepts
- Traveling Just Got a Lot Smarter with Romie
- Choosing the Right Candidates for Lodging Ranking
2023
- Using Synthetic Search Data for Flights Price Forecasting
- Expedia Group’s Customer Lifetime Value Prediction Model
- Generating Diverse Travel Recommendations
- Increasing Travelers’ Engagement Through Price Alerts
- Candidate Generation Using a Two Tower Approach With Expedia Group Traveler Data
2022
- Categorising Customer Feedback Using Unsupervised Learning
- How to Optimise Rankings with Cascade Bandits
2021
GitHub
Blogs
2024
- Unlocking the power of unstructured data with RAG
- What is retrieval-augmented generation, and what does it do for generative AI?
- Hard and soft skills for developers coding in the age of AI
- How AI code generation works
- Fixing security vulnerabilities with AI
- A developer’s second brain: Reducing complexity through partnership with AI
2023
- How we’re experimenting with LLMs to evolve GitHub Copilot
- The architecture of today’s LLM applications
- Demystifying LLMs: How they can do things they weren’t trained to do
- How to build an enterprise LLM application: Lessons from GitHub Copilot
- A developer’s guide to prompt engineering and LLMs
- Inside GitHub: Working with the LLMs behind GitHub Copilot
- How companies are boosting productivity with generative AI
- How generative AI is changing the way developers work
- Generative AI-enabled compliance for software development
- What developers need to know about generative AI
2022
Grab
Blog Posts
2025
- From failure to success: The birth of GrabGPT, Grab’s internal ChatGPT
- Streamlining RiskOps with the SOP agent framework
- Introducing the SOP-driven LLM agent frameworks
- Grab AI Gateway: Connecting Grabbers to multiple GenAI providers
2024
- Supercharging LLM application development with LLM-Kit
- Metasense V2: Enhancing, improving and productionisation of LLM powered data governance
- LLM-assisted vector similarity search
- Leveraging RAG-powered LLMs for Analytical Tasks
- Evolution of Catwalk: Model serving platform at Grab
- Enabling conversational data discovery with LLMs at Grab
- Unveiling the process: The creation of our powerful campaign builder
- LLM-powered data classification for data entities at scale
Instacart
Blog Posts
2024
- Enhancing FoodStorm with AI Image Generation
- Distinguished Speaker Series with Ping Li: ML-Enhanced Sparse Vector Search with Privacy Protection
- Unveiling the Core of Instacart’s Griffin 2.0: A Deep Dive Into the Model Serving Platform
- Unlocking Efficiency: How Ava Became Our AI Productivity Partner
2023
- One model to serve them all
- Monte Carlo, Puppetry and Laughter: The Unexpected Joys of Prompt Engineering
- Unveiling the Core of Instacart’s Griffin 2.0: A Deep Dive into the Machine Learning Training Platform
- Introducing Griffin 2.0: Instacart’s Next-Gen ML Platform
- Scaling Productivity with Ava — Instacart’s Internal AI Assistant
- Supercharging ML/AI Foundations at Instacart
- Adopting dbt as the Data Transformation Tool at Instacart
- The Next Era of Data at Instacart
- How Instacart Modernized the Prediction of Real Time Availability for Hundreds of Millions of Items While Saving Costs
Mercari
Blog Posts
2025
- Rethink Tool's UI/UX - Human-Centric to AI-Driven
- Tackling Knowledge Management
- LLM x SRE: Mercari’s Next-gen Incident Handling Buddy
2024
- LLMs at Work: Outsourcing Vendor Assessment Toil to AI
- Streamlining Security Incident Response with Automation and Large Language Models
- Fine-tuned SigLIP Image Embeddings for Similar Looks Recommendation in a Japanese C2C Marketplace
- Fine-Tuning an LLM to Extract Dynamically Specified Attributes
- LMM based Approach to Large-scale Item Category Classification
2023
- LM-based query categorization for query understanding
- Leveraging LLMs in Production: Looking Back, Going Forward
- The Bitter Lesson about Engineers in a ChatGPT World
- Putting the Voice of Customers into the Software Development Process
- Mercari’s Journey Integrating AI & Search at Berlin Buzzwords 2023
- Improving Item Recommendation Accuracy Using Collaborative Filtering and Vector Search Engine
- Model management for client side ML powered by Firebase
- The Journey to Machine-Learned Re-ranking
- Do We Need Engineers in a ChatGPT World?
Meta
Blog Posts
Meta AI Blog and Research Papers
- Various AI/ML research papers submitted by Meta are available here
- Meta AI blog
2024
- Leveraging AI for efficient incident response
- Maintaining large-scale AI capacity at Meta
- Our next-generation Meta Training and Inference Accelerator
- PVF: A novel metric for understanding AI systems’ vulnerability against SDCs in model parameters
- Optimizing RTC bandwidth estimation with machine learning
- Logarithm: A logging engine for AI training workflows and services
- Building Meta’s GenAI Infrastructure
- Improving machine learning iteration speed with faster application build and packaging
- Lazy is the new fast: How Lazy Imports and Cinder accelerate machine learning at Meta
- How Meta is advancing GenAI
- Serverless Jupyter Notebooks at Meta
- Building custom silicon for the future of AI
2023
- Introducing Code Llama, a state-of-the-art large language model for coding
- Watch: Meta’s engineers on building network infrastructure for AI
- How Meta is creating custom silicon for AI
- AI debugging at Meta with HawkEye
- Arcadia: An end-to-end AI system performance simulator
- MTIA v1: Meta’s first-generation AI inference accelerator
2022
2021
Nubank
Blog Posts
2025
- AskNu: A RAG solution to increase Employees Productivity at Nubank
- Enhancing engineering workflows with AI: a real-world experience
2024
Slalom Build
Blog Posts
2024
- The Secret to Success in Large-Scale Data Engineering Projects
- Generative AI Prompt Engineering: A Balancing Act
- RAG for Quality Engineers
- Feeding AI Models with Slack Data
- Keeping the Human in the Loop
2023
Uber
Blog Posts
2024
- Open Source and In-House: How Uber Optimizes LLM Training
- Genie: Uber’s Gen AI On-Call Copilot
- QueryGPT – Natural Language to SQL Using Generative AI
- Personalized Marketing at Scale: Uber’s Out-of-App Recommendation System
- DataK9: Auto-categorizing an exabyte of data at field level through AI/ML
- From Predictive to Generative – How Michelangelo Accelerates Uber’s AI Journey
- DragonCrawl: Generative AI for High-Quality Mobile Testing
- Scaling AI/ML Infrastructure at Uber
- Stopping Uber Fraudsters Through Risk Challenges
- Model Excellence Scores: A Framework for Enhancing the Quality of Machine Learning Systems at Scale
2023
- The Transformative Power of Generative AI in Software Development: Lessons from Uber’s Tech-Wide Hackathon
- Innovative Recommendation Applications Using Two Tower Embeddings at Uber
- Demand and ETR Forecasting at Airports
- Risk Entity Watch – Using Anomaly Detection to Fight Fraud
- Accelerating Advertising Optimization: Unleashing the Power of Ads Simulation
- uVitals – An Anomaly Detection & Alerting System
2022
- Project RADAR: Intelligent Early Fraud Detection System with Humans in the Loop
- DeepETA: How Uber Predicts Arrival Times Using Deep Learning
- Uber’s Real-Time Document Check
- How Uber Optimizes the Timing of Push Notifications using ML and Linear Programming
- ML Education at Uber: Program Design and Outcomes
- ML Education at Uber: Frameworks Inspired by Engineering Principles
2021
Walmart
Blog Posts
2024
- Managing Secure API Access to LLMs in Distributed Systems with Dataflow
- Creating Web App For File Interactions Using RAG: A Developers Guide
- Exploring the World of Vector Databases: A Comprehensive Guide
- Using Predictive and Gen AI to Improve Product Categorization at Walmart
- AI-Driven Continuous Monitoring: The Future of Third-Party Risk Management
- Build your own GPT (BYO-GPT)
- Textual Titans: A Large Language Model Odyssey
- Evaluation of RAG Metrics using RAGA
- Deploying RAGs in production — Part 2
- Deploying RAGs in production — Part 1
- Extracting Product Attributes from PDFs using PAE Framework
- Augmentation Techniques for Imbalanced text Classification
- Transforming Text Classification with Semantic Search Techniques — Faiss
2023
2022
Wix
Blog Posts
2024
- How Wix’s AI Site-Chat Redefines Chatbots with Adaptive Feedback and Dynamic Knowledge
- AI for Customer Care Routing at Wix - Making The First Step of Reinforcement Learning
- Wix streamlines website content creation with GPT.
- Real-World Forecasting with Deep Learning: How We Do It at Wix
- Customizing LLMs for Enterprise Data Using Domain Adaptation: The Wix Journey
- SageMaker Batch Transform Unleashed: My Journey at Wix to Achieve Scalable ML
- Beyond Content Generation: AI-Based Layout Generation for Graphic Design
Other Resources
Case Studies from vendors and industry
Following resources from AI vendors like OpenAI, Anthropic, and Databricks provide valuable insights into how businesses are leveraging AI to solve complex challenges and drive innovation.
- OpenAI
- Anthropic
- Databricks
- 321 real-world gen AI use cases from the world's leading organizations
- ML and LLM system design: 500 case studies to learn from
Contributors
Contribute
Contributions welcome! Read the contribution guidelines first.
Stargazers Over Time
License
To the extent possible under law, Unmesh Gundecha has waived all copyright and related or neighboring rights to this work.
If you decide to use this anywhere, please credit @upgundecha on X. Also, if you like my work, check out my other projects on GitHub.