Generative AI
In Production.
Building reliable LLM applications is hard. Build applications that scale naturally from your laptop to production with the right tools and libraries.
- 50% discount off the final price
- Early access to the course
- Access to new modules as published
Preorders will be open for a limited time
Enroll now
What We'll Cover
We'll teach you everything you need to know to get you from beginner to expert when working with LLMs. You'll learn how to evaluate your llm applications, stich together complex workflows and ultimately deploy them to real users
Module 1
Introduction
We'll set up your development environment and get you started with the basics of building LLM applications.
Lesson 1
Unreleased
Python Setup
Get your development environment ready with just the essential tools and libraries you need.
Lesson 2
Unreleased
Core AI Concepts
Learn the fundamental AI and LLM concepts required for building production applications.
Lesson 3
Unreleased
Problem Introduction
Understand the problem space and requirements for building reliable LLM applications.
Lesson 4
Unreleased
Requirements
Define clear requirements and constraints for the system.
Lesson 5
Unreleased
System Design
What are the main components of a LLM application?
Module 2
RAG
We'll build a simple RAG application and get exposed to concepts like BM25, Re-Rankers, Embeddings and more
Lesson 1
Unreleased
RAG
Master Retrieval-Augmented Generation (RAG), the game-changing technique that enhances LLM responses with real-world knowledge
Lesson 2
Unreleased
Evaluating Retrieval
Deep dive into BM25 and key metrics to measure and optimize your retrieval system's effectiveness
Lesson 3
Unreleased
Re-Rankers
Leverage advanced re-ranking techniques to dramatically improve search relevance and accuracy
Lesson 4
Unreleased
Embedding
Harness the power of vector embeddings to capture semantic meaning and revolutionize your search capabilities
Lesson 5
Unreleased
Hybrid Search
Create a best-of-both-worlds search system by masterfully combining RAG with re-ranking strategies
Lesson 6
Unreleased
Building a RAG Application
Put it all together to build a production-ready RAG system that delivers accurate, context-aware responses
Module 3
Multimodal QnA
We'll look at how we can utilise image embeddings and vision models to build more complex applications that go beyond just simple OCR
Lesson 1
Unreleased
Setting Up Python
Set up a complete Python development environment with VS Code, uv, OpenAI API access, FastAPI, and essential libraries for multimodal AI development
Lesson 2
Unreleased
Core AI Concepts
Master fundamental concepts like image embeddings, vision-language models, prompt engineering, and vector databases essential for multimodal AI
Lesson 3
Unreleased
Problem Introduction
Learn to build an AI shopping assistant that helps users find products, answer questions, and compare items with high accuracy and speed
Lesson 4
Unreleased
Image Understanding
Implement advanced image processing and understanding capabilities using Vision-Language Models and embeddings
Lesson 5
Unreleased
Query Processing
Build robust query understanding systems that can interpret and process user questions about visual products
Lesson 6
Unreleased
Performance Optimization
Optimize your multimodal QA system to achieve sub-2-second response times while maintaining 95%+ accuracy
Module 4
Information Extraction using VLMs
We'll then look at how we can use VLMs to extract information from images and documents, taking advantage of their vision capabilities to go beyond simple QnA
Lesson 1
Unreleased
Introduction to VLMs for Information Extraction
Learn the fundamentals of Vision Language Models and how they can be used to extract structured information from images and documents
Lesson 2
Unreleased
Working with GPT-4V
Implement information extraction pipelines using GPT-4V, focusing on prompt engineering and output formatting for reliable data extraction
Lesson 3
Unreleased
Open Source Alternatives
Explore open source VLMs like LLaVA and CogVLM for information extraction tasks, with practical deployment considerations
Lesson 4
Unreleased
Document Processing Pipeline
Build an end-to-end document processing pipeline combining OCR, VLMs, and post-processing for robust information extraction
Lesson 5
Unreleased
Handling Complex Documents
Advanced techniques for extracting information from complex documents like invoices, receipts, and forms using VLMs
Lesson 6
Unreleased
Validation and Error Handling
Implement validation rules and error handling mechanisms to ensure accurate and reliable information extraction
Module 5
Real-Time Voice Applications
Learn to build real-time voice applications using modern AI models, focusing on speech recognition, text-to-speech, and interactive voice experiences
Lesson 1
Unreleased
Voice AI Fundamentals
Understand the core components of voice AI systems including speech recognition (ASR), text-to-speech (TTS), and voice activity detection
Lesson 2
Unreleased
Building with OpenAI Whisper
Implement real-time speech recognition using OpenAI's Whisper model, handling streaming audio and processing continuous speech
Lesson 3
Unreleased
Google Gemini Voice Integration
Integrate Google's Gemini API for advanced voice capabilities, including multilingual support and speaker diarization
Lesson 4
Unreleased
Real-Time Voice Processing
Design and implement real-time voice processing pipelines with WebRTC and WebAudio APIs for low-latency applications
Lesson 5
Unreleased
Voice UI/UX Design
Learn best practices for voice user interface design, including conversation design, error handling, and user feedback
Lesson 6
Unreleased
Deployment and Scaling
Deploy voice applications at scale, handling concurrent users, optimizing latency, and managing cloud resources efficiently
Get Started Today
Learn how to build, prototype and deploy LLM applications using some of the latest technology
Code + eBook
$70$140
Get comprehensive learning materials along with the code.
- Detailed eBook guide with code examples
- In-depth explanations
- Deployment guides
Complete Package
$140$280
Get the full learning experience with video walkthroughs.
- Everything in Code + eBook
- Video tutorials
- Step-by-step walkthroughs
- Implementation tips
Frequently Asked Questions
Do you offer refunds?
Yes! If you're not happy with the course, just drop us an email at dev@genaiinprod.com and we'll refund you in full.
Can I get an invoice for this course?
Yes! If you're a business and need an invoice, just drop us an email at dev@genaiinprod.com and we'll get you sorted. We also provide a bulk ticket discount for companies with a team of 3 or more.
What can I do after this course?
By the end of this course, you'll have deployed a total of 4 different LLM models and walked away with a solid understanding of how to deploy them in production. There's a lot more to learn and this course is just the beginning.
What if I have a question after the course?
We'll be around to help you out. You can join our Discord community and ask questions there. We'll also be around to help you out.
Stay Updated
Get early access to our source code and stay informed about our launch. We're fully open source and subscribers get exclusive repository access from day one.