The Week-by-Week Syllabus
This syllabus is designed to build your knowledge incrementally to create a solid foundation in AI/LLM development.
Week 1: Data Collection and Preprocessing
What to learn: Techniques for data scraping, cleaning, and preprocessing using pandas and BeautifulSoup.
Why this comes before the next step: Clean data is the cornerstone of any successful AI application, and understanding how to gather and prepare your data effectively is crucial to the development process.
Mini-project/Exercise: Build a small web scraper to collect text data from a website and preprocess it for model training.
Week 2: Understanding Transformer Models
What to learn: Core concepts of transformers, including attention mechanisms and architecture, using Hugging Face Transformers.
Why this comes before the next step: Knowing the intricacies of transformer architecture prepares you to effectively utilize and customize these powerful models for specific tasks.
Mini-project/Exercise: Implement a small transformer model to classify text data from your previous week’s project.
Week 3: Fine-Tuning Pre-trained Models
What to learn: Methods for fine-tuning pre-trained models on specific datasets using PyTorch.
Why this comes before the next step: Mastering fine-tuning techniques will allow you to leverage existing models to enhance performance on niche applications.
Mini-project/Exercise: Fine-tune a pre-trained model on a dataset relevant to your interests and evaluate performance improvements.
Week 4: Deployment of LLM Applications
What to learn: Deploying AI applications with FastAPI and Docker.
Why this comes before the next step: Knowing how to deploy models enables you to take your work from local development to the real world.
Mini-project/Exercise: Create a REST API for your fine-tuned model and deploy it using Docker.
Week 5: Exploring Prompt Engineering
What to learn: Techniques for effective prompt engineering and user interaction with LLMs.
Why this comes before the next step: Optimizing prompts significantly affects the quality of model output, making this an essential skill for any LLM developer.
Mini-project/Exercise: Experiment with different prompts to improve response quality from your deployed LLM API.
Week 6: Evaluating and Optimizing Model Performance
What to learn: Evaluation metrics for AI models and techniques to improve performance.
Why this comes before the next step: Learning to evaluate and iterate on model performance is key to achieving production-level applications.
Mini-project/Exercise: Conduct a performance analysis of your deployed API, document findings, and suggest optimization strategies.