GENERATIVE AI

25+ Generative AI Projects for Students in 2026 (With GitHub Links)

By BuildIdeas Team·May 26, 2026·7 min read
Updated: May 2026

Beginner Projects

Conversational RAG Document Q&A Chatbot (PDF / DOCX)

A production-ready RAG chatbot that lets users upload PDF, DOCX, and HTML files, then ask follow-up questions with full conversational memory. Backed by ChromaDB for vector storage, FastAPI for the backend, and a Streamlit frontend with LangSmith tracing — a complete, deployable GenAI starter project.

PythonLangChainOpenAI GPT-4o-miniChromaDBFastAPIStreamlitLangSmithSQLite
View on GitHub

Multi-LLM RAG Chatbot (OpenAI, Gemini, HuggingFace)

Upload documents (PDF, CSV, DOCX) and chat with them using your choice of LLM — OpenAI GPT-4, Google Gemini Pro, or HuggingFace Mistral. Backed by ChromaDB for vector storage, built with LangChain and deployed as a Streamlit app — ideal for understanding multi-provider LLM integration.

PythonLangChainOpenAI APIGoogle GeminiHuggingFace TransformersChromaDBStreamlit
View on GitHub

AI YouTube Video Summarizer & Q&A App

Paste any YouTube URL and get a concise AI-generated summary of the video's transcript using OpenAI Whisper for transcription and GPT/LangChain for summarization. Also supports Q&A mode — ask any question about the video's content and get context-aware answers with source timestamps.

PythonOpenAI WhisperLangChainGPT-3.5/GPT-4StreamlitYouTube Transcript API
View on GitHub

AI Meeting Notes Summarizer with Action Item Extraction

Upload audio or video recordings of meetings and get back structured summaries, key decisions, action items (with assignees and deadlines), and sentiment analysis — powered by Whisper for transcription and GPT-4o-mini for structured extraction, deployed via Streamlit with Docker support.

PythonOpenAI WhisperGPT-4o-miniStreamlitDockerpython-docxLangchain
View on GitHub

Natural Language to SQL Query Generator

A Streamlit/Gradio app where users type plain English questions ("Show me the top 5 customers by revenue last month") and get auto-generated SQL queries executed against their database in real time — powered by LangChain and OpenAI, supporting SQLite, PostgreSQL, and MySQL.

PythonLangChainOpenAI GPTStreamlitSQLitePostgreSQLMySQLSQLAlchemy
View on GitHub

AI Voice Chat Assistant (Whisper + GPT-4 + TTS)

A hands-free AI voice assistant that transcribes your speech with OpenAI Whisper, processes it through GPT-4, and responds in natural-sounding speech via multiple TTS options (OpenAI TTS, ElevenLabs, Google TTS). Supports wake-word activation, character switching, and screen vision mode.

PythonOpenAI WhisperGPT-4oElevenLabs TTSFastAPIWebSocketsOllama (local LLM option)
View on GitHub

AI-Powered Cold Email Generator (LLaMA + LangChain + ChromaDB)

Paste a job posting URL and get an AI-crafted, personalized cold email tailored to the company's needs — using LLaMA 3.1 to parse job descriptions and ChromaDB to match relevant portfolio links from a vector-indexed skill database. Deployable via Streamlit with Docker and GitHub Actions CI.

PythonGroq LLaMA 3.1LangChainChromaDBStreamlitDockerGitHub Actions
View on GitHub

Multimodal AI Image Chat with Gemini Pro Vision

Upload any image and chat with it in natural language using Google's Gemini Pro Vision model — ask about object identification, chart data, document OCR, or scene descriptions. Deployed as a Streamlit web app with full conversation history and multi-turn visual Q&A.

PythonGoogle Gemini Pro VisionStreamlitPIL (Pillow)python-dotenv
View on GitHub

AI Personal Finance Coach Agent (Multi-LLM + RAG)

A conversational AI financial advisor that analyzes spending patterns, provides personalized budgeting recommendations, and answers portfolio questions — supporting Ollama (offline), Gemini, and OpenAI with 10-year historical data and Dockerized deployment via Streamlit.

PythonOllamaOpenAI APIGoogle GeminiStreamlitDockerRAG (vector search)yFinance
View on GitHub

Intermediate Projects

LLM Fine-Tuning with PEFT, LoRA & RLHF (FLAN-T5)

An end-to-end pipeline for fine-tuning FLAN-T5 on dialogue summarization — comparing full fine-tuning, PEFT/LoRA, and RLHF with PPO for toxicity reduction. Evaluates results with ROUGE metrics, includes a Streamlit demo, and demonstrates modern LLM adaptation at every level of compute budget.

PythonHuggingFace TransformersPEFTLoRATRL (PPO/RLHF)FLAN-T5ROUGEStreamlitPyTorch
View on GitHub

Autonomous Deep Research Agent (LangGraph + Multi-Agent)

A multi-agent research system where four specialized LangGraph agents collaboratively search the web, analyze sources, score credibility, and generate citation-backed research reports in PDF/Markdown. Supports local Ollama models and cloud APIs (Gemini, OpenAI) with a 7-day result caching layer.

PythonLangGraphLangChainTavily SearchOllamaGoogle GeminiOpenAISQLite (checkpointing)Docker
View on GitHub

AI Code Review Tool (Multi-Provider: Claude, Gemini, OpenAI)

A CLI-based AI code reviewer supporting GitHub, GitLab, Bitbucket, Azure DevOps, and Gitea — running 15+ review types (security, performance, unused code) using Claude, Gemini, or OpenAI. Features 95%+ token reduction via semantic chunking, ReAct agent mode for deep codebase exploration, and OWASP-aligned security analysis.

TypeScriptLangChainOpenAI APIAnthropic Claude APIGoogle GeminiGitHub Actions
View on GitHub

GPT Research Agent with Autonomous Web Search (FastAPI)

A FastAPI-powered autonomous research agent that takes any topic, plans a research strategy, searches the web in parallel, synthesizes findings, and generates a structured PDF/Markdown report — inspired by the STORM paper. Supports custom LLM backends via LangChain adapter.

PythonFastAPILangChainOpenAI GPT-4Tavily Search APIWeasyPrint (PDF generation)Docker
View on GitHub

Knowledge Graph Builder from Unstructured Text (LLM + Neo4j)

Build structured knowledge graphs automatically from PDFs, Wikipedia articles, or plain text — using GPT-4o to extract entities and relationships, store them in a Neo4j graph database, and then query the graph with natural language questions via Cypher query generation.

PythonOpenAI GPT-4oLangChainNeo4jWikipedia LoaderJupyter Notebook
View on GitHub

AI Meeting Intelligence Agent (Whisper + GPT-4o + Email)

A full pipeline that transcribes audio/video meetings using Whisper, generates structured Minutes of Meeting (summary, decisions, action items with assignees and deadlines) using GPT-4o-mini, and emails the report automatically — packaged as a Streamlit app with Docker support.

PythonOpenAI WhisperGPT-4o-miniStreamlitDockerSMTP (email automation)LangChain
View on GitHub

Multi-Agent Project Planner with CrewAI & Chainlit

An intelligent software project planning system using CrewAI's multi-agent framework where specialized agents autonomously decompose requirements, estimate timelines, allocate resources, and create a comprehensive project plan — served through a conversational Chainlit interface.

PythonCrewAIChainlitOpenAI GPT-4LangChainPydantic
View on GitHub

LangGraph Stateful Agent Orchestrator with Semantic Routing

A production-ready stateful agent using LangGraph's conditional state graph with semantic routing that triages user queries to specialized workflows — code assistance, finance queries, general knowledge — while maintaining full conversational memory across sessions via InMemorySaver.

PythonLangGraphLangChainOpenAI GPT-4Semantic RoutingInMemorySaver
View on GitHub

AI-Powered Smart Email Assistant (LangChain + Gmail API)

A Streamlit-based AI email assistant that connects to Gmail via OAuth2 and uses LangChain agents with Google Gemini/OpenAI to read, categorize, draft replies, and compose new emails in natural language — demonstrating real agentic tool-use with a real productivity API.

PythonLangChainGoogle Gemini / OpenAI GPT-4Gmail APIOAuth2Streamlit
View on GitHub

Advanced Projects

Production-Grade Agentic RAG Platform (GraphRAG + FAISS + Kubernetes)

A production-grade agentic RAG platform combining ChromaDB, BM25, and FAISS retrieval strategies with LangChain agent orchestration, real-time Socket.IO streaming, MongoDB persistence, and Redis caching — deployable via Docker Compose, Kubernetes (with Argo Rollouts), and Terraform on AWS/OCI.

PythonLangChainFlaskOllamaChromaDBFAISSBM25MongoDBRedisSocket.IODockerKubernetesTerraformReact (frontend)
View on GitHub

Open Deep Research Agent (LangGraph + MCP + Tavily)

LangChain's official open-source deep research system built on LangGraph — autonomously plans multi-step research, searches the web via Tavily, synthesizes reports using any LLM (Ollama/Gemini/OpenAI), and exports to Markdown with full MCP compatibility for AI agent integration. Benchmarked on PhD-level research tasks.

PythonLangGraphLangChainTavily Search APIOllamaGoogle GeminiOpenAIDockerLangSmith Studio
View on GitHub

AI Financial Advisory Agent (RAG + Function Calling + LangGraph)

An intelligent financial advisory system showcasing 5 core GenAI capabilities: Structured Output (JSON mode), RAG, embeddings, function calling, and LangGraph agents. Analyzes spending patterns, recommends portfolio adjustments, and answers finance questions with evidence-backed advice — powered by Google Gemini.

PythonGoogle Gemini APILangChainLangGraphFAISSStreamlitChromaDBJupyter Notebook
View on GitHub

Fully Local Web Research & Report Writing Agent (Ollama + LangGraph)

A 100% local deep research agent that uses Ollama LLMs and open web search APIs to autonomously research any topic and produce comprehensive, citation-rich reports — all without sending data to any external cloud API. Runs in Docker with LangGraph Studio UI for workflow visualization.

PythonLangGraphOllama (Llama 3.2)Tavily / DuckDuckGo SearchDockerLangSmith Studio
View on GitHub

Multi-Agent AI Research Assistant (CrewAI + Exa Search)

A CrewAI-powered multi-agent research crew where specialized agents (Researcher, Summarizer, Fact-Checker) collaborate sequentially to explore any topic, validate findings against live sources using Exa search, and produce polished research reports — with AgentOps monitoring for production observability.

PythonCrewAIExa Search APILLaMA 3.1 (via AIML API)StreamlitAgentOpsFirecrawl
View on GitHub

GraphRAG Knowledge Base with Neo4j + LangChain (Multi-Source)

Build a full GraphRAG system that ingests PDFs, web pages, YouTube videos, and Office documents into a Neo4j knowledge graph using LLM-powered entity extraction (GPT-4/Gemini/Llama3) and supports three retrieval strategies — GraphRAG, vector search, and Text2Cypher — via a FastAPI + React interface.

PythonLangChainOpenAI / Gemini / Llama3Neo4jFastAPIReactDockerLLMGraphTransformer
View on GitHub

AI Coding Assistant with Multi-Provider Review, Agent Mode & CI/CD

A full-featured AI code review system for GitHub, GitLab, Bitbucket, Azure DevOps, and Gitea that supports inline and summary reviews, thread replies, and a ReAct-style agent mode where the LLM explores the repository with shell commands before reviewing — configurable via YAML/JSON with seamless CI/CD pipeline integration.

PythonOpenAI / Anthropic Claude / Google Gemini / Ollama / Bedrock / Azure OpenAIDockerGitHub ActionsCI/CD
View on GitHub

Tips for Building Projects That Get You Hired

  1. 1

    Solve a real problem — "PDF Q&A for legal documents" is stronger than "a chatbot."

  2. 2

    Show RAG knowledge — most production LLM apps use RAG; build at least one RAG project.

  3. 3

    Use open-source models like Mistral or LLaMA alongside OpenAI — shows technical depth.

  4. 4

    Explain your chunking and embedding strategy — interviewers drill into this.

  5. 5

    Deploy on Hugging Face Spaces or Streamlit Cloud with a working demo link.

Not sure which Gen AI project to build?

Tell BuildIdeas your stack and experience level and get 3 tailored Gen AI project ideas with full build roadmaps in under 60 seconds.

Generate My Gen AI Project