๐Ÿ“ฆ google-gemini / workshops

๐Ÿ“„ README.md ยท 242 lines
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242# Gemini Meetup Workshops

Welcome to the **Gemini Meetup Workshops** repository! This collection serves as a central hub for hands-on, educational, and experimental projects showcasing the power and versatility of Google's Gemini models and related AI technologies (such as CrewAI, PettingZoo, etc.).

Our goal is to provide practical, open-source examples that demonstrate cutting-edge AI applications, from intelligent agents playing games and generating content to embedding AI on edge devices. Whether you're a developer, AI enthusiast, or just curious, these workshops offer a fantastic way to dive deep into real-world Gemini implementations.

---

## ๐ŸŽฌ Interactive Demos

<table>
<tr>
<td width="33%" align="center">

### [๐ŸŽฌ Cameo](./cameo/)
[![Cameo Demo](https://img.youtube.com/vi/yKMFeKnoJuk/maxresdefault.jpg)](https://www.youtube.com/watch?v=yKMFeKnoJuk)
**AI-powered personalized video generator**
[๐Ÿ“– Documentation](./cameo/)

</td>
<td width="33%" align="center">

### [โ™Ÿ๏ธ Chess Companion](./chess/)
[![Chess Demo](https://img.youtube.com/vi/AE4dd4tgNks/maxresdefault.jpg)](https://www.youtube.com/watch?v=AE4dd4tgNks)
**Real-time game analysis with voice interaction**
[๐Ÿ“– Documentation](./chess/)

</td>
<td width="33%" align="center">

### [๐ŸŽฌ TV Companion](./tv/)
[![TV Demo](https://img.youtube.com/vi/IzXzDVdPb98/maxresdefault.jpg)](https://www.youtube.com/watch?v=IzXzDVdPb98)
**AI film critic that controls your TV**
[๐Ÿ“– Documentation](./tv/)

</td>
</tr>
<tr>
<td width="33%" align="center">

### [๐ŸŽฎ Wind Waker Chat](./waker/)
[![Waker Demo](https://img.youtube.com/vi/zvOtG8LkDD8/maxresdefault.jpg)](https://www.youtube.com/watch?v=zvOtG8LkDD8)
**Voice chat with controller support**
[๐Ÿ“– Documentation](./waker/)

</td>
<td width="33%" align="center">

### [๐Ÿงฑ Bricks](./bricks/)
[![Lego Demo](https://img.youtube.com/vi/JBwPYOdZnGs/maxresdefault.jpg)](https://www.youtube.com/watch?v=JBwPYOdZnGs)
**Fine-tuning Gemini for brick-building**
[๐Ÿ“– Documentation](./bricks/)

</td>
<td width="33%" align="center">

### [๐ŸŽจ Artifacts](./artifacts/)
[![Artifacts Demo](https://img.youtube.com/vi/DHIlmG-hP-8/maxresdefault.jpg)](https://www.youtube.com/watch?v=DHIlmG-hP-8)
**AI-powered interactive content platform**
[๐Ÿ“– Documentation](./artifacts/)

</td>
</tr>
<tr>
<td width="33%" align="center">

### [๐ŸŽฏ Multi-Agent Games](./games/)
[![Games Demo](https://img.youtube.com/vi/kUP59cRW9D0/maxresdefault.jpg)](https://www.youtube.com/watch?v=kUP59cRW9D0)
**Gemini plays games with PettingZoo**
[๐Ÿ“– Documentation](./games/)

</td>
<td width="33%" align="center">

### [๐ŸฅŠ Smash Bot](./smash/)
[![Smash Demo](https://img.youtube.com/vi/ajqIn8mWkKY/maxresdefault.jpg)](https://www.youtube.com/watch?v=ajqIn8mWkKY)
**LLM-powered fighting game agent**
[๐Ÿ“– Documentation](./smash/)

</td>
<td width="33%" align="center">

### [๐Ÿ„ Mario Level Generator](./mario/)
[![Mario Demo](https://img.youtube.com/vi/9WwIZadNiWw/maxresdefault.jpg)](https://www.youtube.com/watch?v=9WwIZadNiWw)
**AI creates Super Mario Bros levels**
[๐Ÿ“– Documentation](./mario/) *(Coming Soon)*

</td>
</tr>
<tr>
<td width="33%" align="center">

### [๐Ÿ–ฅ๏ธ Desktop Assistant](./desktop/)
[![Desktop Demo](https://img.youtube.com/vi/wz-s7QiiwWo/maxresdefault.jpg)](https://www.youtube.com/watch?v=wz-s7QiiwWo)
**Gemini controls your desktop**
[๐Ÿ“– Documentation](./desktop/) *(Coming Soon)*

</td>
<td width="33%" align="center">

### [๐Ÿˆ Fantasy Football Draft Companion](./fantasy/)
[![Fantasy Football Draft Companion Demo](https://img.youtube.com/vi/dY1JoEQ9qzU/maxresdefault.jpg)](https://www.youtube.com/watch?v=dY1JoEQ9qzU)
**Gemini guides your fantasy football draft**
[๐Ÿ“– Documentation](./fantasy/)

</td>
<td width="33%" align="center">

</td>
</tr>
</table>

[**View All 18+ Workshops โ†’**](#workshop-directory)

---

## ๐ŸŽค Talks & Presentations

Conference talks, interviews, and educational content exploring AI, LLMs, and the future of human-AI interaction:

### ๐ŸŽ“ Academic & Conference Talks
*   [**Socratic Sparks & Desert Trials: Rethinking Mastery in Age of LLMs**](https://www.youtube.com/watch?v=6uEHXeqi9N8) - IIA @MIT 2025
*   [**Competence in the Age of LLMs**](https://www.youtube.com/watch?v=NaGTw0qg5Vw) - TEDxCSTU ([๐Ÿ“Š slides](https://google-gemini.github.io/workshops/tedx))
*   [**Beyond Automation: Designing AI for Human Flourishing**](https://www.youtube.com/watch?v=5ZhFrzLWkHE) - IIA @MIT 2025
*   [**Thinking Creatively and Critically in the Age of LLMs**](https://www.youtube.com/watch?v=3ZZdohrcGK4) - University of Sussex
*   [**The Future of AI**](https://youtu.be/Ob_HCJ5CaR0) - Magdalen College, Oxford University ([event page](https://www.magd.ox.ac.uk/events/the-future-of-ai-with-googles-peter-danenberg/)) ([๐Ÿ“Š slides](https://google-gemini.github.io/workshops/oxford))

### ๐Ÿ”ฌ Technical Deep Dives
*   [**What it was like to be one of the first engineers on Gemini**](https://www.youtube.com/watch?v=kLidkwA44QA) - IIA @MIT 2025
*   [**Google Engineers On Learnings From Building Gemini**](https://www.youtube.com/watch?v=UUsVTEUfJAY)
*   [**Prompt Engineering with Bard**](https://www.youtube.com/watch?v=i01cizb6Txg) - Google Engineer Talk

### ๐ŸŽฎ AI in Action
*   [**Playing Super Smash Bros with agentic Gemini**](https://www.youtube.com/watch?v=AR0o9DLF0H0)

### ๐Ÿ“บ Interviews & Media
*   [**AI RISKS with GOOGLE's Peter Dananberg**](https://www.youtube.com/watch?v=S32rx_u3jJ8)
*   [**Google's Gemini AI: EXCLUSIVE Interview**](https://www.youtube.com/watch?v=By7XnT4MtJI)
*   [**Google GenAI Virtual Seminar (Ep. 3)**](https://www.youtube.com/watch?v=dT1ZxW7xJBs)
*   [**LLMs the Language of Creativity, Power and Intelligence**](https://www.youtube.com/watch?v=1i_8824zBO4) - SZA Talk

**Related Coverage:** [CFA Society Boston event](https://www.cfaboston.org/iCore/Events/Event_display.aspx?EventKey=060325&WebsiteKey=6e0836e4-c0e5-45a8-9d93-beb2795473a4), [Forbes article on AI sentience](https://www.forbes.com/sites/johnwerner/2025/07/01/are-we-speaking-to-sentient-ai-and-is-that-good/), [Mint article on Gemini](https://www.pressreader.com/india/mint-chennai/20250630/281943138875699?srsltid=AfmBOoo5LFLylqjvY_AXicfVoz-wNQqW9mxq-nkMMKRRbWvbPYQUF4q9), [Margaret Boden Lecture at Sussex](https://digit-research.org/news/digit-co-hosts-first-margaret-boden-annual-lecture-on-interdisciplinary-ai/)

---

## ๐Ÿ“š Workshop Directory

Explore the diverse range of projects available in this repository. Each link takes you to a dedicated subdirectory with its own `README.md` containing detailed instructions and code.

*   [`adventure/`](./adventure/) - **Dungeon Adventure using CrewAI and Gemini:** A text-based adventure game leveraging CrewAI and Gemini to create dynamic, agent-driven storytelling with a persistent Dungeon Master and multiple player agents.
*   [`artifacts/`](./artifacts/) - **Unveiling Artifacts: AI-Powered Interactive Content Platform:** Explores AI-driven content creation, demonstrating how LLMs can dynamically generate and refine interactive web experiences (HTML, CSS, JS) through an iterative, conversational workflow.
*   [`bricks/`](./bricks/) - **Bricks: Fine-tuning Gemini for Brick-Building:** An end-to-end pipeline for fine-tuning Gemini 1.5 Pro to understand natural language descriptions and generate precise LDraw (MPD) brick-building instructions. ([๐Ÿ“Š slides](https://google-gemini.github.io/workshops/bricks))
*   [`cameo/`](./cameo/) - **Cameo: AI-Powered Personalized Video Generator:** Creates personalized videos using your face and voice, combining MediaPipe face detection, Google Veo 3 video generation, and ElevenLabs voice cloning for fully customized AI-generated video content.
*   [`chess/`](./chess/) - **Chess Companion:** An AI-powered assistant providing real-time analysis, voice interaction, and TV control for live chess games, combining expert insights with historical context and engine analysis.
*   [`cost/`](./cost/) - **LLM Cost Comparison:** A hypothetical cost-comparison analysis of Gemini, GPT, and Llama 3 for a high-traffic app scenario, presented in a workshop slide format. ([๐Ÿ“Š slides](https://google-gemini.github.io/workshops/cost))
*   [`doodles/`](./doodles/) - **Dynamic SVG Storytelling & Visualization:** A powerful pipeline for converting static bitmap images into dynamic, interactive, and animatable SVGs for engaging visual storytelling and presentations.
*   [`factuality/`](./factuality/) - **Factuality: News Summarization & Redaction:** A Perplexity-style clone built with CrewAI that summarizes, cites, and redacts news articles, focusing on factual accuracy and source verification. ([๐Ÿ“Š slides](https://google-gemini.github.io/workshops/factuality))
*   [`fantasy/`](./fantasy/) - **Fantasy Football Draft Companion** A Chrome extension and server combination that streams your screen and microphone input to Gemini for commentary over a Sleeper draft, as well as assistance with any user queries, player research, or auto-recommendations.
*   [`games/`](./games/) - **Gemini Plays Games: Multi-Agent Game Simulation:** Demonstrates how to play multi-agent games with Gemini using PettingZoo and CrewAI, exploring AI interaction and strategy in simulated game environments. ([๐Ÿ“Š slides](https://google-gemini.github.io/workshops/games))
*   [`history/`](./history/) - **Interactive LLM History & Concepts Visualizer:** Provides dynamic, animated visualizations to explain the historical development, growth patterns, and conceptual challenges (like "model collapse") of Large Language Models. ([๐Ÿ“Š slides](https://google-gemini.github.io/workshops/history))
*   [`kundali/`](./kundali/) - **Kundali Milan: AI Jyotish Guru:** Implements a Jyotish guru as a CrewAI agent to perform Kundali Milan (Vedic astrology compatibility analysis) based on provided birth details.
*   [`podcast/`](./podcast/) - **AI Podcast Generator:** An innovative, AI-driven pipeline for automatically generating engaging podcast-style video content, from dynamic scripting and voice-overs to music integration and final video assembly.
*   [`prisoner/`](./prisoner/) - **LLM Agents in Game Theory: Prisoner's Dilemma Simulator:** A fascinating exploration into AI strategy, simulating the classic Prisoner's Dilemma with two distinct LLM agents strategizing and interacting in an evolving, turn-based game.
*   [`recap/`](./recap/) - **Workshop Recap:** A summary of key learnings and advancements made since the inception of the Gemini workshops, presented as a slide deck. ([๐Ÿ“Š slides](https://google-gemini.github.io/workshops/recap))
*   [`roast/`](./roast/) - **AI Roast Battle Arena: Uncensored Wit with LLMs:** Showcases a dynamic, multi-agent AI system designed to host uncensored roast battles between different LLMs, with an independent third LLM acting as an impartial judge.
*   [`scripts/`](./scripts/) - **LLM-Powered Content Generation & Developer Experience:** A robust framework for generating various content types, including presentation slides and factual summaries, powered by LLMs, emphasizing efficient developer workflows.
*   [`startupcon/`](./startupcon/) - **StartupCon Talk:** Workshop presentation on building with Gemini and AI agents. ([๐Ÿ“Š slides](https://google-gemini.github.io/workshops/startupcon))
*   [`smash/`](./smash/) - **Smash Bot: An LLM-Powered Fighting Game Agent:** A cutting-edge AI agent capable of playing a fighting game (like Super Smash Bros.) by observing the game state and controlling a virtual gamepad, powered by a multimodal LLM. ([๐Ÿ“Š slides](https://google-gemini.github.io/workshops/smash))
*   [`tv/`](./tv/) - **TV Companion: Your AI Film Critic and Controller:** An advanced LLM-powered assistant that "watches" movies and shows in real-time, offering insightful commentary, deep-diving into film trivia, and controlling your TV through natural voice commands.
*   [`utils/`](./utils/) - **LLM Demonstration Workshop Project:** A practical demonstration of integrating LLMs into applications, showcasing advanced capabilities through automated content generation, such as real-time news summaries and LLM cost comparisons.
*   [`waker/`](./waker/) - **Wind Waker Voice Chat with Controller Support:** A real-time AI gaming companion for The Legend of Zelda: Wind Waker, combining native audio voice chat, visual game understanding, direct controller actuation, and episodic memory. ([๐Ÿ“Š slides](https://google-gemini.github.io/workshops/waker))
*   [`wearable/`](./wearable/) - **Wearable Gemini:** Demonstrates how to embed Gemini on a Raspberry Pi Zero W with an Adafruit Voice Bonnet, utilizing Google's speech-to-text and text-to-speech for an on-device AI assistant.

---

## ๐Ÿš€ Getting Started

To explore and run these workshops:

1.  **Clone the Repository:**
    ```bash
    git clone https://github.com/google-gemini/workshops.git
    cd workshops
    ```
2.  **Navigate to a Project:** Choose a workshop from the directory above and navigate into its specific folder (e.g., `cd adventure/`).
3.  **Read the Project's README:** Each subproject has its own `README.md` with detailed instructions on prerequisites, installation, API key setup, and how to run the specific demonstration.
4.  **Install Dependencies:** Most projects will require Python and dependency management tools like `pip` or `poetry`. Follow the `README.md` for the exact steps.
5.  **Set Up API Keys:** Many workshops utilize the Gemini API and potentially other Google Cloud services (Speech-to-Text, Text-to-Speech). Ensure you have the necessary API keys and credentials configured as described in the individual project's `README.md`.

---

## ๐ŸŽฏ Categories

To help you find projects of interest, we've grouped them into thematic categories. Note that some projects may naturally fit into multiple categories due to their multifaceted nature.

### AI/ML Applications & Agents
Projects focusing on practical applications of LLMs, multi-agent systems, and real-world problem-solving.
*   [`adventure/`](./adventure/) - Dungeon Adventure using CrewAI and Gemini
*   [`artifacts/`](./artifacts/) - Unveiling Artifacts: AI-Powered Interactive Content Platform
*   [`bricks/`](./bricks/) - Bricks: Fine-tuning Gemini for Brick-Building
*   [`factuality/`](./factuality/) - Factuality: News Summarization & Redaction
*   [`kundali/`](./kundali/) - Kundali Milan: AI Jyotish Guru
*   [`roast/`](./roast/) - AI Roast Battle Arena: Uncensored Wit with LLMs
*   [`tv/`](./tv/) - TV Companion: Your AI Film Critic and Controller
*   [`utils/`](./utils/) - LLM Demonstration Workshop Project

### Games & Interactive Entertainment
Projects where AI interacts within game environments, provides interactive entertainment, or simulates game theory scenarios.
*   [`chess/`](./chess/) - Chess Companion
*   [`fantasy/`](./fantasy/) - Fantasy Football Draft Companion
*   [`games/`](./games/) - Gemini Plays Games: Multi-Agent Game Simulation
*   [`prisoner/`](./prisoner/) - LLM Agents in Game Theory: Prisoner's Dilemma Simulator
*   [`smash/`](./smash/) - Smash Bot: An LLM-Powered Fighting Game Agent
*   [`waker/`](./waker/) - Wind Waker Voice Chat with Controller Support

### Content & Media Generation
Projects focused on generating various forms of content, including text, visual, audio, and video, using AI.
*   [`artifacts/`](./artifacts/) - Unveiling Artifacts: AI-Powered Interactive Content Platform
*   [`bricks/`](./bricks/) - Bricks: Fine-tuning Gemini for Brick-Building
*   [`cameo/`](./cameo/) - Cameo: AI-Powered Personalized Video Generator
*   [`doodles/`](./doodles/) - Dynamic SVG Storytelling & Visualization
*   [`factuality/`](./factuality/) - Factuality: News Summarization & Redaction
*   [`podcast/`](./podcast/) - AI Podcast Generator
*   [`scripts/`](./scripts/) - LLM-Powered Content Generation & Developer Experience
*   [`utils/`](./utils/) - LLM Demonstration Workshop Project

### Tools, Utilities & Core Concepts
Projects that build development tools, illustrate fundamental AI concepts, provide analytical insights, or enhance developer experience.
*   [`cost/`](./cost/) - LLM Cost Comparison
*   [`history/`](./history/) - Interactive LLM History & Concepts Visualizer
*   [`recap/`](./recap/) - Workshop Recap
*   [`scripts/`](./scripts/) - LLM-Powered Content Generation & Developer Experience
*   [`utils/`](./utils/) - LLM Demonstration Workshop Project

### Hardware & Edge AI
Projects demonstrating the integration of AI capabilities on physical hardware or embedded systems.
*   [`wearable/`](./wearable/) - Wearable Gemini

---

We encourage you to explore, experiment, and contribute! Happy learning!