
This article demonstrates how Google Gemini can scrape data from a video, converting scattered numerical values into JSON at an incredibly low cost: less than a tenth of a cent per operation.
The other day, I faced a common but tedious task: adding up numeric values scattered across twelve different emails. Instead of manually copying and pasting each number, I decided to try something innovative: using screen capture and Google Gemini to extract the data.
I started by recording a 35-second screen capture using QuickTime Player on my Mac. The process was straightforward:
File -> New Screen Recording.
Next, I uploaded the video to Google’s AI Studio, an AI-powered tool that can process and analyze various types of data. I prompted Gemini with the following request:
Turn this into a JSON array where each item has a yyyy-mm-dd date and a floating point dollar amount for that date
To my surprise, it worked flawlessly. The output was a well-structured JSON array:
[
{
"date": "2023-01-01",
"amount": 2...
},
...
]
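Model output like this is worth checking mechanically before it is used. The following is a minimal sketch, assuming the response is a plain JSON array with exactly the `date` and `amount` keys shown above. The function name `parse_entries` and the sample values are hypothetical and are not the figures from the video.

```python
import json
from datetime import datetime


def parse_entries(raw_json):
    """Parse the model's JSON and check the shape of every item."""
    entries = json.loads(raw_json)
    if not isinstance(entries, list):
        raise ValueError("expected a JSON array")
    parsed = []
    for item in entries:
        # strptime raises ValueError if the string cannot be read as year-month-day
        day = datetime.strptime(item["date"], "%Y-%m-%d").date()
        amount = item["amount"]
        # bool is a subclass of int in Python, so reject it explicitly
        if isinstance(amount, bool) or not isinstance(amount, (int, float)):
            raise ValueError(f"amount is not a number: {amount!r}")
        parsed.append((day, float(amount)))
    return parsed


# Hypothetical sample values for illustration only
sample = '[{"date": "2023-01-01", "amount": 12.5}, {"date": "2023-02-01", "amount": 7.25}]'
entries = parse_entries(sample)
```

A check like this catches malformed dates and non-numeric amounts, but it cannot tell whether a number was misread from the screen, so it complements a manual review rather than replacing it.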
Since I needed the data in a format that could be easily pasted into Numbers, I followed up with another prompt:
turn that into copy-pastable csv

Gemini delivered the same data formatted as a CSV:
date,amount
2023-01-01,2...
...
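The same JSON-to-CSV conversion can also be done locally instead of with a follow-up prompt. This is a rough sketch using Python's standard `csv` module, assuming the JSON array has only the `date` and `amount` keys. The helper name `json_to_csv` and the sample values are made up for illustration.

```python
import csv
import io
import json


def json_to_csv(raw_json):
    """Convert a JSON array of {date, amount} objects into CSV text."""
    rows = json.loads(raw_json)
    buffer = io.StringIO()
    # DictWriter raises ValueError if an item carries a key outside fieldnames
    writer = csv.DictWriter(buffer, fieldnames=["date", "amount"], lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical sample values for illustration only
sample = '[{"date": "2023-01-01", "amount": 12.5}, {"date": "2023-02-01", "amount": 7.25}]'
csv_text = json_to_csv(sample)
```

Doing the conversion in code avoids a second round trip to the model, and with it any chance that a value changes between the JSON and the CSV.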
To ensure accuracy, I re-watched the 35-second video and manually verified the numbers. Gemini got everything right.
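With the numbers verified, the addition that motivated the exercise takes only a few lines. The sketch below assumes the entries have already been loaded from the JSON output. It uses `Decimal` so that dollar amounts add up without binary floating-point drift. The name `total_amount` and the sample values are hypothetical.

```python
from decimal import Decimal


def total_amount(entries):
    """Sum the amount field of every entry as an exact decimal."""
    # str() first, so 7.25 becomes Decimal("7.25") rather than a long binary expansion
    return sum((Decimal(str(entry["amount"])) for entry in entries), Decimal("0"))


# Hypothetical sample values for illustration only
sample_entries = [
    {"date": "2023-01-01", "amount": 12.5},
    {"date": "2023-02-01", "amount": 7.25},
]
total = total_amount(sample_entries)
```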
I intended to use Gemini 1.5 Pro, Google’s most advanced model, but accidentally used Gemini 1.5 Flash 002, a much less expensive option. According to AI Studio, I used 11,018 tokens, with 10,326 tokens dedicated to the video.
At that model's pricing, the entire process should have cost just under 1/10th of a cent.
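The arithmetic behind that estimate can be sketched as follows. The per-token price is an assumption: the text does not state it, and $0.075 per million input tokens is the rate I recall being listed for Gemini 1.5 Flash around that time. The sketch also prices all 11,018 tokens at the input rate, which is a simplification, since output tokens are normally billed separately.

```python
# ASSUMPTION: price not given in the article; believed to be the Gemini 1.5 Flash
# input rate at the time, in US dollars per million tokens.
PRICE_PER_MILLION_TOKENS_USD = 0.075

# Total token count reported by AI Studio, as stated in the article
TOTAL_TOKENS = 11_018


def estimate_cost_usd(tokens, price_per_million):
    """Estimate cost in dollars, pricing every token at a single flat rate."""
    return tokens / 1_000_000 * price_per_million


cost = estimate_cost_usd(TOTAL_TOKENS, PRICE_PER_MILLION_TOKENS_USD)
print(f"Estimated cost: ${cost:.6f}")
```

Under that assumed rate the estimate comes to roughly $0.0008, which is consistent with the "just under 1/10th of a cent" figure.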
However, it turned out to be free. Google AI Studio is currently free of charge in all supported regions, whether or not you have set up billing. This is part of Google's effort to encourage users to try its tools, though it also means Google can train on your data, which is not the case for its paid APIs.
This experiment demonstrates the power and efficiency of modern AI tools like Google Gemini in handling complex tasks with minimal effort and cost. Whether you're dealing with emails, documents, or other data sources, these tools can significantly streamline your workflow.
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
25 October 2024