Helping pet parents appreciate the real cost of vet care through an personalized, LLM-powered report, driving a 125% lift in visit-to-quote conversion.
Timeline
Jul 2025 - Oct 2025
Product
Pumpkin Wellness Club
My Role
Originitated & led, end-to-end
Tech Stack
Typescript (Next.js), Zod, Drizzle ORM, Gemini API (via Vercel AI SDK), Shadcn, Motion
I was investigating CVR gaps across the funnel when a pattern emerged from user interviews: prospective customers — especially first-time pet owners — couldn't evaluate whether a wellness plan was worth it because they had no baseline for what vet care actually costs.
“I had no idea how much certain things would cost because this is my first dog.” - Ellie R., DTC lead
“I wasn’t sure I’d use all the items enough to make it worth it.” - Stephanie S., DTC lead

This wasn't a roadmap item. I'd been digging into our CVR gaps, kept hearing the same cost-uncertainty in interviews, and built a prototype to test the bet before anyone asked for one.

The goal was to serve as a gut-check for pet parents considering Pumpkin Wellness Club: here's roughly what you might spend, here's what a wellness plan covers, and now you can make an informed decision.
Building it myself meant owning calls a designer usually doesn't touch: which model to trust, and how responsibly to use it. Initially, I created this as a proof of concept, but after reviewing it with my engineering partners, my final product ended up being robust enough to launch into prod.
For this project, I built most of the frontend components myself, using Claude Code to speed up some of the backend work.
I tested several models, and Gemini 3.0 Flash won on for best balancing what mattered here: clean structured output, the most accurate price ranges, and cheap to generate.


While it's gotten less likely, LLMs are prone to hallucinating, and to release an MVP so we could learn quickly, we deliberately didn't use anything like RAG to root the estimates in actual cost of care data.
When designing the prompts used to generate the report, I instructed the model to provide an estimated range, and tested reliability against actually industry cost data. (Ultimately, with some refinements to the prompt, I was able to achieve approximately ~90% accuracy in the outputted cost ranges.)
This was the most unique part of this project, and somewhere that straddling both design and code allowed me to solve very effectively.
When testing, I noticed that sending Gemini a giant prompt produced vague, inconsistent reports. So I broke generation into 5 individual, focused prompts and designed how they'd run: health risks and the wellness plan generate concurrently to keep the report fast, while the treatment plan (which relies on knowing each of the top 3 health risks) waits for the health risks object to be generated before finishing.
Shipped to real users. The Pet Health Predictor generated a 125% higher visit-to-quote rate than typical entry points (including search, ads, and organic traffic).
