We are:
Goodie AI is the pioneering LLM visibility and AI search optimization platform enabling the world’s top brands to own their AI narrative across leading LLMs like ChatGPT, Gemini, and Perplexity. Backed by strong funding and validated by active paying customers, we are scaling fast and tackling some of the hardest AI search challenges.
After you apply, check out Goodie AI to learn even more!
We are looking for:
Goodie AI is searching for a talented and ambitious Data Scientist to join our growing team! Goodie helps brands win visibility and revenue across AI search, LLMs, and agentic commerce. You will be the point person turning messy multi-model signals into measurement, forecasts, and optimizations that our product can act on. If you enjoy building models that ship and change customer behavior, you will like this seat.
You’ll do:
- Work with large datasets. Own efficient querying, cleaning, labeling, and taxonomy alignment for brands, SKUs, and categories.
- Design sampling and classification strategies that turn noisy LLM outputs and crawler logs into reliable brand and product insights.
- Use LLMs and NLP to extract structure from unstructured text at scale. Topics include query fan-out, sentiment, citation extraction, and entity linking for brands, products, and creators.
- Define product-grade metrics. Create durable definitions for visibility score, answer coverage, product presence, and agentic checkout readiness.
- Build and run experimentation frameworks. Use A/B tests, holdouts, counterfactual analysis, and uplift modeling to quantify impact on citations, share of voice, and conversions.
- Develop and refine predictive models that analyze and forecast AI search behavior across LLMs and surfaces.
- Translate complex findings into clear decisions. Partner with the founding team to inform roadmap, pricing, and customer playbooks.
- Create evaluation harnesses. Establish automated evals and human-in-the-loop labeling to track model quality, bias, and drift across LLM providers.
- Detect anomalies. Build monitors for crawler behavior, rankings, and feed health to catch regressions before customers do.