AI Mode, Generative AI, SEO

Your Inbox Might Be the Next AI Search Signal

by Garrett Sussman

05.21.2026

//.study_co_authors

Garrett Sussman

Marketing Director

Michael Tandoh

SEO ENGINEER

Cate Dombrowski

ASSOCIATE BI ANALYST

We Tested Google Personal Intelligence. Gmail Signals Changed AI Mode Brand Recommendations.

TL;DR

Brand signals added to personal context in a Google Personal Intelligence connected account changed AI Mode recommendations: those brands were +46 percentage points more likely to appear versus control.
The lift was measurable: brand appearance rose from 23.9% to 66.8% in the Personal Intelligence account.
Brands moved into better positions: they gained +23.1 points in top-3 placement and +42.8 points in top-10 placement.
Gmail was the clearest tested signal: brands introduced through email appeared in 53.6% of responses, making inbox content a major AI Search variable.

Your inbox may be turning into an AI search signal.

That sounds dramatic until you read Google’s own description of Personal Intelligence. Google says AI Mode can now connect to personal context from apps like Gmail and Photos when a user opts in, making Search responses more tailored to that person.

So we tested the obvious next question:

If AI Mode can use personal context, can that context change which brands it recommends?

We created a controlled experiment using three Google accounts: a blank control account, a blank account connected to Personal Intelligence, and my own mature Google account with years of real personal history. Then we seeded brand signals into Gmail and Photos, ran recommendation prompts across eight product and service categories, and measured whether those brands appeared more often in AI Mode.

They did.

Across 1,922 AI Mode responses, seeded brands in the Personal Intelligence-connected account saw statistically significant visibility lift compared with the control account. Gmail-based signals were far stronger than Photos. The brands did not just appear more often. They moved into more prominent recommendation positions.

This has major implications for AI search measurement. Brand visibility is starting to move beyond the public web and into the user’s personal context.

What We Wanted to Test

Google’s Personal Intelligence documentation describes a clear direction toward AI that can understand personal context across Google products. The feature is opt-in. Google says users control whether to connect sources like Gmail and Photos to AI Mode, and those connections are off by default.

That opt-in detail matters.

We were testing whether AI Mode connected to Personal Intelligence would behave differently after brand signals were added to connected personal data sources. Not AI Overviews or the default version of AI Mode.

Primary research question:

When Google AI Mode is connected to Personal Intelligence, can seeded personal-context signals influence which brands appear in recommendations?

Secondary questions:

Does a Personal Intelligence-connected account show higher brand lift than a non-connected control?
Do Gmail signals create more lift than Google Photos signals?
Do seeded brands move into more prominent positions, such as top 3 or top 10?
Does the effect vary by category?
Does prompt framing change how much personalization appears?
Does AI Mode ground personalized recommendations in citations, or does it surface brands without external support?

The Hypothesis: Personal Context Can Shift the Recommendation Set

If Google Personal Intelligence meaningfully influences AI Mode recommendations, then brands seeded into connected personal data sources should appear more often, earn more share of voice, and move into more prominent recommendation positions than the same brands in a control account without the Personal Intelligence connection.

Secondary Hypotheses:

Hypothesis	Rationale
Gmail seeds will outperform photo seeds	Recommendation-style email gives AI Mode explicit text signals.
Real brands will outperform fake brands	Real brands have external web entities, citations, and shopping data.
Category effects will vary	AI Mode may be more flexible in subjective consumer categories than trust-heavy categories.
Prompt framing will matter	“Recommend 3” and “safest choices” prompts may trigger more personalized recommendation behavior.
Web grounding will remain active	Personal context may change the brand set, but citations may still come from web sources.

Methodology

Google describes Personal Intelligence as an opt-in feature that connects Google apps such as Gmail and Photos to AI experiences so responses can become more tailored to the individual user. In AI Mode in Search, Google says connecting Workspace and Photos is off by default, and users choose whether to turn it on.

That opt-in distinction was central to this experiment.

We were not testing whether standard AI Mode returns the same recommendation set for every user. We were testing whether AI Mode connected to Personal Intelligence would change brand recommendations after relevant personal-context signals were added to the connected account.

Research Question

We designed the study around one primary question:

When a Google account is opted into Personal Intelligence for AI Mode, can seeded personal-context signals from Gmail or Photos create measurable brand lift in AI Mode recommendations compared with a control account without that Personal Intelligence connection?

Accounts Tested

We used three account conditions.

Account condition	Purpose
Blank control account	A fresh Google account used as the non-personalized comparison point. This account was not connected to Personal Intelligence.
Blank Personal Intelligence account	A fresh Google account connected to Personal Intelligence, then seeded with brand-related signals through Gmail and Photos.
Garrett’s personal account	A mature Google account with long-term personal history across Google products. This account was used to compare experimental seeding against naturally accumulated personal context.

The control account gave us a baseline for AI Mode behavior without the Personal Intelligence connection. The Personal Intelligence account let us test whether seeded Gmail and Photos signals could change recommendations. Garrett’s account let us observe how a long-running personal history might shape brand visibility outside a synthetic seeding protocol.

Categories Tested

We tested eight recommendation categories:

Category

Coffee machines

Streaming services

Banks

SEO agencies

Hoodies

Running shoes

Smartphones

Productivity tools

These categories were selected to include a mix of consumer products, subjective preferences, trust-heavy decisions, software, services, and branded recommendation spaces.

Prompt Design

Each category was tested across six prompt types. The prompts were designed to capture different recommendation behaviors, including:

Prompt type	Example pattern
General “best” prompt	“What are the best [category] right now?”
Use-case prompt	“What [category] are best for [specific use case]?”
Recommendation-list prompt	“If you had to recommend 3 [category], which would you choose?”
Risk-reduction prompt	“What [category] are the safest choices?”
Trust prompt	“Which [category] are the most reliable?”
Category-specific utility prompt	Prompts tailored to each product or service category.

This allowed us to test whether Personal Intelligence effects varied based on how the user framed the query.

Seeding Design

For each category, we selected brands for different seed types:

Category	Photo seed	Email seed	Fake email seed
Running Shoes	Enko	Salomon	Velstride
SEO Agency	Previsible	RevenueZen	Northpeak SEO
Banks	PNC Bank	Truist	HarborTrust
Coffee Machines	Jura	Ratio	Brewform
Hoodies	Russell Athletic	Uniqlo	Greyfen Apparel
Smartphones	Unihertz	Sonim	Ironclad Mobile
Streaming Service	Philo	Tubi	Streamwell
Productivity Tools	Things 3	TickTick	Kavro

The email seeds were written as recommendation-style messages. The photo seeds were added to Google Photos. The fake email seeds let us test whether Gmail context alone could surface a brand with limited or nonexistent external brand authority.

What the Seeded Emails Looked Like

The Gmail seeds were short, recommendation-style messages. Each email connected a brand to a category and a reason someone might choose it.

We used similar recommendation-style emails across categories. We also tested fictional brand names to see whether Gmail context could introduce a brand with little or no external footprint.

Fake Brand Stress Test

We included a fake-brand condition to test whether Personal Intelligence-connected Gmail context could introduce a new brand association into AI Mode recommendations.

For each category, we created a fictional brand and seeded it through a recommendation-style Gmail message using the same format as the real-brand email seeds. These fictional brands included Velstride, Northpeak SEO, HarborTrust, Brewform, Greyfen Apparel, Ironclad Mobile, Streamwell, and Kavro.

This condition helped separate two possible behaviors:

Test condition	What it helped us evaluate
Real brand email seeds	Whether Gmail context could reinforce brands with existing public web entities.
Fake brand email seeds	Whether Gmail context could introduce a novel brand association from the user’s personal data alone.

Since the fake brands were fictional, we did not expect AI Mode to cite public web sources for them. For this condition, the key question was whether the brands appeared in recommendations after being seeded through Gmail.

Data Collection

The study analyzed 1,922 AI Mode responses and 22,064 brand-level rows collected between March 30 and April 15, 2026. Each brand-level row represented a brand appearing in one AI Mode response.

We tracked:

Account condition
Date
Category
Prompt
Brand mentioned
Brand rank or position
Whether the brand was seeded
Seed type
Whether the brand appeared in top 3
Whether the brand appeared in top 10
Cited URL
Cited domain
Citation source type
Personalization language
Strong recommendation language

Metrics

We ran each prompt manually in AI Mode across all three Google accounts, then used a custom JavaScript snippet in the browser to extract response text, brand mentions, ranks, and citation URLs directly from the page DOM. The dataset was processed in Python, where we used natural language processing to classify personalization signals and strong-recommendation language, computed the visibility, prominence, and citation metrics described above, and ran the difference-in-differences comparison between the Personal Intelligence-connected account and the control.

Metric	Definition
Appearance rate	Share of AI Mode responses where a seeded brand appeared.
Seeded share of voice	Share of visible brand mentions captured by seeded brands.
Top-3 rate	Share of responses where a seeded brand appeared in the first three recommendation positions.
Top-10 rate	Share of responses where a seeded brand appeared in the first ten recommendation positions.
Citation rate	Share of brand mentions that included a cited source.
Coverage-aware rank	Average rank across every response. Brands missing from a response count as rank 20, so brands that rarely appear get pulled toward 20. Lower is better.
Personal context signal rate	Share of responses with visible signs of personal-context use, such as Gmail, Photos, or user-specific phrasing.
Strong recommendation language	Responses with clear endorsement language, reliability signals, or low hedging.

Analysis Approach

We used a difference-in-differences approach to compare brand lift in the Personal Intelligence-connected account against the control account.

This helped separate actual treatment lift from general AI Mode volatility. For example, if a brand became more common in both the Personal Intelligence account and the control account, that would be weaker evidence of personalization. If lift appeared mainly in the Personal Intelligence-connected account after seeding, that would suggest the connected personal-context layer played a role.

The strongest topline result was that seeded brands became 46 percentage points more likely to appear in the Personal Intelligence-connected account versus control. Seeded brands also gained 7.2 points in share of voice, 23.1 points in top-3 placement, and 42.8 points in top-10 placement.

What This Methodology Does and Does Not Show

This methodology can show whether brand recommendations changed after personal-context seeding in an account connected to Personal Intelligence.

It cannot show Google’s internal ranking logic. We observed AI Mode outputs, citations, brand mentions, and recommendation positions. We did not have access to Google’s internal retrieval process, model weights, ranking systems, or Personal Intelligence decision layer.

For that reason, the study should be read as evidence of observed brand lift in AI Mode under an opted-in Personal Intelligence condition, not as a claim about a universal ranking factor in default AI Mode.

Finding 1: Seeded brands lifted sharply in the Personal Intelligence account

The clearest result was the lift in the account connected to Personal Intelligence.

Seeded brands became much more likely to appear in AI Mode recommendations after seeding, and the lift was statistically significant versus control.

Seeded-brand appearance rose from 23.9% to 66.8% in the Personal Intelligence-connected account. In the control account, it moved from 21.9% to 18.9%.

That is the core finding.

The seeded brands did not simply show up more often. They moved into more prominent positions. Top-3 placement improved from 4.5% to 24.9%, and top-10 placement improved from 17.7% to 54.6% in the seeded Personal Intelligence account.

The time-series matters. The lift was not a slow background trend. The Personal Intelligence account tracked close to control early in the test, then separated after the seed events. The control account stayed relatively flat.

That pattern makes the result harder to dismiss as normal AI Mode volatility.

Finding 2: Gmail signals beat photo signals

The Personal Intelligence account did not respond to every seeded signal equally.

Gmail was the stronger signal by a wide margin.

Seed type

Appearance rate

Avg rank when present

Coverage-

aware rank

Top-3 rate

Top-10 rate

Citation rate

53.6%

6.78

12.92

18.7%

42.7%

11.35%

Photo

10.5%

7.82

18.73

1.1%

8.3%

0.54%

Email seeding drove more appearances, better visibility, stronger ranking positions, and far more citations. Photo seeding was weak in most categories.

On average rank when present, which only counts responses where the brand actually appeared, the two seeding types look almost identical: 6.78 for email versus 7.82 for photo. But that metric hides the bigger problem. Coverage-aware rank includes every response, assigning a rank of 20 to absent ones and averaging across all of them, so brands that rarely appear get pulled toward 20. Email scored 12.92. Photo scored 18.73, almost the absence penalty itself. The metric reveals what average rank obscured: photo brands mostly weren’t appearing in the first place.

This finding makes intuitive sense.

Email seeds were text-rich. They were recommendation-oriented. They gave the system explicit language connecting a brand to a use case, preference, or decision. Photos were weaker, especially when the image alone did not clearly translate into a brand preference.

There was one clear exception: coffee machines.

Coffee machines were the only category where photo seeding produced meaningful movement. Jura was already a known premium brand, so the photo signal may have reinforced an entity AI Mode already understood.

The practical takeaway:

In this test, a recommendation sitting in Gmail mattered more than a brand sitting in Google Photos.

For brands, that points directly at lifecycle marketing. Product education emails, recommendation emails, receipts, onboarding emails, renewal notices, and newsletters may become part of the personal context layer AI systems retrieve from.

That does not mean brands should spam people. It means owned audience touchpoints may have search implications when AI Search gets personal.

Finding 3: Fake brands appeared after Gmail seeding

We wanted to push the test further.

Real brands have existing entity footprints. They may have websites, reviews, product pages, shopping data, news coverage, comparison mentions, and other signals across the web. So when a real brand appears after being seeded into Gmail, there are two possible explanations:

Personal Intelligence picked up the Gmail context.
AI Mode connected that Gmail context to a brand it could already understand from the web.

So we ran a fake-brand stress test.

For each category, we created a fictional brand and seeded it through Gmail using the same recommendation-style format as the real-brand email seeds. These included names like Velstride, Northpeak SEO, HarborTrust, Brewform, Greyfen Apparel, Ironclad Mobile, Streamwell, and Kavro.

The question was simple:

Could a brand with no real-world footprint appear in AI Mode recommendations after being introduced through Gmail?

It did.

Fake email-seeded brands appeared in 35.7% of relevant responses. Real email-seeded brands performed better, appearing in 55.8% of responses, but the fake-brand appearance rate is the more provocative finding.

The citation rate needs the right interpretation.

Fake brands had a 0% citation rate, which is exactly what we should expect. These brands did not exist. There were no websites, product pages, reviews, articles, shopping listings, or third-party references for AI Mode to cite.

So the fake-brand test should not be judged by citation rate. It should be judged by whether Gmail-seeded personal context could introduce a novel brand association into the recommendation set.

That is what happened.

The better read is this:

Gmail context appeared strong enough to introduce a fictional brand into AI Mode recommendations, but real brands still had the advantage of external web grounding.

That matters for brands, but it needs a disciplined takeaway.

This does not mean brands can manufacture durable AI visibility out of nothing. The fake brands appeared, but they had no external proof layer. Real brands had something the fake brands did not: entity support, web presence, and sources AI Mode could cite.

The strategic lesson is more useful:

Personal context may help a brand enter the answer. The web helps make that brand credible inside the answer.

Finding 4: Category mattered

The lift was not evenly distributed.

Some categories were much more responsive to Personal Intelligence seeding than others.

Coffee Machines, Hoodies, Running Shoes, and Smartphones had stronger share-of-voice lift. Banks, Productivity Tools, Streaming Services, and SEO Agency were harder to move.

This is where the study gets more useful.

Personalization appears more flexible in categories where preference plays a larger role. Hoodies, running shoes, and coffee machines all have obvious preference signals. Style, comfort, use case, budget, taste, and perceived quality matter.

Expertise-heavy and trust-heavy categories were harder to shift.

SEO agencies barely moved. Banks moved, but less dramatically than consumer product categories. That is probably a healthy thing. We should not expect one email recommendation to rewrite trust-heavy AI recommendations in financial services or expert B2B services.

Brand-level results show the same pattern.

Ratio and Salomon saw massive gains. RevenueZen moved, but far less. Enko did not move at all. Philo dropped despite seeding.

The takeaway:

Personalization is not equally malleable across categories.

For brands, this means AI search strategy needs category-specific testing. A playbook for running shoes may not translate to banking. A playbook for streaming may not translate to B2B services.

Finding 5: Real account history created its own brand-affinity profile

The synthetic account showed that seeded context could influence AI Mode.

My personal account showed something different: a mature Google account can create its own brand-affinity profile.

Some brands tied to my real history surfaced at much higher rates than they did in the control account.

These were not the primary experimentally seeded brands. They appeared to reflect real account history.

This is one of the more important strategic implications from the study.

The seeded account tells us that a concentrated signal can move brand recommendations. The personal account suggests a user’s long-term exposure, receipts, subscriptions, preferences, and behaviors may already shape the AI Mode recommendation set.

That matters for brands.

If a person has years of emails from Nike, Starbucks, Bank of America, Delta, Sephora, The North Face, or any other brand, that context may matter when AI Mode answers a future recommendation query.

The study also showed that concentrated seeding often outperformed organic history for the same brands.

The likely reason is concentration. The synthetic seed was recent, focused, and repeated in a narrow test window. Organic history is messier. Real people have fragmented inboxes, mixed signals, old purchases, abandoned interests, and preferences that change.

Google’s own Personal Intelligence documentation names this type of challenge. It calls out risks like overpersonalization, mistaking another person’s preferences for the user’s own, mixing timelines, and making incorrect assumptions from transaction records.

That is exactly why this space matters.

AI personalization will not always be clean. Real human context is noisy.

Finding 6: Prompt framing changed the strength of personalization

The way the question was asked changed the result.

Some prompts produced massive seeded-brand lift. Others did not.

The report found that constrained recommendation prompts, especially prompts asking AI Mode to “recommend 3,” often showed the strongest seeding effects. Broad “best right now” prompts were less consistent.

This is a measurement problem.

Many AI search tracking setups use one prompt per topic. That will miss a lot. Prompt framing can change which brands appear, which citations are used, and whether personal context gets pulled into the answer.

A user asking “What are the best coffee machines right now?” may get a different style of answer than someone asking “If you had to recommend three coffee machines for everyday use, which would you choose?”

The first prompt may trigger broad category consensus. The second may push the system to make a tighter recommendation. That tighter recommendation set may create more room for personal context.

The report found a similar pattern with strong recommendation language. Prompts using “safe,” “dependable,” or “reliable” framing produced high-confidence recommendations more often than generic “best right now” or use-case prompts.

The takeaway:

Prompt design is measurement design.

If you change the prompt, you may change the visibility result. That makes AI search measurement much more complex than keyword rank tracking.

Finding 7: Personalization did not replace web grounding

The most interesting part of Personal Intelligence is that it connects AI Mode to personal context.

But that did not make the public web irrelevant.

AI Mode still grounded many brand recommendations in web sources.

Other brand sites (including both indirect and direct competitor sites) were the largest citation source. For example, Other brand sites include Capital One, which is mentioned on chime.com, both in the financial industry. But Dick’s selling Hokas running shoes is part of Retailer, because they are not competitors.

Brand-owned sites and Google Shopping were major sources, too. Fully uncited or personal-context-only mentions were a smaller share.

This gives us a more nuanced read.

Personal context may influence which brands enter the recommendation set. The public web may still help AI Mode justify, enrich, or validate the recommendation.

The post-seeding citation behavior supports that idea. The report found that no-citation behavior dropped after seeding:

After seeding, AI Mode appeared to connect more brand mentions to external sources instead of surfacing brands without citations.

That is a big Relevance Engineering point.

Personalization does not remove the need for strong web presence. It may raise the bar. A brand needs to be visible in the user’s context and legible to AI systems through the broader web.

For product categories, Google Shopping mattered. For categories like banks, streaming, productivity tools, and SEO agencies, brand-owned sites and other brand sites carried more weight.

The AI answer is built from multiple layers:

Personal context
Prompt framing
Brand entity understanding
Owned site clarity
Third-party validation
Product data
Citation availability
Category-level authority

The brands that succeed with AI Search will need to show up across more than one layer.

What this means for brands

The old measurement model fundamentally breaks for AI Search.

Classic SEO measurement assumes a fairly stable public result set. Rankings vary, sure. Location, device, history, and personalization have mattered for years. But AI Mode plus Personal Intelligence introduces a more direct personalization layer.

Now the question changes from:

“What does AI Mode recommend?”

to:

“What does AI Mode recommend for this user, with this history, asking this prompt, in this category?”

That is a much harder problem. It is also the real problem.

1. AI Search visibility is becoming user-specific

One prompt does not equal one universal answer.

A Personal Intelligence-connected account may produce a different brand set than a blank control account. A mature account may produce yet another brand set based on years of personal context.

Brands need to measure visibility across:

Account types
Persona profiles
Prompt variants
Categories
Intent stages
Citation source types
Recommendation strength

This is where AI search visibility starts to look less like rank tracking and more like audience research, entity analysis, and retrieval testing mashed together.

2. Email may become part of AI Search influence

Gmail was the strongest tested signal in this experiment.

That does not mean brands should jam inboxes with garbage. That would be a race to the bottom, and users will opt out, unsubscribe, ignore, or filter it.

The better takeaway is that useful customer communication may have a second-order effect in AI-mediated discovery.

Think about:

Product recommendation emails
Order confirmations
Receipts
Renewal notices
Loyalty emails
Educational nurture sequences
Post-purchase support emails
Comparison guides
Back-in-stock messages
Event confirmations
Account summaries

Those touchpoints already shape user memory. Personal Intelligence may allow them to shape AI memory too.

The new CRM question is not just “Did they click?”

The new question is:

Did this communication help the user and create a durable brand association that AI can retrieve later?

3. Brand authority still matters

The citation data is a reality check.

AI Mode did not just pull seeded brands from thin air. It often connected brand mentions to the web: other brand sites, owned sites, Google Shopping, news, retailers, review sources, and reference sites.

Brands still need:

Clear owned-site information
Strong product pages
Entity consistency
Third-party validation
Review visibility
Comparison visibility
Shopping feed quality
Digital PR
Category authority

The user’s inbox may help introduce a brand into the conversation. The web still helps AI Mode decide what to say about that brand.

4. Measurement needs to move past rank tracking

AI Search measurement needs more dimensions.

Measurement layer	Metrics
Visibility	Appearance rate, share of voice
Prominence	Top-3, top-10, rank
Grounding	Citation URL, citation domain, source type
Persuasion	Strong recommendation language
Personalization	Personal context signal rate
Category strategy	Prompt-type performance

Rank still matters. It just does not capture enough.

If two brands both appear in AI Mode, the next questions are:

Which one appears first?
Which one is framed as safer?
Which one gets stronger language?
Which one gets cited?
Which source gets cited?
Which one appears for a blank account?
Which one appears for a mature personal account?
Which one appears for a user with relevant emails?
Which one appears in a constrained recommendation prompt?

That is the measurement work now.

What this study does and does not prove

This study needs a clear boundary.

It shows observed brand lift under an opted-in Personal Intelligence condition. It does not reveal Google’s internal ranking logic.

What this study supports

Supported by this study

Seeded brands in the Personal Intelligence-connected account saw statistically significant lift versus control.

Gmail seeding outperformed photo seeding in this test.

Seeded brands moved into more prominent positions, including top-3 and top-10 placements.

Brand lift varied by category and prompt framing.

AI Mode often grounded brand recommendations in web sources.

A mature personal Google account produced a distinct brand-affinity profile.

What this study does not prove

Out of scope
Gmail is a universal ranking signal.
Personal Intelligence affects default AI Mode results for users who have not opted in.
Photos never matter.
All industries will behave the same way.
Brands can reliably manipulate AI Mode recommendations.
We know Google’s internal retrieval or ranking process.

A few limitations matter.

First, this tested opted-in Personal Intelligence, not default AI Mode. Google says AI Mode connections to Workspace and Photos are off by default and controlled by the user.

Second, we observed outputs. We did not observe Google’s internal retrieval, ranking, model decision systems, or Personal Intelligence engine.

Third, this was a short study window from March 30 to April 15, 2026. Longer studies are needed to measure persistence, decay, and model changes.

Fourth, the account sample was small. The test design was useful for controlled comparison, but it does not represent the full range of Google users.

Fifth, the photo test should be treated carefully. Photo seeding was weak in most categories, but better image labeling, different image types, more time, or different categories could change the result.

Sixth, personal-context cards were not fully captured. The report notes that we cannot determine what share of no-citation brand mentions came specifically from Gmail personalization versus broader model knowledge.

That is why the safest and strongest claim is this:

In this experiment, brands seeded into a Personal Intelligence-connected account saw statistically significant AI Mode visibility lift compared with a non-connected control account, with Gmail-based signals creating the strongest observed effect.

That is plenty.

What we would test next

This experiment opened more questions than it closed.

That is good. It means the next round can get sharper.

1. More Accounts

We need larger cohorts:

More blank controls
More Personal Intelligence-connected accounts
More mature personal accounts
Persona-seeded accounts
Different locations
Different devices
Different Google account histories

2. Isolated Signal Types

The next version should isolate each personal-context source:

Gmail only
Photos only
Search history where available
YouTube where available
Calendar if supported and documented
Receipts vs. newsletters vs. recommendations
Order confirmations vs. product education

3. Email Behavior Variants

Gmail was the strongest signal, so Gmail needs deeper testing.

Test:

Opened vs. unopened
Clicked vs. unclicked
Starred vs. unstarred
Archived vs. inbox-visible
Deleted after exposure
Multiple senders
Different email formats
Different recency windows
Repeated mentions vs. one mention

4. Signal Decay

Does the lift last?

Track:

24 hours
3 days
7 days
14 days
30 days

This is one of the most commercially important follow-ups. If brand lift decays fast, it points toward recency. If it persists, it points toward durable account-level brand association.

5. More Verticals

The category differences were too large to ignore.

Next categories should include:

Travel
Retail
Healthcare
Insurance
Financial products
Restaurants
B2B software
Home services
Auto
Education

6. Better Prompt Taxonomy

The prompt results show that tracking one prompt per category is a weak measurement.

Future testing should group prompts by:

Exploratory research
Transactional intent
Risk reduction
Comparison
Replacement
Loyalty
“Recommend 3”
“Best for me”

We don’t need to focus on whether Personal Intelligence can change recommendations. That’s expected.

The question is which signals matter most, for which users, in which categories, and for how long.

The iPullRank POV: AI Search visibility is a context problem

This is where SEO and GEO continue to move.

Classic SEO asked:

Where do we rank?
Which page gets traffic?
Which keyword converts?
Which page got cited?

AI search adds harder questions:

Which brand gets recommended?
For which user?
With which personal context?
Under which prompt framing?
With which cited source?
With what confidence language?
Across which AI surface?

Personal Intelligence raises the stakes, but it’s more difficult to strategize for at scale. A user’s connected data may influence what AI Mode retrieves, how it reasons, and which brands it recommends.

That means brands need to think across three visibility layers.

Layer	What brands need to build
Public web	Owned-site clarity, content quality, entity signals, third-party validation
AI retrieval layer	Passage-level relevance, citation-worthiness, category coverage
Personal context layer	Useful email, CRM, product education, receipts, recommendations, post-purchase content

It all brings us back to Relevance Engineering.

It is the work of making a brand retrievable, recommendable, and credible across the systems AI uses to construct answers.

That includes the public web. It includes third-party sources. It includes structured product data. It includes digital PR. It includes content strategy. And now, with opt-in Personal Intelligence, it may include the customer’s own personal data trail.

If your AI Search measurement assumes every user sees the same recommendation set, you are missing the personalization layer.

AI Search visibility is about more than prompt tracking from one clean “lab” account and calling it a strategy.

It’s about understanding how brands appear across prompts, personas, contexts, citations, and personal histories.

AI Mode is personal without Personal Intelligence. But when people opt-in, we’re marketing to every single grain on a beach.

Marketing needs to reframe our approach to, and expectations for, organic search.

Want to learn more?

Read About Our AI Search Strategy Program

//.about_the_author

Garrett Sussman

Garrett is the Marketing Director at iPullRank, a MozCon 2024 Speaker, and host of The SEO Weekly. When he's not crafting content, he's scouting the perfect ice coffee, playing with his daughter, devouring a non-fiction psychology book, or concocting a new recipe in the kitchen.

AI Mode, Generative AI, SEO

Your Inbox Might Be the Next AI Search Signal

by Garrett Sussman

05.21.2026

//.study_co_authors

Garrett Sussman

Marketing Director

Michael Tandoh

SEO ENGINEER

Cate Dombrowski

ASSOCIATE BI ANALYST

We Tested Google Personal Intelligence. Gmail Signals Changed AI Mode Brand Recommendations.

What We Wanted to Test

The Hypothesis: Personal Context Can Shift the Recommendation Set

Secondary Hypotheses:

Methodology

Research Question

Accounts Tested

Categories Tested

Prompt Design

Seeding Design

What the Seeded Emails Looked Like

Fake Brand Stress Test

Data Collection

Metrics

Analysis Approach

What This Methodology Does and Does Not Show

Finding 1: Seeded brands lifted sharply in the Personal Intelligence account

Finding 2: Gmail signals beat photo signals

Finding 3: Fake brands appeared after Gmail seeding

Finding 4: Category mattered

Finding 5: Real account history created its own brand-affinity profile

Finding 6: Prompt framing changed the strength of personalization

Finding 7: Personalization did not replace web grounding

What this means for brands

1. AI Search visibility is becoming user-specific

2. Email may become part of AI Search influence

3. Brand authority still matters

4. Measurement needs to move past rank tracking

What this study does and does not prove

What this study supports

What this study does not prove

What we would test next

1. More Accounts

2. Isolated Signal Types

3. Email Behavior Variants

4. Signal Decay

5. More Verticals

6. Better Prompt Taxonomy

The iPullRank POV: AI Search visibility is a context problem

Want to learn more?

//.about_the_author

Popular Articles

//.recent_content

AI Mode, AI Overviews, Generative AI, SEO

05.26.26

Agency, Content, SEO

05.22.26

AI Mode, Generative AI, SEO

05.21.26

AI Mode, AI Overviews, Generative AI, SEO

05.20.26

Tips, advice, and exclusive insight direct to your inbox

You don’t want to beat the algorithm—you want to crush the competition.

© 2025 iPullRank