Knowledge Graph Optimization and Entity Authority

Build entity authority AI can verify

AI doesn’t recommend brands.

AI recommends entities it can verify.

And if your “entity” isn’t clean, consistent, and well-referenced across the web… you get the modern marketing nightmare:

The AI mixes you up with another company.
It grabs an old founder bio from 2018 and treats it like gospel.
It invents a headcount, a headquarters, a tagline, a parent company.
It says you do “enterprise AI consulting” when you sell wedding photography.

Why?

Because the model is trying to answer a very simple question:

“What is this thing?”

…and when it can’t answer confidently, it builds a story out of whatever it finds.

Dave’s line on this is blunt for a reason:

“If you don’t define your brand for AI, AI will define it for you using whatever it finds.”

This pillar post is about taking control of that “whatever it finds.”

Not with vibes.

With structure:

canonical brand facts
sameAs links
entity IDs
knowledge graph references
and a workflow to fix wrong answers like it’s an operational system (because it is)

What entity authority means

Let’s demystify the buzzword.

Entity authority is the probability that a machine can:

Recognize your brand as a distinct entity (not a keyword, not a logo, not “some company”).
Disambiguate you from similarly named entities.
Retrieve stable, consistent facts about you from sources it trusts.
Cite those sources (or at least align with them) across answers.

Google has been moving this direction for years. Their Knowledge Graph framing is literally “things, not strings.”

And it’s not just “search.”

AI interfaces are now showing entity panels and stitched summaries where the system is pulling from third-party sources and citing them. In AOK’s own experiment, the takeaway was clear: the panel wasn’t just homepage copy — it was assembled from sources across the web.

Like this Starbucks example:

So entity authority is less “rank my page” and more:

Can the machine confidently resolve you?

Or to use another AOK phrase:

SEO is becoming Entity Operations.

Old SEO was: keywords, pages, links.
AI-era SEO is: entity clarity, source control, fact consistency, authoritative references.

Why does AI get our company wrong?

Here are the most common reasons I see — and why they’re predictable.

1) Name collisions (you share a name with someone else)

If you’re “Summit Marketing” or “Blue River” or “Axis”… there are probably 14 of you.

Machines don’t “intuit” which one is you. They match patterns.

If your footprint is weak, you get merged.

2) Your own site is inconsistent

These are silent killers:

“Acme, Inc.” vs “Acme” vs “Acme Labs”
3 different founding years across pages
old office address still on the footer
phone number differs on contact page vs schema vs Google Business Profile
2 “About” pages with different positioning statements

If you can’t keep your own facts straight, don’t expect AI to do it for you.

3) Third-party sources contradict each other

AI systems build trust by triangulating.

If Crunchbase says one thing, LinkedIn says another, and your website says a third… you’ve basically told the model:

“Pick one. Good luck.”

And it will.

4) You don’t have “identity anchors”

Machines love stable identifiers.

That’s where sameAs and entity IDs come in.

Schema.org defines sameAs as a URL that unambiguously indicates identity (Wikipedia, Wikidata, official site, etc.).

Think of sameAs as your digital fingerprint. Not the whole solution — but a crucial piece.

5) Your brand isn’t “worth being careful about” (yet)

This one hurts, but it’s real.

Systems allocate confidence based on signals. If you’re a small brand with little coverage and inconsistent data, the model’s uncertainty is higher.

In AOK’s ChatGPT entity panel post, the point is framed in business terms: entity panels affect trust and decisions by investors, journalists, procurement, candidates, regulators — not just traffic.

If those people are using AI to form first impressions, your “entity clarity” becomes PR.

The single source of truth: your Brand Facts Sheet

If you do nothing else after reading this, do this:

Build a Brand Facts Sheet.

Not a fluffy brand deck.

A factual, canonical, version-controlled doc that answers:

What are the official facts about the entity?
What are the approved descriptions?
What IDs and references define it across the web?

Because when facts drift, you need a place to point and say:

“This is the source of truth.”

Brand Facts Sheet template

Use this as a starting point (copy/paste into a Google Doc, Notion, or a spreadsheet).

1) Identity block (the “who are we?” core)

Canonical name (brand name used publicly)
Legal name (registered name, if different)
Alternate names (DBAs, abbreviations, old names)
Tagline (current)
One-sentence description (approved)
Long description (approved 80–150 words)
Category (what you are — pick the simplest correct label)
Primary offer (what you sell)
Primary audience (who you serve)

2) Key facts block (the “things AI gets wrong” block)

Founding date (YYYY-MM-DD if possible)
Founders (names + titles)
CEO / key executives (names + titles)
Headquarters (city, region, country)
Primary phone
Primary support email
Primary physical address (if applicable)
Service area (if local/regional)
Parent company / subsidiaries (if applicable)

3) Web presence block (canonical URLs only)

Homepage URL
About page URL (canonical)
Contact page URL (canonical)
Press page / media kit URL
Logo URL (crawlable)
Brand images / headshots (URL list)

4) Social profile block (official handles)

LinkedIn company page
X
Instagram
YouTube
TikTok
Facebook
GitHub (if relevant)
Substack / Medium (if official)

5) Entity references block (the “machine anchors”)

Wikidata QID (if exists)
Wikipedia page (if exists)
Google Business Profile (if applicable)
Apple Business Connect (if applicable)
Industry directories that matter (G2 / Capterra / Clutch / etc.)
Data providers relevant to your vertical

6) Change log (this is what makes it operational)

Track:

what changed
when
why
who approved it

Because your brand facts will change (new address, new CEO, rebrand), and you need controlled updates.

sameAs + entity IDs checklist

This is the “entity footprint” work. This is where you stop being “just a website” and start being a resolvable entity.

Step 1: Create a canonical entity ID (source of truth) on your own site

You want one stable identifier for your organization, usually as a URL.

Example:

https://example.com/#organization

That becomes your internal “entity node” you reference in structured data.

Step 2: Add Organization structured data (and do it right)

Google explicitly supports Organization markup and recommends putting it on the homepage or a single page like an About page (not necessarily every page).

They also recommend focusing on properties useful to users (name/alternateName, real-world presence like address/telephone, online presence like url/logo).

And importantly: Google calls out that url helps uniquely identify your organization.

Step 3: Use sameAs like a sniper, not a shotgun

Schema.org’s definition is very specific: sameAs should point to a page that unambiguously indicates identity.

Google’s Organization documentation describes sameAs as links to pages on other websites with more info about your organization (social, review sites) and notes you can provide multiple sameAs URLs.

Rule: Only link profiles/pages that are the same entity.

Not “related.”

Not “partner.”

Not “a random directory page that mentions us once.”

Step 4: Validate your structured data

Google’s structured data guidelines recommend supported formats and explicitly list JSON-LD as recommended.

They also note you can test compliance using tools like Rich Results Test and URL Inspection.

And they’re clear on a key quality point: don’t mark up content that isn’t visible to readers — the markup and page content should match.

Step 5: Build the Entity ID Map (your “sameAs inventory”)

This is the part most brands skip.

They add schema once.

Then they forget it.

Instead, build a living “Entity ID Map” that includes:

Platform
Canonical URL
Entity ID (if the platform has one)
Ownership status (claimed/unclaimed)
Notes (what needs fixing)

Here’s the checklist of what to include:

sameAs / entity reference candidates (pick what’s real for you)

Tier 1: Always

Official website (canonical URL)
LinkedIn company page
Any other official social profiles
Official YouTube channel (if active)

Tier 2: Often

Crunchbase
GitHub org
App stores / marketplace profiles (if SaaS)
Major review platforms relevant to your industry
Major media profiles (if you have them)

Tier 3: Knowledge graph anchors

Wikidata item (QID)
Wikipedia page (if legitimately earned)

Tier 4: Local / regulated / compliance

Google Business Profile (local)
Apple Business Connect (local)
Government registries (where appropriate)
Industry accrediting bodies (where appropriate)

What to put in your Organization JSON-LD (example)

Here’s a practical JSON-LD skeleton you can adapt.

{

“@context”: “https://schema.org”,

“@type”: “Organization”,

“@id”: “https://example.com/#organization”,

“name”: “Example Brand”,

“legalName”: “Example Brand, Inc.”,

“alternateName”: [“Example”, “Example Co.”],

“url”: “https://example.com/”,

“logo”: “https://example.com/assets/logo.png”,

“description”: “Example Brand helps X do Y with Z.”,

“foundingDate”: “2016-04-12”,

“address”: {

“@type”: “PostalAddress”,

“streetAddress”: “123 Main Street”,

“addressLocality”: “New York”,

“addressRegion”: “NY”,

“postalCode”: “10022”,

“addressCountry”: “US”

“telephone”: “+1-646-555-1234”,

“sameAs”: [

“https://www.linkedin.com/company/examplebrand/”,

“https://www.wikidata.org/wiki/Q123456”,

“https://en.wikipedia.org/wiki/Example_Brand”

]

}

Two important notes:

Don’t add foundingDate, address, executives, etc. unless you’re willing to keep them accurate everywhere. “Facts drift” is a real brand risk. (That’s why you need the facts sheet.)
If you’re a local business, Google specifically notes LocalBusiness has required properties and additional guidance.

Do we need Wikidata or Wikipedia?

Let’s answer this like adults.

You don’t need Wikipedia to show up in AI answers.

But:

Wikipedia and Wikidata are common reference points in knowledge systems.
The Knowledge Graph itself has historically cited sources like Wikipedia (and previously Freebase).
Wikidata is designed as a machine-readable, editable knowledge base.

So “need” isn’t the right word.

The right word is:

Leverage.

If you can legitimately earn those references, they become strong identity anchors.

But you can also build a strong entity footprint without them by getting your site, schema, and third-party references consistent and authoritative.

Wikidata/Wikipedia readiness path

This is where PR/Comms teams either win… or get themselves into a mess.

Step 1: Understand the difference

Wikipedia is narrative + citations + editorial standards.
Wikidata is structured facts + references + IDs.

Wikidata’s own framing: it’s a free and open knowledge base readable and editable by humans and machines.

Step 2: Know the bar for Wikipedia

Wikipedia’s notability guidance for organizations is blunt:

Notable if there is significant coverage in reliable, independent secondary sources.
If no independent, third-party reliable sources exist, Wikipedia shouldn’t have an article.

Translation:

Your press release doesn’t count.
Your blog doesn’t count.
Your partner’s blog doesn’t count.
Your paid advertorial doesn’t count.

The game is earned media and independent coverage.

Step 3: Don’t do the classic COI mistake

If you (or your agency) are closely connected to the company, Wikipedia considers that a conflict of interest.

Wikipedia explicitly says COI editing is strongly discouraged, and advises disclosure and proposing changes via talk pages instead of directly editing.

So don’t treat Wikipedia like a profile you “claim.”

Treat it like an encyclopedia you have to earn.

Step 4: A practical “readiness ladder”

Here’s the path I recommend for brands:

Level 0: Not ready

No independent coverage
Inconsistent brand facts everywhere
Basic digital presence is messy

Action: fix your own house first (facts sheet + site consistency + schema + profile cleanup).

Level 1: Wikidata-ready

The entity is identifiable
You can cite verifiable references (official site, reputable sources)
You’re prepared to keep facts accurate

Wikidata has a notability policy: an item is acceptable if it fulfills certain goals/criteria (including having sitelinks and meeting listed criteria).

Action: build a clean Wikidata item with references (and do it ethically).

Level 2: Wikipedia-ready

Multiple independent reliable secondary sources with significant coverage
Neutral tone possible
COI handled properly

Action: do not brute-force publish a page. Use the right process (talk pages / Articles for Creation) and disclose COI.

Level 3: Knowledge panel-ready (the real win)

Consistent entity across web
Strong authoritative references
Clear schema + sameAs
Clean “about” narrative

This is where machines stop guessing.

Fix wrong AI answers workflow

This is the module that turns “AI got us wrong” from a panic into a process.

Because the fix is rarely “tell ChatGPT to stop.”

The fix is almost always:

Fix the sources the AI trusts.

Here’s the workflow.

Step 1: Capture the bad output like evidence

Create a shared doc/spreadsheet with:

Prompt used
Full AI answer
Date/time
Screenshot
Links/citations shown (if any)
What’s wrong (specific facts)
Severity (low / medium / high)

If it’s a public-facing AI panel, treat it like a PR issue.

Step 2: Classify the error

Most errors fall into one of these buckets:

Identity error (entity merge): It’s mixing you with another entity.
Fact drift: It has an old address/CEO/founding date.
Inference/hallucination: It’s guessing because it can’t verify.
Narrative skew: It overweights one controversy or one review source.
Relationship confusion: Parent/subsidiary/partner relationships are wrong.

Different category = different fix.

Step 3: Trace the likely source of truth conflict

Start with the “obvious” anchors:

Your website About / Contact
LinkedIn company page
Major directories in your category
Wikipedia/Wikidata (if present)
Press coverage and bios
Google Business Profile (if local)

Remember: AI panels and summaries are often assembled from third-party sources — not just your homepage.

Step 4: Fix your canonical facts first

Update:

Brand Facts Sheet (source of truth)
Website pages (About, Contact, Press)
Structured data (Organization schema, LocalBusiness if applicable)

Google’s documentation is explicit that organization markup can include details like address/telephone and helps them understand your organization.

Also: keep markup aligned with visible content. If you update a fact in schema, update it on-page too.

Step 5: Fix third-party contradictions (the unglamorous part)

This is the work no one wants to do, but it’s where the wins are:

Claim profiles you haven’t claimed
Correct old addresses / phone numbers
Update executive names and titles
Remove duplicate profiles
Standardize naming (exact punctuation matters more than you think)

Step 6: Strengthen identity anchors with sameAs

If you have stable references, add them.

Schema.org describes sameAs as linking to identity-unambiguous pages like Wikipedia/Wikidata/official site.

Google explicitly supports multiple sameAs URLs for Organization markup.

Step 7: Validate + monitor

Use structured data testing and keep a monitoring cadence.

Google’s guidelines mention testing with tools like Rich Results Test and URL Inspection for technical issues.

Then repeat your AI prompt tests monthly/quarterly and log changes.

Step 8: Create a “Fact Correction” public page (optional but powerful)

This is a secret weapon for PR/Comms teams.

A simple page that answers:

official founding year
official headquarters
official leadership
official product/service definition
official parent/subsidiary relationships
official naming conventions

It becomes a crawlable, citable correction hub.

(And yes, it should match your schema.)

The practical bottom line

AI isn’t sitting there thinking, “How do I represent this company fairly?”

It’s doing pattern matching and confidence scoring.

So your job is to make the pattern obvious:

One entity
One set of facts
Many corroborating references
Clean sameAs links
No contradictions

Or, put in AOK terms:

You’re either shaping the sources… or you’re inheriting the narrative.

What to do now

If you want the fastest path to “verified, consistently described brand across AI answers,” do this in order:

Build the Brand Facts Sheet (today).
Audit your website for fact consistency (About, Contact, footer, schema).
Implement Organization schema with a canonical @id.
Add sameAs links only to true identity anchors.
Clean up your top third-party profiles (LinkedIn, key directories, local listings).
Decide your Wikidata/Wikipedia strategy based on notability + COI reality.
Build the “wrong AI answer” log and turn it into a workflow.

That’s Entity Ops.

And if you want this pillar to work as a real “single source of truth” system (not just a one-time SEO task), assign ownership:

Marketing owns narrative + positioning
PR/Comms owns public references + coverage
Web/SEO owns schema + technical implementation
Someone owns the facts sheet like it’s governance (because it is)

About The Author

Dave Burnett

I help people make more money online.

Over the years I’ve had lots of fun working with thousands of brands and helping them distribute millions of promotional products and implement multinational rewards and incentive programs.

Now I’m helping great marketers turn their products and services into sustainable online businesses.

How can I help you?

See author's posts