Finding Look-Alike Audiences in the Privacy-First Marketing World

Look-alike modeling has been an important part of the media toolkit over the past decade, allowing brands to increase their audience pool by taking a core group of top-performing individuals, grouping them and using data and technology to find other individuals like them.

Over the past several years, data management platforms (DMPs), third-party cookies and their associated data are becoming obsolete due to self-regulation by technology providers and legislation like CCPA and GDPR.

The movement away from third-party cookies and third-party data overlays on cookies is causing total audience pools to drop in size as individuals have fewer associated identifiers (cookies to connect to).

However, look-alike modeling can also help businesses leverage their first-party data to build robust large-scale segments for marketing and advertising purposes.

Tealium’s regional vice president of strategic partnerships for the Americas, Travis Cameron, explained that the value of being able to expand target populations based on data associated with a high-value segment will take on a different dimension.

These changes range from the identifiers used (hashed PII data to match & expand) to the utilization of different data types (contextual, interest-based, pathing) vs. demographic or psychographic inferred data used previously.

“The value remains — marketers need eyeballs, and the quest to find individuals like those who just converted to optimize spend will always have a high associated value as a tactic,” Cameron said. “It is just going to become harder to model and expand your audience.”

Understanding the Machine Learning Model

“Given the complexity of using large amounts of first-party data to build segments, it’s important for businesses to understand how the modeling is reaching its conclusions by understanding which features are used to build the model and their relative importance,” explained Alex Holub, CEO of Vidora. “In addition, it’s important to understand the ultimate goal of the look-alike segment.”

For instance, if the goal is to maximize clicks on a marketing campaign, using a machine learning model that directly optimizes for clicks will yield far better performance than a semi-supervised look-alike model.

Holub explained machine learning offers a few key benefits when building look-alike segments:

First, machine learning is continually adapting segments based on the latest information available. By continually learning, machine learning can incorporate new user activities or actions, in addition to broader secular changes which impact behaviors across all users.

Second, machine learning implicitly learns the importance of user behaviors and attributes, which allows machine learning to leverage all available first-party data and implicitly upweight or downweight the importance of each data point.

“In other words, machine learning can incorporate all available first-party data and learn what information is the most important to the model,” Holub said.

Cameron noted that it’s important to have a clean and correlated data set of your customers that contain like data about each consumer ahead of starting the exercise. “Know the outcome that you are looking for. Any model you’re using needs to be developed and applied with an outcome in mind, so figure out what you trying to optimize with this audience," he said.

He recommended using an agile approach to test, learning quickly and ensuring you have a testing plan in place as you begin to act on the audience response. “Know where you plan to activate the audience through and ensure you have the right identity points and integrations to activate on."

Deploying Data Science to Target Groups

Data science platforms can also offer a fast and reliable solution for businesses to model their first-party data. For instance, Vidora’s product Cortex helps businesses build, understand and integrate look-alike models into their business within a few days.

“For smaller teams, and teams which need to move quickly, a data science platform can augment productivity and be a great value,” Holub said. He added that he currently sees a lot of businesses leverage look-alike modeling as a black box — typically as a component of their DMP.

However, given the increasing organizational dependence on look-alike modeling using first-party data as a key revenue stream, he predicts businesses will begin to leverage more sophisticated data science techniques.

“Using data science techniques should result in both higher quality segments for brands to market to but also segments which align more directly with the brand’s goals," he said.

For example, data science techniques outside of look-alike modeling can build segments that directly optimize for user engagement (e.g., clicks), down-stream conversions and increase in brand sentiment (e.g., uplift modeling).

“I think we will see an increasing reliance on real-time data for look-alike models as the number of first-time users and anonymous users increases,” Holub added.

Learning Opportunities

WebinarJul 22, 2026 · 11:00 AM PDT

Replacing Tasks, Not Roles: The Changing Nature of Contact Center Work

Birds sitting on a tree branch like a content team

WebinarJul 23, 2026 · 11:00 AM PDT

How Fast-Moving Content Teams Keep Up as Sites Grow

WebinarJul 30, 2026 · 11:00 AM PDT

From Automation to Intelligence: How Leading Teams Are Rethinking Operations

WebinarAug 19, 2026 · 9:00 AM PDT

How to Win the War for Agentic Citations: The AEO Playbook You Need Now

Promotional banner for CX Retail USA Exchange 2026, an invite-only customer experience and retail leadership conference in Atlanta on Sept. 14–15, 2026.

ConferenceSep 14, 2026 · 7:30 AM EDT

CX Retail Exchange USA Atlanta 2026

Gaylord Rockies Resort & Convention Center in Aurora, Colorado

ConferenceNov 4, 2026 · 9:00 AM MST

Gartner Customer Service & Support Conference Denver 2026

Prove the significant result not only in soccer

WebinarOn Demand

Content Leaders Collective: Proving Content's Business Impact Starts With the Right CCMS

Watch Now

WebinarOn Demand

Why Some Dealers Are Pulling Ahead With AI

Watch Now

View All

Cameron said companies with smaller adtech and data teams need to focus their efforts on developing and optimizing models and understanding their customers and the key data points that drive them, not engineering and cleansing their data.

“Achieving a state of data automation within which they can work with their full audience data, leverage a few key partners to better deploy their audiences and have strict measurement theses that allow them to execute at the same pace as a much larger organization,” he explained.

Building Look-Alike Audience Modeling in the First-Party Data Era

Understanding the Machine Learning Model

Deploying Data Science to Target Groups

About the Author