Data

Revenue Operations runs on data. If your data is garbage, your forecasts, reports, and processes will be too. This pillar is about building the foundation everything else depends on.

Rationale

Data is the unsexy pillar that makes everything else possible. You can’t forecast accurately with duplicate accounts. You can’t segment customers with missing fields. You can’t measure pipeline velocity if stage dates aren’t captured.

Most RevOps problems that look like process problems or tool problems are actually data problems. The territory is “unfair” because account assignments are based on bad firmographic data. The forecast is wrong because reps aren’t updating opportunity stages. The churn prediction model fails because customer health data was never captured consistently.

This pillar covers:

  • Data Health — Accuracy, completeness, consistency, deduplication
  • Data Infrastructure — How data flows between systems, where it lives
  • Data Administration — Governance, access controls, ownership
  • Data Stewardship — Ongoing maintenance, quality monitoring, standards

Master this, and you become the person who can actually trust the numbers everyone else is guessing at.

Key Concepts

These are the foundational concepts you need to understand to manage data effectively across your revenue systems.

Concept What It Means Why It Matters
Data Health Accuracy, completeness, consistency of records Bad data = bad decisions. Period.
Deduplication Identifying and merging duplicate records Duplicates inflate metrics, confuse reps, waste outreach
Data Enrichment Adding missing firmographic/technographic data Enables segmentation, targeting, lead scoring
Data Governance Policies for who can edit what, and how Prevents data decay, maintains quality over time
Data Lineage Understanding where data originates and how it flows Critical for debugging when numbers don’t match
Master Data Management Single source of truth for key entities (accounts, contacts) Eliminates conflicting data across systems

KPIs This Pillar Impacts

  • Lead Score, Opp Score, Account Score — only as good as the underlying data
  • Forecast Accuracy — garbage in, garbage out
  • Pipeline Velocity — bad data creates friction that slows deals
  • Win Rate — you can’t optimize what you can’t accurately measure

Operational Metrics

  • Data Completeness Rate — % of critical fields populated
  • Duplicate Rate — % of records with duplicates
  • Enrichment Coverage — % of records with firmographic/technographic data

Resources

Start with CRM fundamentals, then move to data quality and enrichment concepts.

Foundational

Resource Source Type Cost Prerequisites
Data Modeling Salesforce Trailhead Module Free None
Data Quality Salesforce Trailhead Module Free Basic Salesforce
Data Management Salesforce Trailhead Module Free Basic Salesforce
Duplicate Management Salesforce Trailhead Module Free Basic Salesforce

Data Quality & Management (HubSpot)

Resource Source Type Cost Prerequisites
CRM Data Migration Certification HubSpot Academy Certification Free None
Create and Edit Properties in HubSpot HubSpot Knowledge Base Reference Free None
HubSpot Lead Management HubSpot Academy Course Free None

Data Enrichment & Tools

Resource Source Type Cost Prerequisites
Clay University Clay Course Free None
ZoomInfo University ZoomInfo Resources Free None

Advanced / Architecture

Resource Source Type Cost Prerequisites
Data Integration Specialist Superbadge Prep Salesforce Trailhead Trailmix Free Intermediate Salesforce
Integration Patterns and Practices Salesforce Trailhead Trail Free Data Modeling
DAMA Data Management Body of Knowledge (DAMA-DMBOK) DAMA International Framework $79 None

Videos & Talks

Resource Source Length Cost
The Data Warehouse Toolkit Explained Kimball Group 45 min Free
Book Author Why It’s Here Time
Lean Analytics Alistair Croll & Benjamin Yoskovitz Chapter on “What Data to Track” is foundational for understanding metrics hierarchy. Teaches you to think about data strategically. 12-15 hrs
Data Science for Business Foster Provost & Tom Fawcett Builds the mental model for how data becomes insight. Not hands-on technical, but conceptually essential. 15-20 hrs
The Data Warehouse Toolkit Ralph Kimball If you need to understand how data infrastructure actually works (dimensional modeling, ETL concepts). More technical — skip if you’re not building systems. 20+ hrs (reference)

For practical CRM data work, Trailhead modules are more directly applicable than books. The books above are for building deeper conceptual understanding.

Checklist

You’ve mastered this pillar when you can confidently do the following:

Data Health Fundamentals

  • Audit a CRM for data quality issues (duplicates, missing fields, inconsistent values)
  • Calculate a data completeness score for key objects (Accounts, Contacts, Opportunities)
  • Identify the top 5 data quality issues impacting reporting accuracy
  • Build a duplicate detection and merge process
  • (HubSpot) Audit property hygiene — identify unused, duplicate, and inconsistently-used properties
  • (HubSpot) Set up data quality automation using Operations Hub (formatting, deduplication)

Data Infrastructure

  • Map how data flows between your core systems (CRM, MAP, CS platform, etc.)
  • Identify where data gets created, modified, and potentially corrupted
  • Explain the difference between a system of record vs. system of engagement
  • Understand basic ETL concepts (Extract, Transform, Load)

Data Administration & Governance

  • Define field-level ownership (who is responsible for keeping each field accurate?)
  • Create validation rules that prevent bad data entry at the source
  • Build a data dictionary documenting what each field means and how it should be used
  • Establish a regular data hygiene cadence (weekly/monthly cleanup routines)

Data Enrichment

  • Evaluate data enrichment vendors (ZoomInfo, Clearbit, Clay, Apollo, etc.)
  • Define what firmographic/technographic data would improve segmentation
  • Build an enrichment workflow that fills gaps without overwriting good data

Advanced

  • Design a lead scoring model based on enriched data attributes
  • Build account segmentation using firmographic data (industry, size, tech stack)
  • Create data quality dashboards that surface issues before they impact reports

Real-World Application

  • Reduce duplicate account rate below 5%
  • Achieve 90%+ completeness on critical fields (industry, employee count, etc.)
  • Build one segmentation or targeting analysis that drove a business decision

Practical Application

If you have CRM access at work

Export your account data and analyze it in Excel: How many duplicates? What percentage have industry filled in? Employee count? Pick one critical field that’s less than 80% complete. Build a project to fill it (enrichment tool, rep data entry campaign, or manual research).

Create a “Data Quality” dashboard that leadership can see. Nothing drives cleanup like visibility.

If you don’t have CRM access

Use Salesforce Developer Edition (free) and import sample data to practice. Download a messy dataset from Kaggle and clean it — document your process. Sign up for Clay’s free tier and experiment with enrichment workflows.

Portfolio Pieces to Build

  • A data quality audit with findings and recommendations
  • A before/after analysis showing impact of data cleanup
  • A data dictionary for a CRM instance
  • A segmentation analysis using enriched firmographic data