Skip to content
D-LIT Logo

Data Sources

CRM systems, surveys, transaction records, and engagement logs.

By D-LIT Team

Customer analytics is only as accurate as the data that feeds it. Most organizations have more customer data than they realize, spread across systems that were never designed to talk to each other. CRM records, product event logs, support tickets, survey responses, payment histories, and email engagement data each capture a distinct dimension of the customer relationship. The challenge is not usually data scarcity; it is fragmentation, inconsistent definitions, and the absence of a reliable customer identifier that connects records across systems.

This guide covers eight categories of customer data sources, what each one provides, the specific platforms commonly found in each category, and the integration considerations that determine whether the data can be trusted and used at scale.


CRM Systems

What they provide. CRM platforms are the system of record for commercial relationships. They capture contact and account hierarchies, opportunity and pipeline history, contract and renewal dates, account ownership, sales activity logs, and often product-specific fields that teams add over time. For B2B companies in particular, the CRM is typically the only system that carries the full account structure: parent accounts, subsidiaries, contacts by role, and the history of commercial interactions over the life of the relationship.

Common platforms. Salesforce is the dominant enterprise CRM. HubSpot has become the standard for growth-stage and mid-market companies. Pipedrive, Zoho CRM, and Microsoft Dynamics are also in wide use.

What they do not provide. CRM data captures what happened in sales and customer success interactions, not what happened in the product. It is retrospective and human-entered, which means it is subject to data quality issues: incomplete records, inconsistent naming conventions, outdated contact information, and fields that were never rigorously maintained. CRM data alone cannot tell you whether a customer is engaged with your product.

Integration considerations. The CRM account and contact IDs often do not match IDs in other systems. Establishing a canonical customer or account ID that can be joined across the CRM, product analytics platform, and support system is a prerequisite for any meaningful cross-system analysis. This is typically addressed through a customer data platform or a manually maintained mapping table in the data warehouse.


Product Analytics Platforms

What they provide. Product analytics tools capture behavioral event data generated by users interacting with the product. Every page view, feature click, workflow completion, and session represents an event. These platforms allow teams to construct funnels, analyze feature adoption, build retention curves, and identify the behavioral patterns that distinguish high-value users from low-value ones.

Common platforms. Mixpanel and Amplitude are the two most widely used product analytics platforms. Heap differs from both in that it auto-captures all events by default rather than requiring manual instrumentation. PostHog is an open-source alternative gaining traction with engineering-led organizations. Segment is commonly used as an event routing layer that feeds multiple downstream tools from a single instrumentation effort.

What they do not provide. Product analytics platforms capture what users do, not who they are in a business context. They typically lack account hierarchy, deal history, and contract data. They also tend to be session-scoped rather than account-scoped, which creates challenges for B2B analysis where the unit of analysis is the account, not the individual user.

Integration considerations. The most critical integration is connecting product event data to account-level data from the CRM. This requires a consistent user ID or email that appears in both systems. User-level event data must be rolled up to the account level for B2B analysis. Data volumes from product analytics platforms can be large; storage and query costs in the warehouse should be planned for, and event schemas should be governed to prevent schema drift over time.


Customer Feedback and Survey Platforms

What they provide. Survey and feedback platforms capture explicit customer sentiment through structured questionnaires. The most common applications are NPS surveys (measuring overall loyalty and likelihood to recommend), CSAT surveys (measuring satisfaction with specific interactions), and CES surveys (measuring perceived ease of accomplishing a goal). More sophisticated uses include open-ended verbatim responses that are analyzed for themes and drivers.

Common platforms. Delighted is widely used for simple, high-volume NPS programs. Medallia and Qualtrics are the enterprise leaders for multi-channel feedback management, including relationship surveys, transactional surveys, and employee experience programs. Typeform is common for custom survey workflows. SurveyMonkey is used for ad hoc research.

What they do not provide. Survey data captures what customers say, not what they do. Response bias is a structural limitation: customers who respond to surveys are not a random sample of the customer base. Promoters and Detractors both tend to over-index in response rates relative to passive customers. This means aggregate NPS from surveys may not accurately represent the full customer base.

Integration considerations. Survey response data needs to be connected to the customer record in the CRM and, where possible, to product and revenue data. This connection allows you to test whether NPS scores predict churn, whether CSAT correlates with renewal rates, and which customer segments are underrepresented in survey responses. Many platforms offer native Salesforce integrations or webhook-based exports for data warehouse ingestion.


Customer Support and Service Systems

What they provide. Support ticketing systems capture every customer-reported issue, request, and interaction. This data reveals the specific friction points customers encounter, how quickly those issues are resolved, and which issue types recur. At scale, support data becomes a leading indicator of product quality issues, onboarding failures, and segments that are struggling with the product.

Common platforms. Zendesk is the dominant mid-market and enterprise support platform. Intercom serves companies that combine support with in-app messaging and automated campaigns. Freshdesk is a common alternative. Salesforce Service Cloud is used by companies that want support tightly integrated with CRM.

What they do not provide. Support data is reactive. It captures problems customers report, not problems they experience silently and then churn over. A significant portion of customer dissatisfaction never generates a support ticket. Support data should be used alongside proactive signals such as product engagement and survey data, not as a substitute for them.

Integration considerations. Support data connected to product and CRM data enables powerful analysis: which accounts are generating the most support volume, whether high support volume predicts churn, and whether specific issue types cluster in particular segments or product areas. The customer or account identifier in the support system must be mapped to the canonical identifier used across other systems.


Payment and Subscription Management Systems

What they provide. Payment and subscription platforms are the system of record for actual revenue events: subscription starts, renewals, upgrades, downgrades, payment failures, and cancellations. This data provides the most financially authoritative view of MRR movement, churn events, and expansion revenue. For B2C subscription businesses in particular, where self-serve cancellation is common, this system often captures churn before any other system reflects it.

Common platforms. Stripe is the dominant payment processor and subscription management platform for B2C and developer-led B2B companies. Chargebee and Recurly are purpose-built subscription management platforms that handle more complex billing models including metered usage, multi-currency, and enterprise contracts. Zuora is the enterprise standard for companies with highly complex billing requirements.

What they do not provide. Payment systems capture financial transactions but typically lack the context to explain them. A cancellation event tells you that a customer left; it does not tell you why. Combining payment data with survey, support, and product data is necessary to understand the causal drivers of churn events.

Integration considerations. Payment data should flow into the data warehouse on a near-real-time basis for accurate MRR tracking. Event-based ingestion (rather than batch daily pulls) enables more responsive churn alerting. Payment failure events are worth treating as a distinct category: payment failures that go unresolved are a source of involuntary churn that can often be recovered through dunning campaigns.


Email and Marketing Automation Platforms

What they provide. Email marketing platforms capture engagement with communication sent to customers and prospects: open rates, click-through rates, unsubscribe rates, and conversion events tied to email campaigns. For lifecycle marketing programs, they also contain the segmentation and campaign logic applied to customers at different lifecycle stages.

Common platforms. Klaviyo is the dominant platform for e-commerce and D2C brands. Mailchimp is widely used by smaller businesses. HubSpot Marketing Hub serves companies that want email tightly integrated with CRM. Customer.io and Iterable are purpose-built for product-driven lifecycle marketing and are common in SaaS.

What they do not provide. Email engagement metrics are increasingly unreliable as proxies for genuine customer interest. Apple Mail Privacy Protection and similar features inflate open rates artificially. Click-through rates are a more reliable signal, but overall engagement benchmarks have shifted. Email platform data should be treated as a supplement to product and survey data rather than a primary indicator of customer health.

Integration considerations. Email platform data is most valuable when connected to product event data and CRM records. This allows you to measure whether specific email campaigns drive meaningful product re-engagement, not just clicks. Event-driven email platforms like Customer.io and Iterable use product events as triggers, which means product analytics and email platform data are already logically integrated by design.


Web Analytics Platforms

What they provide. Web analytics platforms capture visitor behavior on public-facing web properties: marketing site traffic, content engagement, conversion funnel performance, and attribution of acquisition traffic to sources and campaigns. For e-commerce businesses, web analytics platforms also capture purchase funnel data and revenue attribution.

Common platforms. Google Analytics 4 (GA4) is the dominant web analytics platform. Adobe Analytics serves large enterprise organizations with complex data governance requirements. Plausible and Fathom are privacy-first alternatives gaining adoption in organizations that need to minimize cookie consent friction.

What they do not provide. Web analytics platforms capture pre-acquisition and anonymous behavior. They typically cannot connect anonymous visitor sessions to known customer records without deliberate instrumentation. This limits their utility for post-acquisition customer analytics. Their primary value in a customer analytics context is in understanding acquisition channels and measuring the effectiveness of content and campaigns aimed at generating customer interest.

Integration considerations. GA4 UTM parameters and campaign data should be passed through to the CRM at the point of conversion so that the acquisition source of each customer is captured and preserved. This enables CAC calculation by channel and LTV by acquisition source, one of the most strategically valuable analyses available to growth-stage companies.


Customer Data Platforms and Data Warehouses

What they provide. Customer Data Platforms (CDPs) and cloud data warehouses serve as the integration layer where data from all of the above source systems is unified around a canonical customer identity. CDPs like Segment, Rudderstack, and mParticle focus on real-time identity resolution and audience activation. Data warehouses like Snowflake, BigQuery, and Redshift provide the analytical layer where historical data from all sources can be joined, transformed, and queried.

Why this layer matters. Without a unification layer, customer analytics is confined to single-system analysis. With it, you can build a complete customer profile that combines CRM account data, product behavioral data, support history, payment events, and survey responses into a single record per customer. Analytics platforms such as Plotono can connect to these warehouse environments and provide the visualization and pipeline layer that turns unified customer data into operational dashboards. This unified record is the prerequisite for meaningful cohort analysis, churn prediction modeling, and LTV modeling.

Integration considerations. Identity resolution, the process of matching a customer record across systems using email, user ID, or other identifiers, is the core challenge in building the unified layer. This process is rarely clean. Customers appear under multiple email addresses, companies merge and have records in multiple accounts, and systems use incompatible ID formats. Investing in a well-governed identity resolution process early pays significant dividends as the analytics program matures.


For how to apply analytical techniques to the data from these sources, see the Techniques and Models guide. For the KPIs these data sources feed, see the Customer KPIs guide. For how to present unified customer data to executive and operational audiences, see the Dashboards guide.

Get More from D-LIT

Ready to transform your analytics capabilities? Talk to our team about how D-LIT can help your organisation make better, data-driven decisions.

Get in Touch