Take A Tour: Treasure Data CDP Technology
Explore the proprietary big data platform that forms the foundation of our customer data platform (CDP), along with our integrated personalization and decisioning capabilities.
The enterprise CDP built for speed and scale
259B
profiles activated per month
48B
stored customer profiles
<100ms
processing for real-time activations
~2M
records ingested per second
~2.2M
queries per day
21PB
stored in our CDP
Treasure Data CDP at a glance
Treasure Data CDP brings data and intelligence capabilities into a single managed, cloud-based SaaS solution.
Our enterprise customer data platform is built for the complex requirements of global organizations.
Key capabilities include:
- Handle structured and unstructured data at petabyte scale
- Integrate with virtually every major IT, adtech, and martech application
- Streaming, real-time, and batch data capture from mobile, web and database sources
- Zero copy to simplify data movement between your data warehouse and Treasure Data CDP
- Manage complex and customizable customer data sets and data models
- Identity resolution, unification and cleansing for customer profiles and activity data
- Audience building, segmentation, and activation
- Analytics, reporting and predictive modeling
- Privacy and consent, security, and data governance features
- Real-time personalization
- Customer journey orchestration
- Generative and predictive AI

Built on a foundation found nowhere else
Underlying our CDP is a proprietary big data platform that can process all of your customer data, at any volume, no matter the size or complexity of your organization. This foundational excellence has been part of our DNA since the very beginning of our company.
Our origin as a big data platform company provided the foundation for our CDP to batch process trillions of rows of data, deploy globally, and ensure enterprise security and compliance.
Treasure Data’s data storage platform is built on our proprietary distributed, schema-on-read, columnar storage system that is built on top of object-based storage in S3, allowing massive scale while maintaining data security.
All customer data is made available for transformation, cleaning, querying, enrichment, and insights through our Presto and Hive Query engines, which are part of our open-source contributions.
This is a key differentiator because it allows Treasure Data to separate storage and compute, enabling high performance at lower cost to our customers.
Tools to power better personalization and decisioning
Explore the technologies powering our enterprise CDP
Integrations
Data storage
Data collection
Data management
Trust solutions
Profile Unification
Integrations for data ingestion, insights, and activation
Get access to more than 400 out-of-the-box integrations to work with your existing tech stack. Our CDP integrates with virtually any system using API, JDBC, ODBC, batch, and more.
Integrations span stream and batch data collection, native integration with external BI, analytics, modeling, measurement and reporting tools, and external marketing execution and customer experience interaction tools.
You can also transfer your data through SFTP, S3, or a raw data upload.
Want to build your own integration? We provide a custom scripting environment where you can script your own custom integration.
Even more technologies powering our enterprise CDP
Data enrichment
Segmentation and activation
Account-level aggregation
Data science and modeling
Customer analytics
Measurement and reporting
Data enrichment for customer profile unification and identity resolution
Leverage second and third-party data to enrich your existing customer profiles. This data, including demographic, social, purchase, psychographic, intent, behavioral, and location data, comes via a vast ecosystem of partners and data providers or custom datasets.
As part of our identity resolution process, we provide proprietary data hygiene. Or get data hygiene and normalizing services through pre-built integrations with best-of-breed third-party data providers, such as Acxiom, LiveRamp, Allant, and Mapbox.
We also support compliant data sharing for those who want to create partnerships to share data, find ways to build scale and enable activation.