Skip to Content Treasure Data Logo Treasure Data Logo
  • Product
    • AI Agents
      • AI Overview Your coordinated intelligence system for marketing.
      • Marketing Super Agent
      • Agent Hub
      • Treasure Code
    • AI Marketing Cloud
      • Overview Data, intelligence, activation — all in one platform.
      • Omnichannel Engagement
      • Real-time Personalization
      • Paid Media Targeting & Optimization
      • Creative Automation for Marketing
      • Support, Clienteling & B2B Interactions
    • Customer Data Platform
      • CDP Overview The system of record and context for first-party customer intelligence.
      • Integrations
      • Hybrid Architecture
      • Trust & Security
  • Solutions
    • Industries
      • Automotive
      • CPG
      • Entertainment & Media
      • Financial Services
      • Healthcare
      • Retail
      • Technology
      • Travel & Hospitality
  • Customers
  • Resources
    • Explore
      • Resource Library
      • Case Studies
      • Blog
      • Documentation
      • Training
      • Events
      • Webinars
    • Get Started
      • Pricing Level up your marketing, lower your costs.
      • Demo Experience Treasure Data with an expert-led walkthrough.
      • Trade-Up Program Replace your CEP, CDP, or ESP. Save big with special incentives.
      • AI Workshop Accelerate agentic marketing with a personalized strategy session.
      • Fast Proof of Concept Test us out for two weeks using your data.
      • RFP Template Get to your shortlist of vendors faster.
      • Value Calculator Get a customized report about our potential annual value.
  • Company
    • Company
      • About Us
      • Careers
      • Partners
      • News
      • Contact Us
      • Terms
Login
Get a demo
  • Menu Item 1
    • Sub-menu Item 1
      • Another Item
    • Sub-menu Item 2
  • Menu Item 2
    • Yet Another Item
  • Menu Item 3
  • Menu Item 4
Blog
    • Customer Data Strategy
    • CDP
    • AI & Machine Learning
    • Partners
    • CDP Use Cases
    • Data Privacy & Security
    • Treasure Data CDP
    • CDP | Customer Data Strategy
    • CDP|Customer Data Strategy
    • Company News
    • Marketing
    • AI & Machine Learning | CDP | Data Strategy
    • AI & Machine Learning | Data Privacy & Security
    • AI & Machine Learning | Marketing
    • AI & Machine Learning | Privacy & Security
    • AI and Machine Learning | CDP
    • CDP Use Cases|Marketing
    • CDP | CDP Use Cases
    • CDP | CDP Use Cases | Marketing
    • CDP | Customer Data Strategy | Treasure Data CDP
    • CDP | Marketing
    • CDP | Partners
    • CDP|CDP Use Cases|Treasure Data CDP
    • Customer Data Strategy | Treasure Data CDP
    • Customer Service
    • Marketing | Treasure Data CDP
June 21, 2013

Treasure Data's Plazma: Columnar Cloud Storage

Ron Zvagelsky Ron Zvagelsky
  • Treasure Data CDP

Treasure Data has been developed by Hadoop experts. We get Hadoop, and, in many ways, it's part of our core. As we have built out the platform, we noticed that the storage layer needs to be multi-tenant, elastic, and easy to manage while keeping the scalability and efficiency. This led us to create Plazma, our own distributed columnar storage system in place of HDFS. We wanted to leverage the "store everything now, analyze later" model of our schema-less architecture and provide better performance in terms of storage and query processing.

By separating the MapReduce processing engine of Hadoop and the storage layer, we would be able to optimize the elasticity, efficiency, and reliability of the system. Making our system more modular also allowed us to use columnar storage for our data and allow queries to only parse through the relevant records instead of reading the whole dataset. Plazma led us to process the queries faster, manage databases more simply, and make better use of our schemaless database architecture.

We achieved our technical goals by architecting Plazma in the following ways:

  • JSON processing: automatically converts row-based JSON objects into a columnar format
  • Columnar storage: uses a columnar file storage format which significantly reduces disk IO for analytical queries
  • IO optimizations: implements various IO optimizations such as parallel pre-fetch and background decompression
  • Scalability and ease management: Plazma is built on top of object-based storage, which is more easier to scale and maintain

These are some of the key innovations we made with Plazma to optimize query processing and storage and provide us with a more efficient distributed storage system solution. Some companies make the argument that leveraging HDFS allows for their business to take advantage of open source innovation, which is preferable to on-premise solutions. However, for our purposes, Plazma is much more efficient in terms of query processing and allows us to separate the processing and storage layers for optimizing query processing and manageability.

While this technology is currently proprietary to Treasure Data, we have discussed open sourcing it to provide developers with the tools they need for efficient distributed storage systems meant for big data analytics processing.

What do you think? Would you find this kind of technology useful and would you be interested in using it? Leave your thoughts in the comments.

Topics Covered

  • Treasure Data CDP

Recent Posts

Data Privacy & Security 2 min read
Innovation Without Compromise: How We Use AI While Keeping Security Non-Negotiable
CDP Use Cases 2 min read
How Michaels Turned Billions of Customer Signals Into Personalized Experiences
Treasure Data Logo Symbol

+1 866.899.5386 (US)
+1 650.772.4500 (Non-US)

  • Product
    • AI Agents
      • AI Overview
      • Marketing Super Agent
      • Agent Hub
      • Treasure Code
      • Responsible AI
      • UX Research
    • AI Marketing Cloud
      • Overview
      • Omnichannel Engagement
      • Real-time Personalization
      • Creative Automation for Marketing
      • Paid Media Targeting & Optimization
      • Support, Clienteling & B2B Interactions
    • Customer Data Platform
      • CDP Overview
      • Integrations
      • Hybrid Architecture
      • Trust & Security
  • Solutions
    • Industries
      • Automotive
      • CPG
      • Entertainment & Media
      • Financial Services
      • Healthcare
      • Retail
      • Technology
      • Travel & Hospitality
  • Resources
    • Explore
      • Resource Library
      • Case Studies
      • Blog
      • Pricing
      • Documentation
      • Training
      • Events
      • Webinars
    • Get Started
      • Demo
      • AI Workshop
      • Fast Proof of Concept
      • RFP Template
      • Trade-Up Program
      • Value Calculator
  • Company
    • Company
      • About Us
      • Customers
      • Partners
      • Careers
      • News
      • Contact Us
      • Terms
  • Get a demo
  • Terms & Conditions
  • Privacy Statement
  • Cookie Policy
  • Privacy Hub
  • Trademarks
  • Modern Slavery Statement
  • Your Privacy Choices
©2026 Treasure Data, Inc. (or its affiliates) All rights reserved.