June 21, 2013

Treasure Data's Plazma: Columnar Cloud Storage

Ron Zvagelsky

Treasure Data was built by Hadoop experts. We get Hadoop, and in many ways it is part of our core. As we built out the platform, we found that the storage layer needed to be multi-tenant, elastic, and easy to manage without giving up scalability or efficiency. This led us to create Plazma, our own distributed columnar storage system, in place of HDFS. We wanted to leverage the "store everything now, analyze later" model of our schema-less architecture while improving both storage efficiency and query performance.

By separating Hadoop's MapReduce processing engine from the storage layer, we can optimize the elasticity, efficiency, and reliability of the system. Making the system more modular also let us store data in a columnar format, so queries read only the columns they need instead of scanning the whole dataset. With Plazma, we process queries faster, manage databases more simply, and make better use of our schema-less database architecture.
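To make the columnar benefit concrete, here is a minimal Python sketch that compares how much data a single-field aggregate has to decode in a row layout versus a column layout. It is an illustration only, not Plazma's actual file format: the sample records, the field names, and the use of JSON as the on-disk encoding are all assumptions.

```python
import json

# Hypothetical event records; the field names are illustrative only.
rows = [
    {"user": "a", "path": "/home", "bytes": 120},
    {"user": "b", "path": "/cart", "bytes": 300},
    {"user": "a", "path": "/cart", "bytes": 80},
]

# Row layout: one serialized blob per record.
row_store = [json.dumps(r) for r in rows]

# Column layout: one serialized blob per field.
column_store = {
    name: json.dumps([r[name] for r in rows]) for name in rows[0]
}


def sum_bytes_row_store(store):
    """Decode every record in full, then pick out one field."""
    scanned = sum(len(blob) for blob in store)
    total = sum(json.loads(blob)["bytes"] for blob in store)
    return total, scanned


def sum_bytes_column_store(store):
    """Decode only the single column the query needs."""
    blob = store["bytes"]
    return sum(json.loads(blob)), len(blob)


row_total, row_scanned = sum_bytes_row_store(row_store)
col_total, col_scanned = sum_bytes_column_store(column_store)
```

Both functions return the same total, but the columnar version touches only the `bytes` blob, so the number of characters it scans is a fraction of what the row version reads.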

We achieved our technical goals by architecting Plazma in the following ways:

  • JSON processing: automatically converts row-based JSON objects into a columnar format
  • Columnar storage: uses a columnar file storage format which significantly reduces disk IO for analytical queries
  • IO optimizations: implements various IO optimizations such as parallel pre-fetch and background decompression
  • Scalability and ease of management: Plazma is built on top of object-based storage, which is easier to scale and maintain
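The first three points can be sketched in a few lines of Python. This is a rough illustration under stated assumptions, not Plazma's implementation, which is proprietary: the helper names (`rows_to_columns`, `compress_columns`, `read_columns`), the use of `zlib`, and the thread pool standing in for parallel pre-fetch and background decompression are all hypothetical choices made for the example.

```python
import json
import zlib
from concurrent.futures import ThreadPoolExecutor


def rows_to_columns(json_lines):
    """Pivot schema-less JSON records into per-field value lists.

    Fields missing from a record are padded with None so that every
    column has exactly one entry per row.
    """
    records = [json.loads(line) for line in json_lines]
    names = []
    for rec in records:
        for key in rec:
            if key not in names:
                names.append(key)
    return {name: [rec.get(name) for rec in records] for name in names}


def compress_columns(columns):
    """Serialize and compress each column independently."""
    return {
        name: zlib.compress(json.dumps(values).encode("utf-8"))
        for name, values in columns.items()
    }


def read_columns(chunks, wanted, workers=4):
    """Decompress only the requested columns, using background threads."""

    def load(name):
        return json.loads(zlib.decompress(chunks[name]).decode("utf-8"))

    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = {name: pool.submit(load, name) for name in wanted}
        return {name: fut.result() for name, fut in futures.items()}


lines = [
    '{"user": "a", "bytes": 120}',
    '{"user": "b", "bytes": 300, "referrer": "ad"}',
]
columns = rows_to_columns(lines)
chunks = compress_columns(columns)
result = read_columns(chunks, ["bytes", "referrer"])
```

Because each column is compressed on its own, a query that asks for `bytes` and `referrer` never decompresses `user`, and the requested columns can be fetched and decoded concurrently. In a real system the chunks would live in object storage rather than in a dictionary.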

These are some of the key innovations we made with Plazma to optimize query processing and storage and to give us a more efficient distributed storage system. Some companies argue that building on HDFS lets their business take advantage of open source innovation, which they consider preferable to on-premise solutions. However, for our purposes, Plazma is much more efficient at query processing, and it lets us separate the processing and storage layers to improve both query performance and manageability.

While this technology is currently proprietary to Treasure Data, we have discussed open sourcing it to provide developers with the tools they need for efficient distributed storage systems meant for big data analytics processing.

What do you think? Would you find this kind of technology useful and would you be interested in using it? Leave your thoughts in the comments.
