Adding Druid to a cluster
Also available as:
PDF

Druid introduction

Druid is an open-source column-oriented data store designed for online analytical processing (OLAP) queries on event data. Druid is optimized for time-series data analysis and supports the following data analytics features:
  • Real-time streaming data ingestion
  • Automatic data summarization
  • Scalability to trillions of events and petabytes of data
  • Sub-second query latency
  • Approximate algorithms, such as hyperLogLog and theta

These traits make this data store particularly suitable for enterprise-scale business intelligence (BI) applications in environments that require minimal latency and high availability. With Druid you can have applications running interactive queries that "slice and dice" data in motion.

A common use case for Druid is to provide a data store that can return BI about streaming data that comes from user activity on a website or multidevice entertainment platform, from consumer events sent over by a data aggregator, or from any other large-scale set of relevant transactions or events from Internet-connected sources.

Druid is licensed under the Apache License, version 2.0.