Data Steward Studio Administration
Also available as:
PDF

Data Steward Studio Overview

Data Steward Studio (DSS) is one of several services available for Hortonworks DataPlane Service; it provides a suite of capabilities that allows users to understand and govern data across enterprise data lakes.

The goal of the Data Steward Studio is the help data stewards across the enterprise to:

  • Organize and curate data globally

    • Organize data based on business classifications, purpose, protections needed, etc.

    • Promote responsible collaboration across enterprise data workers

  • Understand where relevant data is located

    • Cataloging and searching to locate relevant data of interest (sensitive data, commonly used, high risk data, etc.)

  • Understand how data is interpreted for use

    • Basic descriptions: Schema, classifications (business cataloging), encodings

    • Statistical models and parameters

    • User annotations, wrangling scripts, view definitions etc.

  • Understand how data is created and modified

    • Visualize upstream lineage and downstream impact

    • Understand how schema or data evolve

    • View and understand data supply chain (pipelines, versioning, evolution)

  • Understand how data access is secured/protected and audit usage

    • Understand who can see which data and metadata (e.g. based on business classifications) under what conditions (security policies, data protection, anonymization)

    • View who has accessed what data from a forensic audit/compliance perspective

    • Visualize access patterns and identify anomalies

For a list of features that are available in this release, see Evaluation Software Features.