Introducing Three New Collectors: Azure Data Factory, DynamoDB, and Teradata

We’re excited to announce three new collectors: Azure Data Factory (ADF), DynamoDB, and Teradata. These collectors gather metadata from these systems and seamlessly bring it into our data.world platform. This metadata helps both technical and non-technical users with discovering and understanding their data quickly, governing their data with greater context, and increasing trust in data by providing information about data health and transformations. 

Azure Data Factory Collector: Detailed Data Tracking

The ADF Collector shows you how your data was moved or transformed, the format changes it underwent, and its migration journey, building a foundation of trust. This collector fetches metadata for Factory, Pipeline, Activity, Linked Service, Dataset, Dataflow, Trigger, Integration Runtime, and Global Parameter within Azure Data Factory. It also provides column-level Lineage, showing how data moves between ADF Datasets and connected sources like Snowflake, Databricks, S3, and ADLS. Finally, it supports monitoring pipelines for health checks, boosting confidence in data integrity and reliability.
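To make the idea of column-level lineage concrete, here is a minimal Python sketch of how upstream columns could be traced through a lineage graph. The column names and the `LINEAGE` mapping are hypothetical and only illustrate the concept; they are not the collector's actual output format.

```python
# Hypothetical lineage edges: each target column maps to the columns it is derived from.
LINEAGE = {
    "adls.curated.orders.total": [
        "adf.dataset.orders_staged.amount",
        "adf.dataset.orders_staged.tax",
    ],
    "adf.dataset.orders_staged.amount": ["snowflake.raw.orders.amount"],
    "adf.dataset.orders_staged.tax": ["snowflake.raw.orders.tax"],
}


def upstream_columns(lineage, column):
    """Return every column that feeds into `column`, directly or indirectly."""
    seen = set()
    stack = [column]
    while stack:
        current = stack.pop()
        for source in lineage.get(current, []):
            if source not in seen:  # guard against revisiting shared sources
                seen.add(source)
                stack.append(source)
    return seen


if __name__ == "__main__":
    for name in sorted(upstream_columns(LINEAGE, "adls.curated.orders.total")):
        print(name)
```

Walking the graph this way answers the question "where did this column come from?" without having to read each pipeline by hand.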

DynamoDB Collector: Simplified Discovery

Our DynamoDB Collector helps users discover and understand DynamoDB resources. This collector captures deep metadata for Tables and Streams. It’s useful for both technical users managing DynamoDB resources and non-technical users exploring metadata and understanding how they can use DynamoDB resources through an intuitive interface. Technical users will appreciate getting insight into DynamoDB resources, including Tables, Keys, Indexes, and more. 

Teradata Collector: Comprehensive Data Insight

The Teradata Collector gives users a holistic view of all their Teradata assets to help them manage and discover their data. This collector covers metadata for Database, Table, SQL Procedures, User Defined Functions, View, External Procedures, Triggers, User Defined Methods, and User Defined Types. It also offers Profiling and Lineage, showcasing column-level lineage between views and sourced columns, plus lineage for stored procedures. Users can track ownership and freshness of Databases and Tables, which helps them assess data quality. Users can also see metadata about how the data was queried via SQL procedures, user defined functions and methods, and triggers, boosting trust in data products.

Start Exploring Today

These collectors enhance how data users explore, discover, understand, and trust their data. Whether you're a tech pro or not, these tools make navigating metadata easier and help teams become more data-driven. Dive into your data world with these new collectors, and embark on a journey of empowered decision-making.

Happy exploring, The data.world Team


SQL Server Reporting Services (SSRS) support for metadata collection is now Live!

Announcing our newest metadata collector - SQL Server Reporting Services (SSRS)! This collector is designed to provide you with an effective solution for extracting metadata from your SSRS environment into your data.world catalog. Our integration facilitates the automated extraction, organization, and presentation of specific metadata elements from your SSRS system. You'll gain valuable insights into your datasets, data sources, folders, KPIs, reports, and linked reports – all within your easily navigable catalog. 

With the SSRS collector, you can:

  • Learn more about your reports and data, including who created a report or dataset and when they were last updated, helping you understand and trust your data
  • See the lineage of which datasets were used in a report, allowing you a comprehensive view of the data flowing into a report
  • Keep track of KPIs from SSRS and integrate them with business metrics from other source systems, all within one easy-to-use catalog, leading to better data-informed decisions

Are you ready to unlock the potential of your SQL Server Reporting Services? You can read more about how this collector works and all it harvests in the documentation. This collector is Tier 2 for Enterprise customers, and is available in dwcc version 2.151 and later.

An example of metadata from an SSRS Report, including Lineage:


Announcing Enhanced Email Notification Options

Visit your notifications settings page to customize the transactional emails you receive from data.world.

You can choose to:

  • Turn off all non-essential email communications
  • Unsubscribe from a category of email notifications
  • Customize which digests you receive
  • Customize dataset and project activity notifications

Learn more

Announcing support for Confluent Kafka metadata

Announcing our newest metadata collector - Confluent Kafka! We know how important it is to have the most up-to-date streaming data, so we’ve created this collector to allow you to easily monitor and collect Kafka metadata from your Confluent streaming platform. 

With Kafka metadata in data.world, you and your teams can: 

  • Easily discover and monitor streaming metadata for real-time applications
  • Understand what is being streamed from on-prem and cloud Confluent
  • Have a single source of truth for your Confluent schemas for better discovery and governance

The data.world Confluent Collector is actually two collectors, one for Confluent Platform (on-prem) and one for Confluent Cloud. With these collectors, you can capture, store, and analyze metadata including Cluster, Consumer, Producer, Broker, Partition, Schema, Consumer Group, Topic, and Environment (for Cloud). The collectors can optionally harvest metadata from Avro, JSON-schema, and Protobuf schemas stored in Confluent Schema Registry.
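As an illustration of the kind of information an Avro schema carries, the following Python sketch parses a schema and lists its fields. The `Order` schema and the `describe_fields` helper are hypothetical examples, not part of the collector or of Confluent Schema Registry.

```python
import json

# Hypothetical Avro schema, similar in shape to one stored in a schema registry.
AVRO_SCHEMA = """
{
  "type": "record",
  "name": "Order",
  "namespace": "com.example.sales",
  "fields": [
    {"name": "order_id", "type": "string"},
    {"name": "amount", "type": "double"},
    {"name": "coupon", "type": ["null", "string"], "default": null}
  ]
}
"""


def describe_fields(schema_text):
    """Summarize an Avro record schema as a record name plus a list of fields."""
    schema = json.loads(schema_text)
    rows = []
    for field in schema["fields"]:
        ftype = field["type"]
        # In Avro, an optional field is a union that includes "null".
        nullable = isinstance(ftype, list) and "null" in ftype
        if isinstance(ftype, list):
            non_null = [t for t in ftype if t != "null"]
            ftype = non_null[0] if len(non_null) == 1 else non_null
        rows.append({"name": field["name"], "type": ftype, "nullable": nullable})
    return {"record": f'{schema["namespace"]}.{schema["name"]}', "fields": rows}


if __name__ == "__main__":
    summary = describe_fields(AVRO_SCHEMA)
    print(summary["record"])
    for row in summary["fields"]:
        print(row)
```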

These Collectors are Tier 2 for Enterprise Customers. You can read the full documentation for Confluent Platform here and for Confluent Cloud here.

An example of metadata for an Avro Schema in the data.world platform

Announcing Azure Data Lake Storage Gen 2 Collector and Databricks Collector Lineage and Jobs

We’re excited to announce new enhancements to data.world’s Databricks Collector and a brand new Collector for Azure Data Lake Storage Gen 2! With the help of these additional metadata harvesting and lineage capabilities, you can now get more detailed insights into your data than ever before.

Our Databricks Collector allows you to quickly and easily collect metadata from your Databricks environment into data.world. Now, with the addition of Jobs harvesting and lineage capabilities, you can get a deeper understanding of where your data is coming from, how it’s being used, and what insights you can discover.

Our new Jobs harvesting feature allows you to collect additional information about your workflows, such as creator, description, success, schedule, and more. This lets you better understand how and why your data was transformed.

The new lineage capabilities let you track your data’s journey, from its source all the way through its transformations. This means you can easily trace your data’s history, identify potential bottlenecks or sources of errors, and quickly gain an understanding of how your data has changed over time.

Our Azure Data Lake Storage Gen 2 Collector allows you to bring insights about your data storage layer into data.world. With this Collector, you can efficiently harvest metadata about Blobs and Containers, including the owner, last modified, path, and more. This information is vital for understanding your underlying data, leading to more trust and confidence in your data-driven decision-making.

You can learn more about these Features in our Databricks documentation and our Azure Data Lake Storage documentation. Both these Collectors are Tier 2 for Enterprise Customers.

An example of ADLS Blob metadata in the data.world platform


New Events Available for Metadata Audit

For customers with the data.world Standard Tier or above, two new tables are available in the `baseplatformdata` dataset, part of the standard events and logging package. data.world has always provided metrics and the raw events that drive them as a queryable dataset within the platform, allowing our customers to build their own custom success measures, KPIs, and dashboards for catalog adoption and metadata programs. This capability now extends to metadata governance programs as well, with audit event log tables specifically designed to track changes to metadata.

Audit events allow administrators to monitor the actions performed by all users in the data.world application, whether through the UI or the API. The audit log reporting functionality enhances the accountability of actions in the application. Administrators can track the actions taken by users and find the root cause of issues by identifying who performed an action and which resources it affected.

Please see the Audit Events documentation for more details including full table descriptions and sample queries.
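As a rough illustration of the root-cause workflow described above, this Python sketch filters a handful of audit rows for one resource and finds who changed it last. The rows and the field names (`timestamp`, `actor`, `action`, `resource`) are hypothetical; the real table and column names are listed in the Audit Events documentation.

```python
from datetime import datetime

# Hypothetical audit rows; real column names may differ.
EVENTS = [
    {"timestamp": "2023-05-02T10:15:00", "actor": "alice",
     "action": "update_description", "resource": "sales.orders"},
    {"timestamp": "2023-05-01T09:00:00", "actor": "bob",
     "action": "add_tag", "resource": "sales.orders"},
    {"timestamp": "2023-05-03T14:30:00", "actor": "carol",
     "action": "update_owner", "resource": "sales.customers"},
]


def history_for(events, resource):
    """Return the events for one resource, oldest first."""
    matching = [e for e in events if e["resource"] == resource]
    return sorted(matching, key=lambda e: datetime.fromisoformat(e["timestamp"]))


def last_editor(events, resource):
    """Return the actor behind the most recent event on a resource, or None."""
    history = history_for(events, resource)
    return history[-1]["actor"] if history else None


if __name__ == "__main__":
    for event in history_for(EVENTS, "sales.orders"):
        print(event["timestamp"], event["actor"], event["action"])
    print("Last editor:", last_editor(EVENTS, "sales.orders"))
```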

data.world Usage and Audit Events now available as a Snowflake Marketplace Private Listing

As a Snowflake Powered By partner, data.world is proud to announce that usage and audit event data, previously only available in a data.world dataset, is now available as a Snowflake Marketplace Private Listing. This gives data.world customers access to their full history of data.world events data via the Snowflake Data Cloud, enabling high-performance and advanced analytic functions on this data. It also makes data.world events and logging data available via Snowflake with no ETL required, for integration in a wide variety of use cases. To read more about this capability and how to request access to a Private Listing, please see our documentation here.

Resource page navigation just got easier!

Our resource pages just got easier to scan and navigate. With the latest update now in production, the changes mentioned in our earlier post let users scan the available metadata and get the insights they need to make decisions efficiently and effectively.

The quick section navigation links on the overview tab of your collection, resource, and glossary pages will make navigating the areas of the resource page faster. 


We've also separated the overview and the details of the different types of related resources by giving the related resources their own tabs. The related resources are now highly scannable, sortable, expandable, and searchable. These views offer an organized and condensed presentation of the metadata, making it easier to quickly access and understand the information.


As always, you can read more about these changes in our documentation portal. If you have feedback, please leave a suggestion in our support portal. For those interested in learning more about our data discovery solutions, please visit our website and book a demo or reach out to your customer service representative today. We look forward to helping you and your teams discover your data!

Advanced Search now in the global search bar

We are happy to announce the release of our latest feature: Advanced Search within the global search bar. 


In addition to adding Advanced Search to our global search bar, we've improved it by including the ability to pre-scope your search to a certain Organization. If you are an Enterprise customer with multiple organizations, you can now pre-select the organization to which you want to limit your search.

But wait, there's more!

For customers who have hierarchical collections, you'll want to check out our beta release of the collection picker in the Advanced Search modal. This gives users the ability to quickly scope their search to a branch of collections in the domain hierarchy. Be sure to turn on the beta feature flag in the advanced settings to see this feature.
 


The new advanced search features in the global search bar are designed to provide a more intuitive and powerful data discovery experience from the moment you log in. With these new capabilities, you'll be able to find the data and insights you need faster than ever before.

Refer to the advanced search documentation to learn more.

We're committed to continually improving our platform and are eager to hear your feedback. Please feel free to reach out via our support portal to share your thoughts, experiences, and suggestions regarding our new Advanced Search features. Visit our website to learn more about Data Discovery or book a demo. Together, let's unlock the true potential of data-driven decision-making.

Bulk upload and edit all metadata resources

We're thrilled to bring a much-anticipated feature to the platform: the ability to bulk edit all metadata resources by Collection!

Users could previously bulk load and edit business glossary terms, but now this is possible for any type of resource. Simply download a spreadsheet of all the resources in a Collection, enrich them by editing fields or adding new resources, and upload the changes to the platform.

On the Settings page of a Collection, users with edit access to the Collection will see the following modal:

Clicking "Download resources" will generate a spreadsheet that contains all resources in that Collection, sorted by resource type. The "Overview" sheet includes instructions on how to use the spreadsheet.


After making changes, users can upload the spreadsheet and see a summary of the changes made before confirming the upload:


For more information, refer to the documentation.
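To illustrate what a pre-upload summary of changes could look like, here is a small Python sketch that compares the downloaded rows with the edited rows. It assumes each row is keyed by a hypothetical resource identifier and only reports added, updated, and unchanged resources; it is not the platform's actual implementation.

```python
def summarize_changes(before, after):
    """Compare two {resource_id: fields} mappings and group the edited rows."""
    summary = {"added": [], "updated": [], "unchanged": []}
    for resource_id, fields in after.items():
        if resource_id not in before:
            summary["added"].append(resource_id)
        elif fields != before[resource_id]:
            summary["updated"].append(resource_id)
        else:
            summary["unchanged"].append(resource_id)
    return summary


# Hypothetical rows as they might appear in the downloaded spreadsheet.
DOWNLOADED = {
    "orders": {"title": "Orders", "description": ""},
    "customers": {"title": "Customers", "description": "All customers"},
}

# The same rows after editing: one description filled in, one resource added.
EDITED = {
    "orders": {"title": "Orders", "description": "Daily order facts"},
    "customers": {"title": "Customers", "description": "All customers"},
    "refunds": {"title": "Refunds", "description": "Refund requests"},
}

if __name__ == "__main__":
    for kind, ids in summarize_changes(DOWNLOADED, EDITED).items():
        print(kind, ids)
```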
