data.world Usage and Audit Events now available as a Snowflake Marketplace Private Listing

As a Snowflake Powered By partner, data.world is proud to announce that usage and audit event data, previously only available in a data.world dataset, is now available as a Snowflake Marketplace Private Listing.  This allows data.world customers access to their full history of data.world events data via the Snowflake Data Cloud enabling high performance and advanced analytic functions on this data.  It also makes data.world events and logging data available via Snowflake with no ETL required for integration in a wide variety of use cases.  To read more about this capability and how to request access to a Private Listing, please see our documentation here.

Resource page navigation just got easier!

Our resource pages just got easier to quickly scan and navigate. With the latest update now in production, the changes mentioned in our earlier post, provide users with the ability to scan the available metadata and get the insights needed to make decisions efficiently and effectively.

The quick section navigation links on the overview tab of your collection, resource, and glossary pages will make navigating the areas of the resource page faster. 


We've also separated the overview and the details of the different types of related resources by giving the related resources their own tabs. The related resources are now highly scannable, sortable, expandable, and searchable. These views offer an organized and condensed presentation of the metadata, making it easier to quickly access and understand the information.


As always, you can read more about these changes in our documentation portal. If you have feedback, please leave a suggestion in our support portal. For those interested in learning more about our data discovery solutions, please visit our website and book a demo or reach out to your customer service representative today. We look forward to helping you and your teams discover your data!

Advanced Search now in the global search bar

We are happy to announce the release of our latest feature: Advanced Search within the global search bar. 


In addition to adding Advanced Search to our global search bar, we've improved it by including the ability to pre-scope your search to a certain Organization. If you are an Enterprise customer with multiple organizations, you can now pre-select the organization to which you want to limit your search.

But wait, there's more!

For customers who have hierarchical collections, you'll want to check out our beta release of the collection picker in the Advanced Search modal. This gives users the ability to quickly scope their search to a branch of collections in the domain hierarchy. Be sure to turn on the beta feature flag in the advanced settings to see this feature.
 


The new advanced search features in the global search bar are designed to provide a more intuitive and powerful data discovery experience from login. With these new capabilities, you'll be able to find the data and insights you need, faster than ever before.

Refer to the advanced search documentation to learn more.

We're committed to continually improving our platform and are eager to hear your feedback. Please feel free to reach out via our support portal to share your thoughts, experiences, and suggestions regarding our new Advanced Search features. Visit our website to learn more about Data Discovery or book a demo. Together, let's unlock the true potential of data-driven decision-making.

Bulk upload and edit all metadata resources

We're thrilled to bring a much-anticipated feature to the platform: the ability to bulk edit all metadata resources by Collection!

Users could previously bulk load and edit business glossary terms, but now this is possible for any type of resource. Simply download a spreadsheet of all the resources in a Collection, enrich them by editing fields or add new resources, and upload the changes to the platform.

On the Settings page of a Collection, users with edit access to the Collection will see the following modal:

Clicking "Download resources" will generate a spreadsheet that contains all resources, sorted by resource type, in that collection. This spreadsheet contains helpful instructions on how to use the spreadsheet on the "Overview" sheet.


After making changes, users can upload the spreadsheet and see a summary of the changes made before confirming the upload:


For more information, refer to the documentation.

Groups access management just got easier!

Organization admins will love our new Groups access summary feature that gives them a full view of the various access a Group might have - organization-wide or direct access to collections and data workspaces (datasets and projects).

To find this new view, go to your Organization and click on Members > Groups > Access summary


You will see two sections: 1) organization-wide group access provides the group access to the entire catalog or all of the workspaces beyond the member default and 2) the direct access control section provides a view of the access the Group has to individual collections or workspaces along with the level of access to each. Users can manage direct access from this view without having to visit each collection or workspace.


This new view makes Group access management much easier by providing a one-stop summary of Group access. You can read more about managing Groups in our documentation portal.

At data.world, our goal is to help organizations unlock the full potential of their data. If you're interested in learning more about agile data governance, please visit our website and book a demo or reach out to your customer service representative.  We look forward to helping you manage your data and transform the way your users discover it!

Introducing the Metadata Collectors Page and CLI Command Builder

Introducing the new Metadata Collectors page and CLI Command Builder tool!

With these features, catalog administrators can more easily set up on-premises collectors and see all the collectors (on-premises or cloud) that are bringing metadata into their catalogs.

Currently in BETA, this feature has two components:

  • CLI Command Builder: allows users to step through a wizard to set up 16 on-premise collectors: Athena, AWS Glue, Databricks, Db2, dbt, MySQL, Oracle, PostgreSQL, PowerBI, PowerBI Gov, Presto, RedShift, Snowflake, SqlAnywhere, SQLServer, Vertica. The wizard generates either a CLI command or a YAML file, so users can more quickly set up collectors during implementation. For more information, refer to the documentation (available sources are denoted as "collector wizard available"). In an upcoming release, all on-premise collectors will be available.

  • Metadata Collectors Page: found in the Settings tab of an Organization, this page shows all of the collectors that are currently appearing in your catalog and other important information, such as the last time the collector ran. This page also includes cloud collectors set up via Connection Manager. For more information, refer to the documentation.

Preview our latest navigation changes

The data.world enterprise catalog provides a 360-degree view of your data resources and semantics. Available in preview today, we've added quick section navigation links on the overview tab of your collection, resource, and glossary pages and separated related resources into their own, sortable, searchable views.

With this upcoming change, users will be able to scan the available metadata and get the insights needed to make decisions efficiently and effectively.


These views offer a highly organized and condensed presentation of the metadata, making it easier to quickly access and understand the information.



We'd love your feedback and thoughts before we roll them into the main UI. To see the views, please click "Turn on preview" in the banner at the top of any collection, resource, or glossary term page. To leave your feedback, please visit the help section (question mark in the lower left of the global navigation) and leave a suggestion via the support link.


You can read more about these changes in our documentation portal. If you're interested in learning more about our data discovery solutions, please visit our website and book a demo or reach out to your customer service representative today. We look forward to helping you and your teams discover your data!

🚀 Take your catalog to the next level with Collection hierarchy

We're happy to announce a NEW FEATURE called Collection hierarchy. This feature will help our enterprise customers organize their domain-driven data catalogs to simplify data management and improve data discovery. 

Collection hierarchy is a tree-like view of your hierarchical collection relationships viewable on the collection overview tab. It organizes data resources and semantic concepts into increasingly more granular or specific groups based on their common characteristics - like domains, categories, markets, etc. It allows you to express your data taxonomy in a way that makes sense to your users:

  • Easily find relevant data: With resources and terms grouped into collections, users can quickly navigate relevant collections.
  • Simplify data management: By grouping data assets into hierarchical collections, you can simplify the process of assigning manage and edit levels of the catalog metadata.

For example, a sales steward might organize resources and terms into different Sales subcollections, such as Sales by Region, Sales by Product, and Sales by Customer. With this structure, your sales analysts can easily find the data they need to make more informed decisions.


Collection hierarchy, combined with other recently released features like Groups, Collection Access Control, and the ability to create new Collection types, provides organizations with the building blocks to build powerful data products and organize their domain-driven data catalog, simplifying data management, and improving decision-making. We look forward to seeing the positive impact it will have on our customers.


At data.world, our goal is to help organizations unlock the full potential of their data. If you're interested in learning more about data mesh and our solutions, please visit our website and book a demo. You can also read more about our collection features on the documentation portal. We look forward to helping you manage your data and transform the way your users discover it!


Harvest your Amazon S3 bucket and object metadata with our newest Collector

Introducing our newest metadata collector: Amazon S3 🎉 

The Amazon S3 Collector catalogs buckets and objects, allowing you to quickly search and discover your data. This new collector harvests metadata about buckets and objects, including the Region, Version State, Size, Last Modified Data, ACL Owner, Grantee and Grant Permission, amongst others (see the full list in the documentation). 

Inside the data.world platform, users will be able to view the relationships between S3 buckets and Objects, enhancing data discoverability. Using our configurable UI, you can display which pieces of metadata are most important to you, such as ACL Permission or S3 Metadata Keys and Values.



Learn how to use the new Amazon S3 Collector in our documentation, or please reach out if you have questions!

Profiling: a new kind of metadata

With the new year comes new features! We are pleased to launch our newest metadata capability: data profiling. This new feature creates metadata describing summary statistics for columns when a collector is run.

These summary statistics will help you understand and trust your data by providing a quick look at the data. For instance, viewing stats like the minimum and maximum values shows the shape of the data, allowing you to know quickly if your data is as expected. 

How can you create profiling metadata? This feature is currently available via the Snowflake, SQL Server, PostgreSQL, and Redshift collectors with more collectors on the near horizon. There are three optional commands that can be used during the collector run to generate the profiling metadata: 

--enable-column-statistics  description: enables harvesting of column statistics

--sample-string-values  description: enables harvesting of histograms for columns containing string data

 --target-sample-size  description: controls the number of rows sampled for computation of column statistics and string-value histograms

You can read more about these commands on the following collector documentation pages:

Show Previous EntriesShow Previous Entries