Designate metadata as read-only in the catalog.

We now support the ability to designate certain metadata fields and/or entire resource types as read-only for our catalog customers. This will prevent edits/suggestions in the data.world interface and keep the values synced up with their original source of record.

To get started with this feature, please reach out to customer success!

 


The New Organization Profile Page

The new and improved Organization Profile Page is the default landing page for all organizations across data.world.

Learn about the Core Navigation changes that redirect all organization-specific links to this page, or watch the walk-through videos to learn about the updated functionality you'll find here.


enterprise Enterprise Organizations

Create and manage organization-level resources and connections in a consolidated experience—tailored to your level of access. Discover data faster with custom filters and advanced search syntax for all catalog resources and glossary terms.


community Community Organizations

Share information about your organization with the data.world community, curate datasets and projects, and manage memberships—all from the Organization Profile Page.


improvement Organization-level Connections

Create and manage database connections for your organization as an admin—whether syncing data to your Community datasets or collecting catalogs as an Enterprise organization.


Metrics update: December 17, 2021

Updated metrics tables/reports have arrived on December 17, 2021! Some reports may take 24-48 hours to reflect the new data after deploy due to sync timing.

Data dictionary has been updated to reflect the latest updates as well.

Updated Tables - For both multi-tenant and single-tenant

  1. Events - Metadata Assets Activity - By Day: Column name changed from “resourceid” to “resource” - this change was applied in order to bring this table into conformity with the dimension naming convention used elsewhere in the metrics dataset.
  2. Membership - All Time List: Added “current_member” column (boolean; TRUE: account is currently provisioned; FALSE: account is currently de-provisioned). Added “last_date_active” column (the date of the user’s most recent activity in data.world).
  3. Tops - Requests: Name of table/report changed to “Tops - Most Requested Resources.” Added “resourcetype” column (dimension; indicates whether the requested resource was a dataset, group, etc.).

Collections tab on the Organization Profile Page

Enterprise organizations can now search, sort, and filter collections with the dedicated Collections tab on their Organization Profile Pages.

Collection tab on an organization profile page in data.world, with options for filtering, sorting, and searching.

Members of an organization can look up collections based on title, description, creation date, and more. Admins can also create new collections directly from the Collections tab.

Reset lineage viewport

Data artifact lineage improvements continue as we introduce the ability to reset the lineage component viewport. For customers with large, complex data artifact lineage, we've heard that it can be difficult to reorient once you start exploring the visualization. With this new "Reset view" button, the view and zoom are immediately reset to the starting orientation.


Interactive Lineage hover elements

Another neat feature for our customers leveraging data artifact lineage! In addition to being interactive, the resource items now display a summary card when hovering. You can preview metadata at a glance and click through to the collection, individual tags, or to the resource itself.


Metrics update: October 18, 2021

Updated metrics tables/reports have arrived on October 18, 2021! Some reports may take 24-48 hours to reflect the new data after deploy due to sync timing.

Data dictionary has been updated to reflect the latest updates as well.

Updated Tables - For multi-tenant

  1. Events - Dataset or Project Views By Org - Name changed (from “Events - Views by Org”) and column name “dataset_views” changed to “views”
  2. Events - Searches - Last 90 Days - Fixed a bug that sometimes caused duplicate rows
  3. Membership - Daily Counts - By Org - Name changed (from “Membership - Daily - By Org")
  4. Resources - Org Owned Database connections - Name changed (from “Resources - Database connections”) and added column “owner”
  5. Tops - Bookmarks - Extended range to all users (it previously was limited to the top 10 users) and added column “displayname”
  6. Tops - Dataset Creation - Extended range to all users (it previously was limited to the top 10 users) and added column “displayname”
  7. Tops - Most Bookmarked Resources - Extended date range to all resources (it previously was limited to the top 10 resources)
  8. Tops - Most Comments - All Time - Extended date range to all resources (it previously was limited to the top 10 resources)
  9. Tops - Most Searched Terms - Fixed a bug that sometimes caused duplicate rows
  10. Tops - Most Viewed Resources - Added “catalog” type category to the resource_type variable
  11. Tops - Pageviews By Resource and Agentid - Added “catalog” type category to the resource_type variable

Updated Tables - For single-tenant

  1. Events - Dataset or Project Views By Org - Name changed (from “Events - Views by Org”) and column name “dataset_views” changed to “views”
  2. Resources - Org Owned Database connections - Added column “owner”
  3. Tops - Bookmarks - Extended range to all users (it previously was limited to the top 10 users) and added column “displayname”
  4. Tops - Dataset Creation - Extended range to all users (it previously was limited to the top 10 users) and added column “displayname”
  5. Tops - Most Viewed Resources - Added “catalog” type category to the resource_type variable
  6. Tops - Pageviews By Resource and Agentid - Added “catalog” type category to the resource_type variable



Gra.fo Feature Round Up: October 2021

Watch this month's Gra.fo Round Up to learn more about our recently released features!

1. Drag and drop relationships

Relationship arrows can now be repositioned with a drag and drop action.

2. Export concept as PNG

Download a snapshot image of a concept and its immediate context.

3. Embedded image previews for link to concept

The link to concept feature now includes rich image previews.

4. Composite graphs

Link multiple Gra.fo documents together into one workspace. Separate complex models into subgraphs or extend reference documents that are used in multiple projects.

Round Up


Composite Graph Demo


Beta: Sensitive Data Discovery

Business context

A key aspect of data compliance is knowing where sensitive data lives and applying classifications that relate to policies that inform business processes for proper tracking and management. Identifying sensitive data, applying these policies, and reporting on this information can be an extremely time consuming and error-prone task if attempted manually.

data.world’s Sensitive Data Discovery automates discovery and classification, making it easier for enterprise customers to identify sensitive data and take action on it within the catalog.

Capabilities

Scan – Use advanced machine learning to identify sensitive data types like email addresses, names, ID numbers, locations, protected health information, and 40+ additional data types identifiable out of the box.

Classify – Apply policy classifications, tags, and statuses such as Restricted, Personal Information, US Only, etc. These classifications help maintain the integrity and confidentiality of your data. They are driven by your scan results and other metadata, as dictated by your unique business logic and terminology.

Take Action – Report and audit sensitive data types and policy classifications across your data landscape, understand how it changes over time, and drive better compliance and governance in your organization.

Integrate – Leverage Sensitive Data Discovery metadata as part of your broader metadata orchestration strategy with APIs and bulk export. Our open and extensible platform makes it easy to plug in your broader ecosystem of additional Sensitive Data Discovery tools and platforms for even greater governance capabilities.

Screenshots

Resource page example

Search results example

If you are an existing data.world customer and would like to be included in the private beta, reach out to your Client Success Director for more information.

SQL and SPARQL Time Travel

Business context

Querying data in its current state is the most common data catalog use case, but there are times when it is necessary to compare previous versions of datasets, metadata, and lineage. data.world SQL and SPARQL Time Travel allows customers to view changes across metadata and data and even query historical data sources. 

Capabilities

The new feature provides granular insight into audit trails and analysis of data that is snapshotted across time. You can search both ingested data sources and Snowflake virtual tables for previous states of data. Being able to analyze previous versions of a dataset, even simultaneously with the current version of a dataset, enables flexible analysis across various time scales – review data month-over-month, year-over-year, etc.

In data.world, your metadata is also data and therefore fully queryable and reportable. You can compare previous versions of your metadata with current versions in order to understand how your systems and schemas are changing. See new columns, new column names, sensitive data that recently appeared in a field that wasn't there previously, and much more.

Supported operations include previous version, number of versions back (tip-N), specific timestamp, and offset.

Example: SQL Time Travel Query

Example: SPARQL Time Travel Query


Show Previous EntriesShow Previous Entries