SQL and SPARQL Time Travel

Business context

Querying data in its current state is the most common data catalog use case, but there are times when it is necessary to compare previous versions of datasets, metadata, and lineage. data.world SQL and SPARQL Time Travel allows customers to view changes across metadata and data and even query historical data sources. 

Capabilities

The new feature provides granular insight into audit trails and analysis of data that is snapshotted across time. You can search both ingested data sources and Snowflake virtual tables for previous states of data. Being able to analyze previous versions of a dataset, even simultaneously with the current version of a dataset, enables flexible analysis across various time scales – review data month-over-month, year-over-year, etc.

In data.world, your metadata is also data and therefore fully queryable and reportable. You can compare previous versions of your metadata with current versions in order to understand how your systems and schemas are changing. See new columns, new column names, sensitive data that recently appeared in a field that wasn't there previously, and much more.

Supported operations include previous version, number of versions back (tip-N), specific timestamp, and offset.

Example: SQL Time Travel Query

Example: SPARQL Time Travel Query


Interactive lineage items

For customers leveraging data artifact lineage, the resource items are now interactive. You can click through the icons to view the respective resource pages.

🚨 Default Behavior Change: PATCH API endpoints 🚨

The data.world public API supports several options for programmatically making updates to resources on the platform. PATCH is a method for making partial updates to individual records, such as adding tags, changing a description, or modifying a title.

In the next two weeks, we will be making a change to the way PATCH endpoints modify list values. We outline these changes below.


Existing Merge Behavior

Lists are merged with existing values on PATCH requests

  1. A dataset has tags: [ tag A, tag B ]
  2. A PATCH request is sent to /datasets/democorp/my-example-dataset with body: { "tags": [ "tag C", "tag D" ] }
  3. The dataset is updated to reflect tags: [ tag A, tag B, tag C, tag D ]
  4. A PATCH request is sent to /datasets/democorp/my-example-dataset with body: { "tags": [] }
  5. No change is applied and the tags remain: [ tag A, tag B, tag C, tag D ]


New Replace Behavior

Lists replace existing values on PATCH requests

  1. A dataset has tags: [ tag A, tag B ]
  2. A PATCH request is sent to /datasets/democorp/my-example-dataset with body: { "tags": [ "tag C", "tag D" ] }
  3. The dataset is updated to have tags: [ tag C, tag D ]. tag A and tag B have been removed.
  4. A PATCH request is sent to /datasets/democorp/my-example-dataset with body: { "tags": [] }
  5. The dataset is updated to remove all tags.
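
The difference between the two behaviors can be modeled with a short Python sketch. The functions merge_list and replace_list are hypothetical illustrations of the semantics described in the two walkthroughs, not data.world's implementation. In particular, how the existing merge logic treats duplicates is an assumption.

```python
from typing import List


def merge_list(existing: List[str], patch: List[str]) -> List[str]:
    """Existing behavior: patched items are appended to the stored list."""
    merged = list(existing)
    for item in patch:
        # Assumption: an item that is already present is not added twice.
        if item not in merged:
            merged.append(item)
    return merged


def replace_list(existing: List[str], patch: List[str]) -> List[str]:
    """New behavior: the patched list overwrites the stored list."""
    return list(patch)


tags = ["tag A", "tag B"]

# Existing merge behavior (steps 2-5 of the first walkthrough).
merged = merge_list(tags, ["tag C", "tag D"])
merged_after_empty = merge_list(merged, [])  # an empty list changes nothing

# New replace behavior (steps 2-5 of the second walkthrough).
replaced = replace_list(tags, ["tag C", "tag D"])
replaced_after_empty = replace_list(replaced, [])  # an empty list clears all tags

print(merged)
print(merged_after_empty)
print(replaced)
print(replaced_after_empty)
```

Note that under the new behavior an empty list is the way to clear a list field with PATCH, which the merge logic could not express.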


Why we are making this change

Today, PATCH can be used to add, modify, or remove fields for all non-list values. With the current merge logic, items can only be appended to list values using PATCH. As a consequence, if you want to remove or reorder the items in a list, you must use the PUT method, which does not support partial updates and requires a full overwrite of the existing record. The new logic to overwrite list values will allow users to make partial updates to records that remove or modify the order of items in the list without needing to modify the entire record.

This new logic primarily impacts tags, file labels, collections, and multi-select custom metadata fields.
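
Clients that relied on the merge behavior to append items will need to send the full desired list once the replace behavior is live. The following is a minimal sketch of building such a request body, assuming the current list has already been retrieved. The helper build_append_payload is hypothetical, and the field name "tags" follows the examples above.

```python
import json
from typing import Dict, List


def build_append_payload(
    current: List[str], to_add: List[str], field: str = "tags"
) -> Dict[str, List[str]]:
    """Return a PATCH body that keeps the current items and appends new ones.

    Hypothetical helper: under replace semantics the body must carry the
    complete list, so the current values are included explicitly.
    """
    combined = list(current)
    for item in to_add:
        if item not in combined:  # skip items that are already present
            combined.append(item)
    return {field: combined}


payload = build_append_payload(["tag A", "tag B"], ["tag C", "tag B"])
body = json.dumps(payload)
print(body)
```

The same pattern covers removal and reordering: compute the final list on the client and send it as the value of the field.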

Advanced Search Builder + New All Results Tab

This week, our search page got a face lift and we unveiled new tools for searching on data.world.

Explore data.world's rich advanced search syntax with the Search Builder tool on our main search page. This friendly form helps you construct more complex searches with multiple filters, logical operators, categories, and custom metadata fields. The Search Builder can be accessed by selecting the "Advanced" option above the filters list on the main search page.

This release also includes changes to our main search page. You'll notice a new layout on the All Results tab that shows the top 3 search hits by type for your term. This tab now shows more results per page and gives users a high level overview of the types of resources they can find on the platform. Hover over the circular "i" icon for more details about the result. More targeted results can be viewed on the Resources tab. You'll also notice changes to the category tabs at the top of the page. Resources, Organizations & People, Comments, and Columns each have their own tailored search experience.

The new search experience is available today for select users and will be available for all users early next week.

Beta: Postgres Proxy

We're happy to announce the beta release of our new Postgres Proxy support. Our goal is to allow any BI or data science tool that supports PostgreSQL to connect directly to data.world. No third-party integration support required!

data.world provides a federated engine to query data from multiple data systems simultaneously, at the source. By using Postgres Proxy, it’s now easier than ever to extend these capabilities to your favorite analysis tools for quickly accessing and creating value from data.

To connect to data.world using the proxy, simply create a new PostgreSQL connection, configured as follows:

host: postgres.data.world
port: 5432
user: {your data.world user id}
pass: {read/write token}
db:   agentid/datasetid

You can find your read/write token in the user settings. If you have any issues or questions, don't hesitate to reach out.

Note: for single-tenant customers, set host to postgres.{site}.data.world.
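
As an illustration, the settings above can be assembled programmatically before handing them to a PostgreSQL client. This is a sketch using only the Python standard library. The function names and the sample values (jane, s3cret, acme) are hypothetical, the dictionary keys follow the standard libpq keyword names, and whether a given tool accepts the URI form with a percent-encoded slash in the database name is an assumption to verify with your client.

```python
from typing import Dict, Optional
from urllib.parse import quote


def proxy_connection_params(
    user: str,
    token: str,
    agent_id: str,
    dataset_id: str,
    site: Optional[str] = None,
) -> Dict[str, str]:
    """Build connection settings for the proxy (hypothetical helper)."""
    # Single-tenant installations use postgres.{site}.data.world.
    host = f"postgres.{site}.data.world" if site else "postgres.data.world"
    return {
        "host": host,
        "port": "5432",
        "user": user,
        "password": token,
        "dbname": f"{agent_id}/{dataset_id}",
    }


def to_uri(params: Dict[str, str]) -> str:
    """Render the settings as a postgresql:// URI, percent-encoding each part."""
    return "postgresql://{user}:{password}@{host}:{port}/{dbname}".format(
        user=quote(params["user"], safe=""),
        password=quote(params["password"], safe=""),
        host=params["host"],
        port=params["port"],
        dbname=quote(params["dbname"], safe=""),  # "/" becomes %2F
    )


params = proxy_connection_params("jane", "s3cret", "democorp", "my-example-dataset")
uri = to_uri(params)
single_tenant = proxy_connection_params("jane", "s3cret", "democorp", "my-example-dataset", site="acme")
print(uri)
```

Most PostgreSQL drivers accept either the individual keyword values or a URI, so the dictionary form is usually the safer starting point.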

Metrics update: July 28, 2021

For enterprise customers, updated metrics have been released today to your Usage and Governance Reporting (ddw-metrics-*) dataset to fix some minor bugs and improve performance.

Potential observable changes:

  • For both single-tenant and multi-tenant customers, some reports were not reflecting de-provisioned user accounts. With this fix, those reports now count de-provisioned user accounts in addition to active user accounts. Multi-tenant customers may find a slight increase in counts in Visits - Return Visitors - Daily, Visits - Return Visitor Days, Visits - Unique Visitor Days and Visits - Unique Visitors - Monthly. Single-tenant customers may find a slight increase in Visits - Unique Visitors Daily and Visits - Unique Visitor Days.
  • For multi-tenant customers, under certain circumstances some reports could duplicate-count users that were members of multiple sub-organizations. With this fix, you may find a slight decrease in counts in Membership - By Date, Visits - Adoption Daily and Visits - Avg Visits Weekly.

If you have any questions or concerns, please let us know at support@data.world or via your customer success representative.

Coming Soon: Search page improvements

In August, we plan to release a series of improvements to our search page including: 

  • Category tabs to replace the result type dropdown
  • Collapsible filter groups for the left sidebar
  • A new discovery-driven All Results Page to highlight a more comprehensive set of result types for broad searches
  • A new Advanced Search Builder utility for more complex searches that take advantage of boolean operators and our advanced search syntax


Coming Soon: Resource page breadcrumb navigation

We're investing in navigation enhancements across the site for community and enterprise users. This week, we plan to roll out a beta breadcrumbs feature to help you get back to your list views from your resource pages. This first beta release will include easy access to your organization page from metadata catalog resources, quick filters to navigate back to similar items in your catalog, and basic hierarchical navigation for things like tables and columns. Additional hierarchical navigation support is planned for the coming months.


Announcing new data.world Documentation Portal

We're thrilled to announce the release of our brand new documentation portal. We've remastered all our platform documentation content to make it easier to read, search, and navigate. You can now get all the answers you need in just one place!

These are some of the improvements and new navigation experiences you'll see in our new portal:

- Home page menus

Use the top menus to navigate the documentation for specific product versions: Community docs for data.world community members and Enterprise docs, specifically written for our enterprise customers who have needs and use cases outside of the scope of our community users.

- Search improvements 

Search is one of the most important ways to find the information you need, so we’ve introduced a new search experience. When you enter a query, you’ll immediately see some suggestions.

- Easier access to everything you need

Get easy access to the integrations gallery, API docs, and Grafo documentation.

Check it out at https://docs.data.world/.
