⚡️ data.world March Product Launch: AI, Usability, and Lineage, Supercharged

This month’s release is packed with powerful updates designed to unlock more from your catalog. From the launch of Archie Chat, your AI-powered assistant for instant answers, to a new design and functionality for our Resource pages, we’re bringing faster workflows and deeper insights to every user. We’ve also rolled out new and enhanced collectors—from Amazon DMS to AWS Glue, ADF, and beyond—giving data teams the visibility they need to govern modern, complex data stacks. And coming soon: new Governance Dashboards that bring actionable insights, helping you measure engagement and optimize your data strategy.

🔥 What’s New?

💬 Archie Chat – Your AI-Powered Data Assistant is Live

Say hello to Archie Chat, the intelligent assistant that turns your data catalog into an interactive knowledge hub. Now available in Public Preview, Archie delivers instant, context-aware answers. Archie Chat answers questions and helps you navigate the catalog instantly, using context from your own data resources, glossary terms, and our product documentation to help you get answers faster.

Data consumers can use natural language to ask questions like “Tell me what you know about marketing campaign attribution” or “What does ACV mean?”, and get quick, reliable answers from your catalog. Stewards and admins can ask things like, “What’s the fastest way to add lots of relationships at once?” and get immediate, actionable guidance—no digging through docs required. 

Quick Tip: Archie is now in Public Preview and free to try. Reach out to your Customer Success representative to get started today!


 Resource Page Redesign – Usability Reimagined

The Resource Page redesign and new functionality is here. We've overhauled the resource page experience, creating a more intuitive layout, providing edit history, and other new powerful features that put speed and insight at your fingertips:

  • Streamlined layout and design for faster understanding and navigation
  • Rich activity feed that capture your data's story
  • Inline editing capabilities
  • New preview options
  • Enhanced filtering options

The new Activity feed is now generally available and you can read more about it here. The redesigned Resource Overview page and tabs are in Public Preview and users can read more about that here. You can read more about it here. One click is all it takes to transform your workflow.


🔄 Amazon Data Migration Service Collector – Lineage Through the Cloud Migration Journey

Our new Amazon DMS Collector brings visibility into your cloud migration pipelines by capturing lineage from AWS Data Migration Service (DMS) jobs. Whether you’re migrating on-prem databases to the cloud or using DMS for change data capture (CDC), this collector gives you line-of-sight traceability into how data moves across systems. Now, you can govern these migration flows with confidence, maintain compliance, and ensure your catalog reflects your evolving data architecture.

💫 What’s Improved?

🧬 AWS Glue Collector – Deeper Metadata, Stronger Lineage

We’ve supercharged the AWS Glue Collector to bring in even richer metadata and more complete lineage. The collector now captures detailed metadata from Glue Data Catalog tables—including file types, sizes, serializers, and deserializers—as well as improved job metadata. Even better, it now builds lineage to S3 objects, providing full traceability back to the ultimate source. This enhancement empowers more robust impact analysis and stronger governance across your AWS data landscape.

🔗 Azure Data Factory Collector – Smarter Lineage for Parameterized Pipelines

The ADF Collector just got an upgrade: it now captures lineage from parameterized datasets, surfacing both upstream and downstream references in complex, dynamic pipelines. This gives data teams clearer visibility into data movement across ADF workflows—especially in cases where pipeline logic changes based on parameters—resulting in more accurate lineage, improved trust, and tighter control over your data flows.

🔧 Public API Enhancements – Create & Manage Discussions Programmatically

Now you can create and manage discussions directly through the public API, making it easier to capture critical conversations wherever they happen. Whether integrating with ticketing systems, BI tools, or custom workflows, this enhancement turns your catalog into a true collaboration hub—bridging the gap between data context and team communication. By embedding conversations directly into the metadata layer, you unlock smarter decision-making and greater operational efficiency.


🔮 What’s Next?

📊 Coming Soon - New Governance Dashboards

Later this week, we’re launching a new suite of Governance Dashboards in the Admin Portal—your command center for understanding how your data catalog is being used. These interactive, visual dashboards give instance administrators the tools they need to monitor platform activity, analyze user engagement, and uncover trends across your data.world environment. From search behavior to resource metrics and usage, and daily active users, you now have the clarity to spot what’s working, what’s being overlooked, and where to invest next.

Built with ready-to-use views and flexible filters, these dashboards empower teams to make faster, smarter decisions about content strategy, user enablement, and platform optimization. This feature is exclusively available to users with Instance Administrator permissions in private instances and single-tenant deployments.


🔄 Coming Soon – New Marquez / OpenLineage Collector

Our upcoming Marquez Collector brings support for the OpenLineage standard, enabling teams to capture lineage from custom pipelines built with Python, PySpark, SQL, and more. By pulling lineage directly from the Marquez metadata store, this collector helps you document and govern bespoke or proprietary data pipelines—even when no native connector exists.

🏢 Coming Soon – New SAP HANA Collector

We’re expanding our enterprise coverage with a new SAP HANA Collector, designed to harvest standard database metadata—and even some lineage—from SAP’s powerful data warehousing platform. This collector will help customers bring critical SAP assets into the catalog for better visibility, governance, and reuse.

Ready to unlock more from your data? Contact your Customer Success representative to explore these groundbreaking updates and unlock the full potential of data.world, the simpler and smarter catalog.


🚀 Smarter Data Management: New AI-Powered Tools + UX Upgrades!

This month, we’re rolling out enhancements designed to streamline data management, improve usability, and boost productivity. From AI-powered bulk descriptions to a smoother access request experience—and two exciting launches on the horizon—there’s a lot to explore!

🔥 What’s New?

Archie Bulk Describe – Now in Private Preview (Beta)

Say goodbye to manual descriptions and hello to AI-powered enrichment at scale! Archie Bulk Describe lets you generate high-quality descriptions for multiple resources in just a few clicks, making it an essential tool for curators and data product owners. With this feature, you can quickly generate AI-suggested descriptions, ensuring your catalog remains accurate and insightful—without the tedious manual work.

If you’re not yet part of the Archie Private Preview, reach out to your Customer Success Manager today! Need help getting started? Check out our product documentation for details on how to enable Archie Private Preview features and how to run Archie Bulk Describe in your catalog.


💫 What’s Improved?

A Smoother, Smarter Way to Request Access

We’ve made a series of improvements to access request workflows, making them faster and more intuitive. With auto-filled titles and smart auto-skipping, requesting access to resources—or creating new ones—has never been easier. These updates reduce friction, minimize manual steps, and ensure a seamless, consumer-grade experience for enterprise users.

This is part of our ongoing commitment to refining the user experience to feel simple and intuitive.

🔮 What’s Next?

✨ Resource Page Redesign – A New Era of Usability

We’ve completely reimagined the Resource Details Page to make navigating and understanding data assets easier than ever. With an intuitive new layout, edit history, improved summaries and layout, finding the information you need will feel effortless. We’ve also made everything quicker—quicker access to lineage, quicker metadata edits with inline editing, and quicker ways to explore related data with more configuration, filtering and sorting options. The result? A night-and-day improvement in user experience that will feel simple yet smart.

Very soon, this redesign will be available in Public Preview, giving end-users the opportunity to opt-in and experience the new workflow before full rollout.

💬 Meet Archie Chat – Your AI-Powered Data Assistant

Navigating your data catalog just got smarter. Archie Chat (soon in Public Preview Beta) brings an AI-powered, business-context-aware chat experience to your platform, answering questions instantly, reducing friction, and improving adoption. Instead of digging through documentation, catalog admins can now ask “How do I bulk edit metadata?” and get an immediate, actionable response. Data consumers can type natural language questions like “Is ‘Occupation’ a sensitive data type?” or “What does TMA stand for?” and get more precise answers sourced from the context of your data catalog.

Archie Chat is launching in beta, free for a limited-time, so be among the first of our customers to explore this game-changing feature! If you aren't already an Archie-enabled customer, reach out to your customer service manager to find out how you can be ready to try Archie Chat when it releases.

data.world January Product Launch

January Releases: Expanding Connectivity, Simplifying Setup, and Enhancing Lineage Exploration

This month, we’re delivering powerful updates to enhance metadata collection, streamline setup, and improve lineage exploration. We’re expanding our connectivity with new on-premise collectors for Apache Airflow and Qlik Talend, enabling deeper metadata harvesting and lineage tracking for critical ETL and workflow automation tools. People field configuration is now more intuitive, allowing user accounts to be dynamically selected for ownership and stewardship, reducing manual setup and improving governance. Finally, our new public API endpoint for lineage querying makes it easier for customers to customize lineage exploration with flexible queries and standardized outputs. These updates help teams work smarter, get to insights faster, and build on top of our platform with greater ease. 🚀



Support for Airflow and Talend on-premise Collection

By the end of January, the data.world collector integrations will include new collectors for Apache Airflow and Qlik Talend Data Integration (the on-premise version of Qlik’s Talend product). Airflow is an open source workflow automation tool that many enterprises use to schedule and manage data engineering and analytical tasks. The new collector will harvest metadata about these workflows–called Directed Acyclic Graphs–and the tasks contained within them. Qlik Talend is a data integration product that facilitates extract, transform, and load (ETL) processes; the new collector will identify sources and targets of these processes and harvest lineage relationships representing the flow of data between them.

These collectors will initially be available as on-premise collectors only, but will also be available as cloud collectors in early February.

Streamlined Setup of People Fields

Configuring ownership and stewardship just got easier! In addition to supporting people as collected resources, customers can now utilize their user accounts to populate people fields. This update streamlines setup, providing an intuitive approach that helps teams quickly setup, ensuring seamless attribution and governance from the start and helping end-users connect with the right people. You can read the documentation for this feature here.

Screenshot of people field search and select


Resource Lineage Support in Public API

We’re making it easier than ever to programmatically explore lineage with our new Catalog Lineage Public API endpoint! This update provides flexible query options, allowing customers to tailor lineage exploration to their needs to build lineage based tooling, automations, and integrations. This is a win for all lineage customers looking for deeper insights and more intuitive ways to navigate their data relationships. 

UX Changes Coming Soon

Activity feed for Resources

Soon, we’ll be introducing a new Activity tab on Resources, Glossary and Collection objects that show edit history and other activity in the UI. This will make it easier for users to quickly understand how the resources have been updated and changed. Announcement of the release will soon follow.

Resource page redesign

Along with a new activity feed, we'll be introducing a newly designed details page that offers more intuitive navigation, better use of whitespace, configurable relationship tabs, inline editing, and other features that will make the resources both easier to understand and scan but also easier to enrich and edit. Announcement of this release will follow in this quarter.

data.world October Product Launch


The October release of data.world brings a wide variety of new capabilities and improvements across the platform – read on to learn more about the GA of Databricks Publisher, new collectors for MongoDB and Alteryx, Okta support in SCIM, the GA of the improved search experience, and more!

Additionally, we highlight some changes made to the data.world Open Data Community to improve privacy and preserve the quality of open data and the user experience.


Databricks Publisher Premium Automation

We’re excited to announce the GA launch of the Databricks Publisher Premium Automation! This new feature allows users to seamlessly publish metadata from data.world to Databricks, simplifying the process of managing and synchronizing key data attributes. Specifically, users can now automatically publish table and column descriptions from data.world to Databricks and push selected metadata attributes as Databricks tags. Whether you prefer manual updates or fully automated syncing, this automation ensures that metadata remains consistent between platforms, reducing manual effort and improving data integrity. With data.world now acting as the source of truth, your metadata stays up-to-date across systems effortlessly.

For more information, see the product documentation.


New MongoDB and Alteryx Collectors

This month, we’re excited to announce new MongoDB and Alteryx Collectors, both available in Private Preview. If you’re interested in early access to either of these new collectors, please reach out to your Customer Service Director.

MongoDB Collector

The MongoDB Collector catalogs metadata from MongoDB, helping maintain a comprehensive inventory of MongoDB assets, facilitating better governance, discovery, and utilization of data across your organization.

This collector harvests metadata for MongoDB databases, collections, views, indexes and more.

An example collection from MongoDB

Alteryx Collector

The Alteryx Collector catalogs metadata from Alteryx, helping maintain a comprehensive inventory of Alteryx assets, facilitating better governance, discovery, and utilization of data across your organization.

This collector harvests metadata for workflows, workflow nodes, workflow jobs, connections, schedules and more.

An example collection from Alteryx


Improved lineage for SQL Server

The SQL Server Collector now collects additional lineage relationships not previously captured through SQL parsing using built-in SQL Server functions that describe relationships between objects (such as, in some cases, the columns and tables referenced by views or stored procedures).

For more information and detail, see the description of lineage collected by the SQL Server Collector in the product documentation.


Support for Okta in SCIM

The active Private Preview of SCIM (System for Cross-domain Identity Management) now additionally supports Okta (in addition to Microsoft Entra ID), allowing customers who use Okta as their enterprise identity provider to have automated management of users and groups in data.world.

If you are interested in being part of the SCIM Private Preview, or just want to learn more, please reach out to your Customer Service Director.


Webhook Authorization enhancement

Webhooks now support an optional authorization key parameter to help consuming applications verify the origin and permissions for an incoming webhook. Learn more


Collection Details in Technical Reference

By popular request, the Technical Reference page for catalog resources now includes details about the collections the resource belongs to. Learn more


Relative Time Advanced Search Syntax

Create powerful saved searches for resources by updated and created dates using three new relative time options:

  • `created:today` 
  • `updated:yesterday`
  • `created:{last 30 days}`

See the product documentation on creating advanced searches to learn more.


UX Improvements

Adding the new search experience to Organizations: Our new search experience has been a big hit with users. It’s faster, cleaner, and provides more advanced features in the UI. We've fully retired the classic experience and brought the new search features to the Resources, Glossary and Collection landing pages.


Coming Soon! Advanced relationship editing: We're adding new improvements that make it easier to find the right resources and add or remove more than one relationship at a time.


Coming Soon! More look-and-feel updates: Next up in our work to update and modernize our UI, we'll be swapping the old default avatars to a newer color palette and default avatar design that utilizes letters. This change will also provide a more accessible experience as it gives users the ability to distinguish users and organizations using letters.


Changes to data.world Open Data Community

data.world Open Data Community profiles, datasets, and projects now behind a login wall: To better control the privacy of our users and to protect the effectiveness of the content on our active Open Data Community, we have made the decision to restrict access to profiles, datasets, and projects to account holders. It is always free to join our open data community. 

data.world Open Data Community commenting restrictions: Commenting is now restricted to contributors on datasets and projects in the data.world Open Data Community. Organizations can enable comments on their public datasets through organization settings. This feature is not available or enforced for Enterprise customers on private instance or VPC deployments.


Preview our latest navigation changes

The data.world enterprise catalog provides a 360-degree view of your data resources and semantics. Available in preview today, we've added quick section navigation links on the overview tab of your collection, resource, and glossary pages and separated related resources into their own, sortable, searchable views.

With this upcoming change, users will be able to scan the available metadata and get the insights needed to make decisions efficiently and effectively.


These views offer a highly organized and condensed presentation of the metadata, making it easier to quickly access and understand the information.



We'd love your feedback and thoughts before we roll them into the main UI. To see the views, please click "Turn on preview" in the banner at the top of any collection, resource, or glossary term page. To leave your feedback, please visit the help section (question mark in the lower left of the global navigation) and leave a suggestion via the support link.


You can read more about these changes in our documentation portal. If you're interested in learning more about our data discovery solutions, please visit our website and book a demo or reach out to your customer service representative today. We look forward to helping you and your teams discover your data!

New navigation improvements ready for preview

We are excited to begin rolling out for preview some exciting ENHANCEMENTS to the user experience on our collections, metadata resources, and glossary pages.

Today, you'll notice a new PREVIEW button on these pages. Click on it to get a preview of some of our latest features.

feature 

  • Metadata sections navigation - a table-of-contents-like side menu for easy access to your metadata sections, related resources, etc.
  • Collection hierarchy widget - a navigable tree of your data taxonomy.


COMING SOON 

  • Relationships UX improvements  - a more information-rich view of the related resources, improved edit/suggest flows.
  • Custom icons - dress your custom types in attire that makes sense to you and your catalog users.

To find out more about these new navigation features, please visit our documentation portal.

Updated documentation portal

We are excited to announce upcoming improvements to our help docs portal, including streamlining and consolidating our product documentation, cleaning up deprecated articles and links, and improving the navigation and search experience!

With the site improvements, some of your bookmarks or saved links may no longer work. We have diligently mapped deprecated URLs to the new pags to keep the impact on our users as low as possible. 

If you encounter a link that no longer works, the easiest way to find what you need is to go to the docs portal landing page and search for the information. Please contact support with any questions of issues.

Preview the simplified navigation and Discover page

This May, data.world’s navigation is getting more powerful and even easier to use – we’re adding a Discover button! Preview the changes today to discover all the resources you have access to with a single click.

Screenshot of data.world Discover page. The familiar search page shows a list of all the resources you have access to, available via the Discover left hand navigation. In this screenshot, Discover is highlighted in purple and a Preview banner is at the top of the page.

  • Discover will be added to the navigation to show you all the resources that you can find.
  • Data, Analysis, and Glossary will be removed from the navigation.


The Discover experience transforms the empty search page into an actionable entry point for all of your resources—whether organization-owned or in the open community.

Screenshot of data.world Discover page on the All tab. After a prompt to "Search for bookmarks, resources, or people" there is a list of your recently viewed resources. In this screenshot, Discover is highlighted in purple and a Preview banner is at the top of the page.

Switch to the All tab to reference up to 25 of your recently viewed resources and jump back in where you left off.


Look for the Preview banner to try out the new navigation and Discover experience today. Review the updated documentation or share your feedback.

Coming Soon: Improved Home Page

In March, a new and improved home page is coming to data.world.

Screenshot of the data.world new home page, featuring sections to quickly navigate to your organizations, recent activity, pending alerts, and more.

The new home page will feature personalized views to help you quickly access resources and view alerts. It will also give you a reliable home base to explore and come back to.

Screenshot of the data.world new home page, featuring sections for a new user with Getting Started tips and links to helpful resources.

To preview the new home page experience, look for the coming soon banner when you log into data.world.

Screenshot of the data.world current home page, with a feed of recent activity in the center and quick links on the side.

Coming Soon: Core Navigation Changes

In January, the updated Organization Profile Page will replace existing organization-specific landing pages with a consolidated experience.

Organization members and admins will be able to search, create, and manage resources, collections, members, connections, and more in one place.

Informational banner which reads "Updates to the organization page are coming soon. Check out what this page will look like in the new version."

Landing pages that will be replaced with the new Organization Profile Page will feature a banner with a link to preview the new experience.


The organization landing page will redirect to the new Organization Profile Page.

Now
Coming Soon
The current organization landing page, featuring large tiles with different resource types.
The new organization profile page, with multiple tabs of information and more details.


Organization-specific library and list views will redirect to the Resources tab on the new Organization Profile page, with more advanced filtering options.

Now
Coming Soon
The current organization data catalog, with simple rows of tables and minimal filter options.
The new organization resources tab, with more information on each table and advanced filtering options.


For enterprise organizations, the Glossary landing page will redirect to a new Glossary tab on the Organization page, also with improved filtering options.

NowComing Soon
Show Previous EntriesShow Previous Entries