⚡️ data.world March Product Launch: AI, Usability, and Lineage, Supercharged

This month’s release is packed with powerful updates designed to unlock more from your catalog. From the launch of Archie Chat, your AI-powered assistant for instant answers, to a new design and functionality for our Resource pages, we’re bringing faster workflows and deeper insights to every user. We’ve also rolled out new and enhanced collectors—from Amazon DMS to AWS Glue, ADF, and beyond—giving data teams the visibility they need to govern modern, complex data stacks. And coming soon: new Governance Dashboards that bring actionable insights, helping you measure engagement and optimize your data strategy.

🔥 What’s New?

💬 Archie Chat – Your AI-Powered Data Assistant is Live

Say hello to Archie Chat, the intelligent assistant that turns your data catalog into an interactive knowledge hub. Now available in Public Preview, Archie delivers instant, context-aware answers. Archie Chat answers questions and helps you navigate the catalog instantly, using context from your own data resources, glossary terms, and our product documentation to help you get answers faster.

Data consumers can use natural language to ask questions like “Tell me what you know about marketing campaign attribution” or “What does ACV mean?”, and get quick, reliable answers from your catalog. Stewards and admins can ask things like, “What’s the fastest way to add lots of relationships at once?” and get immediate, actionable guidance—no digging through docs required. 

Quick Tip: Archie is now in Public Preview and free to try. Reach out to your Customer Success representative to get started today!


 Resource Page Redesign – Usability Reimagined

The Resource Page redesign and new functionality is here. We've overhauled the resource page experience, creating a more intuitive layout, providing edit history, and other new powerful features that put speed and insight at your fingertips:

  • Streamlined layout and design for faster understanding and navigation
  • Rich activity feed that capture your data's story
  • Inline editing capabilities
  • New preview options
  • Enhanced filtering options

The new Activity feed is now generally available and you can read more about it here. The redesigned Resource Overview page and tabs are in Public Preview and users can read more about that here. You can read more about it here. One click is all it takes to transform your workflow.


🔄 Amazon Data Migration Service Collector – Lineage Through the Cloud Migration Journey

Our new Amazon DMS Collector brings visibility into your cloud migration pipelines by capturing lineage from AWS Data Migration Service (DMS) jobs. Whether you’re migrating on-prem databases to the cloud or using DMS for change data capture (CDC), this collector gives you line-of-sight traceability into how data moves across systems. Now, you can govern these migration flows with confidence, maintain compliance, and ensure your catalog reflects your evolving data architecture.

💫 What’s Improved?

🧬 AWS Glue Collector – Deeper Metadata, Stronger Lineage

We’ve supercharged the AWS Glue Collector to bring in even richer metadata and more complete lineage. The collector now captures detailed metadata from Glue Data Catalog tables—including file types, sizes, serializers, and deserializers—as well as improved job metadata. Even better, it now builds lineage to S3 objects, providing full traceability back to the ultimate source. This enhancement empowers more robust impact analysis and stronger governance across your AWS data landscape.

🔗 Azure Data Factory Collector – Smarter Lineage for Parameterized Pipelines

The ADF Collector just got an upgrade: it now captures lineage from parameterized datasets, surfacing both upstream and downstream references in complex, dynamic pipelines. This gives data teams clearer visibility into data movement across ADF workflows—especially in cases where pipeline logic changes based on parameters—resulting in more accurate lineage, improved trust, and tighter control over your data flows.

🔧 Public API Enhancements – Create & Manage Discussions Programmatically

Now you can create and manage discussions directly through the public API, making it easier to capture critical conversations wherever they happen. Whether integrating with ticketing systems, BI tools, or custom workflows, this enhancement turns your catalog into a true collaboration hub—bridging the gap between data context and team communication. By embedding conversations directly into the metadata layer, you unlock smarter decision-making and greater operational efficiency.


🔮 What’s Next?

📊 Coming Soon - New Governance Dashboards

Later this week, we’re launching a new suite of Governance Dashboards in the Admin Portal—your command center for understanding how your data catalog is being used. These interactive, visual dashboards give instance administrators the tools they need to monitor platform activity, analyze user engagement, and uncover trends across your data.world environment. From search behavior to resource metrics and usage, and daily active users, you now have the clarity to spot what’s working, what’s being overlooked, and where to invest next.

Built with ready-to-use views and flexible filters, these dashboards empower teams to make faster, smarter decisions about content strategy, user enablement, and platform optimization. This feature is exclusively available to users with Instance Administrator permissions in private instances and single-tenant deployments.


🔄 Coming Soon – New Marquez / OpenLineage Collector

Our upcoming Marquez Collector brings support for the OpenLineage standard, enabling teams to capture lineage from custom pipelines built with Python, PySpark, SQL, and more. By pulling lineage directly from the Marquez metadata store, this collector helps you document and govern bespoke or proprietary data pipelines—even when no native connector exists.

🏢 Coming Soon – New SAP HANA Collector

We’re expanding our enterprise coverage with a new SAP HANA Collector, designed to harvest standard database metadata—and even some lineage—from SAP’s powerful data warehousing platform. This collector will help customers bring critical SAP assets into the catalog for better visibility, governance, and reuse.

Ready to unlock more from your data? Contact your Customer Success representative to explore these groundbreaking updates and unlock the full potential of data.world, the simpler and smarter catalog.


🚀 Smarter Data Management: New AI-Powered Tools + UX Upgrades!

This month, we’re rolling out enhancements designed to streamline data management, improve usability, and boost productivity. From AI-powered bulk descriptions to a smoother access request experience—and two exciting launches on the horizon—there’s a lot to explore!

🔥 What’s New?

Archie Bulk Describe – Now in Private Preview (Beta)

Say goodbye to manual descriptions and hello to AI-powered enrichment at scale! Archie Bulk Describe lets you generate high-quality descriptions for multiple resources in just a few clicks, making it an essential tool for curators and data product owners. With this feature, you can quickly generate AI-suggested descriptions, ensuring your catalog remains accurate and insightful—without the tedious manual work.

If you’re not yet part of the Archie Private Preview, reach out to your Customer Success Manager today! Need help getting started? Check out our product documentation for details on how to enable Archie Private Preview features and how to run Archie Bulk Describe in your catalog.


💫 What’s Improved?

A Smoother, Smarter Way to Request Access

We’ve made a series of improvements to access request workflows, making them faster and more intuitive. With auto-filled titles and smart auto-skipping, requesting access to resources—or creating new ones—has never been easier. These updates reduce friction, minimize manual steps, and ensure a seamless, consumer-grade experience for enterprise users.

This is part of our ongoing commitment to refining the user experience to feel simple and intuitive.

🔮 What’s Next?

✨ Resource Page Redesign – A New Era of Usability

We’ve completely reimagined the Resource Details Page to make navigating and understanding data assets easier than ever. With an intuitive new layout, edit history, improved summaries and layout, finding the information you need will feel effortless. We’ve also made everything quicker—quicker access to lineage, quicker metadata edits with inline editing, and quicker ways to explore related data with more configuration, filtering and sorting options. The result? A night-and-day improvement in user experience that will feel simple yet smart.

Very soon, this redesign will be available in Public Preview, giving end-users the opportunity to opt-in and experience the new workflow before full rollout.

💬 Meet Archie Chat – Your AI-Powered Data Assistant

Navigating your data catalog just got smarter. Archie Chat (soon in Public Preview Beta) brings an AI-powered, business-context-aware chat experience to your platform, answering questions instantly, reducing friction, and improving adoption. Instead of digging through documentation, catalog admins can now ask “How do I bulk edit metadata?” and get an immediate, actionable response. Data consumers can type natural language questions like “Is ‘Occupation’ a sensitive data type?” or “What does TMA stand for?” and get more precise answers sourced from the context of your data catalog.

Archie Chat is launching in beta, free for a limited-time, so be among the first of our customers to explore this game-changing feature! If you aren't already an Archie-enabled customer, reach out to your customer service manager to find out how you can be ready to try Archie Chat when it releases.

data.world January Product Launch

January Releases: Expanding Connectivity, Simplifying Setup, and Enhancing Lineage Exploration

This month, we’re delivering powerful updates to enhance metadata collection, streamline setup, and improve lineage exploration. We’re expanding our connectivity with new on-premise collectors for Apache Airflow and Qlik Talend, enabling deeper metadata harvesting and lineage tracking for critical ETL and workflow automation tools. People field configuration is now more intuitive, allowing user accounts to be dynamically selected for ownership and stewardship, reducing manual setup and improving governance. Finally, our new public API endpoint for lineage querying makes it easier for customers to customize lineage exploration with flexible queries and standardized outputs. These updates help teams work smarter, get to insights faster, and build on top of our platform with greater ease. 🚀



Support for Airflow and Talend on-premise Collection

By the end of January, the data.world collector integrations will include new collectors for Apache Airflow and Qlik Talend Data Integration (the on-premise version of Qlik’s Talend product). Airflow is an open source workflow automation tool that many enterprises use to schedule and manage data engineering and analytical tasks. The new collector will harvest metadata about these workflows–called Directed Acyclic Graphs–and the tasks contained within them. Qlik Talend is a data integration product that facilitates extract, transform, and load (ETL) processes; the new collector will identify sources and targets of these processes and harvest lineage relationships representing the flow of data between them.

These collectors will initially be available as on-premise collectors only, but will also be available as cloud collectors in early February.

Streamlined Setup of People Fields

Configuring ownership and stewardship just got easier! In addition to supporting people as collected resources, customers can now utilize their user accounts to populate people fields. This update streamlines setup, providing an intuitive approach that helps teams quickly setup, ensuring seamless attribution and governance from the start and helping end-users connect with the right people. You can read the documentation for this feature here.

Screenshot of people field search and select


Resource Lineage Support in Public API

We’re making it easier than ever to programmatically explore lineage with our new Catalog Lineage Public API endpoint! This update provides flexible query options, allowing customers to tailor lineage exploration to their needs to build lineage based tooling, automations, and integrations. This is a win for all lineage customers looking for deeper insights and more intuitive ways to navigate their data relationships. 

UX Changes Coming Soon

Activity feed for Resources

Soon, we’ll be introducing a new Activity tab on Resources, Glossary and Collection objects that show edit history and other activity in the UI. This will make it easier for users to quickly understand how the resources have been updated and changed. Announcement of the release will soon follow.

Resource page redesign

Along with a new activity feed, we'll be introducing a newly designed details page that offers more intuitive navigation, better use of whitespace, configurable relationship tabs, inline editing, and other features that will make the resources both easier to understand and scan but also easier to enrich and edit. Announcement of this release will follow in this quarter.

data.world October Product Launch


The October release of data.world brings a wide variety of new capabilities and improvements across the platform – read on to learn more about the GA of Databricks Publisher, new collectors for MongoDB and Alteryx, Okta support in SCIM, the GA of the improved search experience, and more!

Additionally, we highlight some changes made to the data.world Open Data Community to improve privacy and preserve the quality of open data and the user experience.


Databricks Publisher Premium Automation

We’re excited to announce the GA launch of the Databricks Publisher Premium Automation! This new feature allows users to seamlessly publish metadata from data.world to Databricks, simplifying the process of managing and synchronizing key data attributes. Specifically, users can now automatically publish table and column descriptions from data.world to Databricks and push selected metadata attributes as Databricks tags. Whether you prefer manual updates or fully automated syncing, this automation ensures that metadata remains consistent between platforms, reducing manual effort and improving data integrity. With data.world now acting as the source of truth, your metadata stays up-to-date across systems effortlessly.

For more information, see the product documentation.


New MongoDB and Alteryx Collectors

This month, we’re excited to announce new MongoDB and Alteryx Collectors, both available in Private Preview. If you’re interested in early access to either of these new collectors, please reach out to your Customer Service Director.

MongoDB Collector

The MongoDB Collector catalogs metadata from MongoDB, helping maintain a comprehensive inventory of MongoDB assets, facilitating better governance, discovery, and utilization of data across your organization.

This collector harvests metadata for MongoDB databases, collections, views, indexes and more.

An example collection from MongoDB

Alteryx Collector

The Alteryx Collector catalogs metadata from Alteryx, helping maintain a comprehensive inventory of Alteryx assets, facilitating better governance, discovery, and utilization of data across your organization.

This collector harvests metadata for workflows, workflow nodes, workflow jobs, connections, schedules and more.

An example collection from Alteryx


Improved lineage for SQL Server

The SQL Server Collector now collects additional lineage relationships not previously captured through SQL parsing using built-in SQL Server functions that describe relationships between objects (such as, in some cases, the columns and tables referenced by views or stored procedures).

For more information and detail, see the description of lineage collected by the SQL Server Collector in the product documentation.


Support for Okta in SCIM

The active Private Preview of SCIM (System for Cross-domain Identity Management) now additionally supports Okta (in addition to Microsoft Entra ID), allowing customers who use Okta as their enterprise identity provider to have automated management of users and groups in data.world.

If you are interested in being part of the SCIM Private Preview, or just want to learn more, please reach out to your Customer Service Director.


Webhook Authorization enhancement

Webhooks now support an optional authorization key parameter to help consuming applications verify the origin and permissions for an incoming webhook. Learn more


Collection Details in Technical Reference

By popular request, the Technical Reference page for catalog resources now includes details about the collections the resource belongs to. Learn more


Relative Time Advanced Search Syntax

Create powerful saved searches for resources by updated and created dates using three new relative time options:

  • `created:today` 
  • `updated:yesterday`
  • `created:{last 30 days}`

See the product documentation on creating advanced searches to learn more.


UX Improvements

Adding the new search experience to Organizations: Our new search experience has been a big hit with users. It’s faster, cleaner, and provides more advanced features in the UI. We've fully retired the classic experience and brought the new search features to the Resources, Glossary and Collection landing pages.


Coming Soon! Advanced relationship editing: We're adding new improvements that make it easier to find the right resources and add or remove more than one relationship at a time.


Coming Soon! More look-and-feel updates: Next up in our work to update and modernize our UI, we'll be swapping the old default avatars to a newer color palette and default avatar design that utilizes letters. This change will also provide a more accessible experience as it gives users the ability to distinguish users and organizations using letters.


Changes to data.world Open Data Community

data.world Open Data Community profiles, datasets, and projects now behind a login wall: To better control the privacy of our users and to protect the effectiveness of the content on our active Open Data Community, we have made the decision to restrict access to profiles, datasets, and projects to account holders. It is always free to join our open data community. 

data.world Open Data Community commenting restrictions: Commenting is now restricted to contributors on datasets and projects in the data.world Open Data Community. Organizations can enable comments on their public datasets through organization settings. This feature is not available or enforced for Enterprise customers on private instance or VPC deployments.


Announcing Enhanced Email Notification Options

Visit your notifications settings page to customize the transactional emails you receive from data.world.

You can choose to:

  • Turn off all non-essential email communications
  • Unsubscribe from a category of email notifications
  • Customize which digests you receive
  • Customize dataset and project activity notifications

Learn more

Advanced Search now in the global search bar

We are happy to announce the release of our latest feature: Advanced Search within the global search bar. 


In addition to adding Advanced Search to our global search bar, we've improved it by including the ability to pre-scope your search to a certain Organization. If you are an Enterprise customer with multiple organizations, you can now pre-select the organization to which you want to limit your search.

But wait, there's more!

For customers who have hierarchical collections, you'll want to check out our beta release of the collection picker in the Advanced Search modal. This gives users the ability to quickly scope their search to a branch of collections in the domain hierarchy. Be sure to turn on the beta feature flag in the advanced settings to see this feature.
 


The new advanced search features in the global search bar are designed to provide a more intuitive and powerful data discovery experience from login. With these new capabilities, you'll be able to find the data and insights you need, faster than ever before.

Refer to the advanced search documentation to learn more.

We're committed to continually improving our platform and are eager to hear your feedback. Please feel free to reach out via our support portal to share your thoughts, experiences, and suggestions regarding our new Advanced Search features. Visit our website to learn more about Data Discovery or book a demo. Together, let's unlock the true potential of data-driven decision-making.

Preview our latest navigation changes

The data.world enterprise catalog provides a 360-degree view of your data resources and semantics. Available in preview today, we've added quick section navigation links on the overview tab of your collection, resource, and glossary pages and separated related resources into their own, sortable, searchable views.

With this upcoming change, users will be able to scan the available metadata and get the insights needed to make decisions efficiently and effectively.


These views offer a highly organized and condensed presentation of the metadata, making it easier to quickly access and understand the information.



We'd love your feedback and thoughts before we roll them into the main UI. To see the views, please click "Turn on preview" in the banner at the top of any collection, resource, or glossary term page. To leave your feedback, please visit the help section (question mark in the lower left of the global navigation) and leave a suggestion via the support link.


You can read more about these changes in our documentation portal. If you're interested in learning more about our data discovery solutions, please visit our website and book a demo or reach out to your customer service representative today. We look forward to helping you and your teams discover your data!

📣 Announcing our latest search enhancements

Over the past several weeks, we've introduced a set of search IMPROVEMENTS we want to share:

  1. Partial title search. Allows users to search for resources by entering just a portion of the title (3+ characters), making it easier to find the right data.
  2. More related metadata search. From the context of a resource page, this improvement allows users the ability to search and filter related resources based on all the searchable metadata fields of a resource - including custom fields - which means it is now easier to filter large lists.
  3. More camel case support. We have extended camel case support to our relationship filters. This makes it easier to find resources that have complex names that combine uppercase and lowercase letters.
  4. Updated column search cards. This update improves the column search experience by providing users with additional information about columns such as database and datatype, making it easier to understand what each resource is without clicking through and back between the detail pages.

At data.world, our goal is to help organizations unlock the full potential of their data. We're constantly improving search in order to better serve our customers looking to take data management and discovery to the next level.

If you're interested in learning more about our data discovery solutions, please visit our website and book a demo. You can also read more about our search features on the docs portal. We look forward to helping you manage your data and transform the way your users discover it!

New navigation improvements ready for preview

We are excited to begin rolling out for preview some exciting ENHANCEMENTS to the user experience on our collections, metadata resources, and glossary pages.

Today, you'll notice a new PREVIEW button on these pages. Click on it to get a preview of some of our latest features.

feature 

  • Metadata sections navigation - a table-of-contents-like side menu for easy access to your metadata sections, related resources, etc.
  • Collection hierarchy widget - a navigable tree of your data taxonomy.


COMING SOON 

  • Relationships UX improvements  - a more information-rich view of the related resources, improved edit/suggest flows.
  • Custom icons - dress your custom types in attire that makes sense to you and your catalog users.

To find out more about these new navigation features, please visit our documentation portal.

Get a quick summary of the access your members have

We have some awesome news! Our member access summary page is now live!

The member access summary makes it much easier for members to understand the various levels of access they have in an organization. As we work to make our access levels more flexible and granular, this page also gives our org admins the ability to quickly audit the level of access their members have and take action.


Be the first to learn about the member access summary page in our docs portal.

Show Previous EntriesShow Previous Entries