BACK TO ALL POSTS

DataHub Community Updates: March’23 Rundown

Metadata

Data Engineering

Open Source

Data

Project Updates

Maggie Hays

Apr 7, 2023

Metadata

Data Engineering

Open Source

Data

Project Updates

Hello, hello, DataHub Enthusiasts!

Ready to hear what’s been the amazing DataHub Community has been up to in March? Let’s get straight to it!

The DataHub Community Continues to Grow and Thrive

We added over 350 members in March, and our Slack community grew to over 6700+ members with 1000 active users every week!

DataHub Community Snapshot - March 2023

If you’re new to DataHub, I can’t recommend our Slack channel enough — you can be sure that this amazing community always has your back!

Coming Soon: DataHub Community Council (DCC)

I’ll say it again: the DataHub Community is growing FAST 😅😅 Let’s team up to keep up!

Soon, we’ll be rolling out the DataHub Community Council, a forum for DataHub Enthusiasts to collaborate closely with the Core DataHub Team.

As a member, you can

  • Participate in private, expert panel discussions with the Core DataHub Team
  • Get early access to future open-source releases
  • Inform and guide the DataHub strategy and roadmap and influence our quarterly roadmap prioritization via a DCC forum
  • Get direct access to collaborate with other DCC members

…AND

  • Own and show off some Custom DCC Swag! 😎

Formal annoucement to come — watch this space for more updates!

Roadmap Updates and Release Highlights

We had some BIG deliverables to reach in Q1'23 — here’s where we landed by the end of March:

We recently released DataHub v0.10.1 , with a continued focus on improving user and developer experience as well as metadata ingestion. Here’s what this version brings with it:

Here are a few exciting highlights in v0.10.1:

Snowflake Tag and Term Propagation

We’re introducing a DataHub Action that will allow you to propagate tags and terms between entities that are connected via lineage — all the way into downstream warehouses like Snowflake.

Given Snowflake’s own comprehensive tagging capabilities, this improvement will enable you to attach tags to Snowflake tables on DataHub, and propagate those tags into Snowflake.

This way, you get all the benefits of Snowflake’s data management capabilities — by ensuring that these tags show up automatically from DataHub.

Improved Search

We’ve done a ton of work to make our search functionality more reliable, detailed, responsive, and intuitive — across a large number of entities.

With this relase, DataHub Search is going to feel much snappier while being much more robust at a high scale.

Ingestion Improvements

We’ve built improvements around lineage extraction, entity descriptions, memory usage improvements, etc, with a focus on BigQuery & Power BI.

Community Case Study: Jumio’s Journey with DataHub

During the March Town Hall, Ray Suliteanu from the Jumio (https://jumio.com/) team shared how they are using DataHub for easier discovery, compliance, and improved productivity for complex data management — with DataHub allowing them to work with distributed datasets, bringing in data from all over with its push-pull model (as opposed to having a central entity fetching all the data).

Check out this video to know how DataHub is helping Jumio with improved access control for governance and better search — beyond datasets, extending to models, features, jobs, and other metadata.

How Jumio is leveraging DataHub for data discovery, governance, and improved data management and productivity

Product Updates: Improving and Simplifying DataHub

DataHub 201: Data Debugging

We’re working on a few features that will enable not just layer-to-layer debugging on DataHub, but also aid in end-to-end debugging of data issues and flagging data quality issues across the lineage graph.

Here’s John telling you everything you need to know about how data debugging in DataHub just got a LOT easier.

Sneak Peek: Upcoming Improvements to Search

During the March Town Hall, we gave a preview of upcoming changes to the search experince. We’re bringing some exciting improvements to reduce the number of steps required to search for, and ultimately find, the data that matters most.

Check out this video where Brittanie Jakubowich from the Acryl Data Team breaks down what changes are to come:

Getting Started with DataHub’s APIs

We hear time & time again from our Community Members that DataHub’s APIs are gamechangers for how they manage metadata within their organization. We want to make sure that a wide range of folks understand the power of our APIs, and have a clear set of examples of how to go about using them.

Hyejin Yoon (Developer Relations Engineer, DataHub) has been putting in some serious work to ramp up our API guides. Check out this video to learn all about them:

DataHub Integrations: Documentation Support

We now have a dedicated go-to page for all DataHub integrations. This will help you understand — at a glance — all the different systems that DataHub integrates with. You can search across connection types (push-pull), features, and platform types.

One-stop-shop view of DataHub’s supported integrations

Community Contributions and Shoutouts

This is my absolute favorite part of my job — showing well-deserved appreciation to folks in the DataHub Community that are going above & beyond to contribute back to the project.

A MASSIVE shoutout to our March Project contributors — we had 20 first-time contributors, which is simply outstanding!

DataHub Project Contributors - March 2023

In March, we had _a ton_ of Community Members step up to help others out in Slack; let’s show them some love! HUGE shoutout to these folks:

Supporting the DataHub Community

Write for the DataHub Community Blog!

If you have something to share with the community — about how you’re using DataHub, challenges you’re solving, data governance, and other exciting data discovery projects, why don’t you consider contributing to the DataHub Blog Community Program ?

For inspiration, check out this month’s entry by community member Ada Draginda who shares how Notion is using DataHub to automate propagation between Data Hub and dbt.

Check it out: Automating Propagations with DataHub and DataHub-Tools .

I continue to be mindblown (and thrilled!) by the velocity of this amazing community and can’t wait to see what the next quarter holds for us!

Metadata

Data Engineering

Open Source

Data

Project Updates

NEXT UP

Acryl Cloud for ML and AI Practitioners

When organizations struggle to operationalize ML or AI solutions, the root causes are usually data-related. ML and AI teams can’t find the data they need to define use cases, engineer features, or train their models. When they can find it, they can’t always use it—because they don’t know what it is, where it came from, who created it, when, or for what purpose. Lacking context, any dataset is a black box. Discover why a modern data catalog and metadata platform is a foundational element of any ML or AI platform.

Harshal Sheth

2024-01-31

Making Data Relevant Again

Increasingly, decision-makers and stakeholders just don’t trust their data and analytics—usually because what they’re seeing is out-of-date, incomplete, inconsistent, and sometimes flat-out wrong.

Swaroop Jagadish

2024-01-29

Acryl Cloud For Data Leaders and Practitioners

Data work is a true team sport. Each and every data asset is the product of a clear distribution of labor, with people in a diversity of roles—including data practitioners, software developers, architects, governance authorities, and business domain experts—working collaboratively.

Swaroop Jagadish

2023-12-11

Get started with Acryl today.
Acryl Data delivers an easy to consume DataHub platform for the enterprise
See it in action
Acryl Data Logo
Acryl DataHub
Acryl Observe
TermsPrivacySecurity
© 2024 Acryl Data