[B! dataManagement][metadataManagement] manboubird's bookmarks

manboubird id:manboubird

manboubird's bookmarks about dataManagement and metadataManagement (24)

The Rise of the Metadata Lake
manboubird 2021/06/14
metadata

metadataManagement

dataManagement
Link
Data Catalogs Are Dead; Long Live Data Discovery
Image courtesy of Andrey_Kuzmin on Shutterstock. As companies increasingly leverage data to power digital products, drive decision making, and fuel innovation, understanding the health and reliability of these most critical assets is fundamental. For decades, organizations have relied on data catalogs to power data governance. But is that enough? Debashis Saha, VP of Engineering at AppZen, formerly
manboubird 2021/01/19
dataCatalog

dataManagement

metadataManagement
Link
The Government Data Quality Framework
manboubird 2020/12/06
dama

dataQuality

dataManagement

uk

guideline

metadataManagement

metric
Link
https://hpi.de/fileadmin/user_upload/fachgebiete/naumann/publications/2017/SIGMOD_2017_Tutorial_Data_Profiling.pdf
- 1 user
- hpi.de
- Learning
manboubird 2020/11/28
slide

sigmod

dataProfiling

tutorial

paper

dataManagement

metadataManagement

dataQuality
Link
egeria
manboubird 2020/11/23
egeria

odpi

openMetadataAndGovernance

dataGovernance

dataManagement

standard

metadata

metadataManagement
Link
GitHub - odpi/egeria: Egeria core
manboubird 2020/11/23
egeria

odpi

openMetadataAndGovernance

dataGovernance

dataManagement

standard

metadata

metadataManagement
Link
Business Glossary support in Google Data Catalog
manboubird 2020/11/19
dataCatalog

googleCloudPlatform

dataDictionary

dataManagement

dataGovernance

businessGlossary

metadataManagement

dataLinage
Link
Joshua Shinavier – The Knowledge Graph Conference
manboubird 2020/07/19
uber

knowledgeGraph

video

metadataManagement

dataManagement
Link
Building an Enterprise Knowledge Graph at Uber: Lessons from Reality
manboubird 2020/07/19
uber

knowledgeGraph

video

metadataManagement

dataManagement
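Link
Building a Mechanism for Continuously Managing Data Platform Metadata - Hatena Developer Blog
Hello. I'm id:syou6162, a CRE (Customer Reliability Engineer) on the Mackerel team. To advance customer success, the CRE team has recently been putting more effort into data analysis (Reference 1, Reference 2). Accurate data analysis requires accurate knowledge about the data. In this post, I describe a mechanism for continuously managing the metadata that supports more accurate data analysis. Knowledge about data: metadata. Accurate data analysis requires knowledge about the data itself (= metadata). For example, data analysis tasks at Mackerel often require knowledge such as the following: what this table / column is for; how it differs from similar columns; differences in aggregation conditions, and so on; what values the data can take SELECT column, COU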
manboubird 2020/04/16
dataManagement

hatena

dataCatalog

metadata

metadataManagement
Link
How We Improved Data Discovery for Data Scientists at Spotify - Spotify Engineering
At Spotify, we believe strongly in data-informed decision making. Whether we’re considering a big shift in our product strategy or we’re making a relatively quick decision about which track to add to one of our editorially-programmed playlists, data provides a foundation for sound decision making. An insight is a conclusion drawn from d
manboubird 2020/03/04
spotify

dataDiscovery

dataCatalog

metadata

metadataManagement
Link
How LinkedIn, Uber, Lyft, Airbnb and Netflix are Solving Data Management and Discovery for Machine Learning Solutions
When it comes to machine learning, data is certainly the new oil. The processes for managing the lifecycle of datasets are some of the most challenging elements of large scale machine learning solutions. Data ingestion, indexing, search, annotation, discovery are some of the aspects r
manboubird 2020/03/04
dataManagement

dataCatalog

metadata

metadataManagement

linkedIn

uber

Airbnb

netflix
Link
Open Sourcing WhereHows: A Data Discovery and Lineage Portal
In modern data-driven businesses, the complexity that arises from fast-paced analytics, data mining and ETL processes makes metadata increasingly important. In this blog post, we share our own journey and a new open source effort that aims to boost productivity and data provenance. WhereHows, a project of the LinkedIn Data team, works by creating a central repository and portal for the processes,
manboubird 2020/03/04
linkedIn

dataDiscovery

dataManagement

metadata

metadataManagement

dataCatalog

whereHows
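Link
Data Technology Special: Metadata Management Efforts at Yahoo! JAPAN
Presentation materials from Hikalab, March 21, 2017. [Hika☆Lab] Making Full Use of SparkSQL from Scratch! From installing Spark, to an overview of SparkSQL, to know-how for using it in practice. https://atnd.org/events/85919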
manboubird 2020/02/15
yahoo

metadataManagement

slide

dataManagement

developerSummit
Link
7 Things Every Data Worker Should Know About Metadata
manboubird 2019/12/06
metadataManagement

dataManagement

dataDiscovery
Link
Google Dataset Search: Building a search engine for datasets in an open Web ecosystem
Philosophy: We strive to create an environment conducive to many different types of research across many different time scales and levels of risk.
manboubird 2019/05/17
google

paper

search

dataManagement

metadataManagement

dataLake

webConf
Link
Marquez: A Metadata Service for Data Abstraction, Data Lineage, and Event-based Triggers
manboubird 2019/04/27
weWork

metadataManagement

dataManagement

marquez

visualization

slide
Link
Overview
Collect, aggregate, and visualize a data ecosystem's metadata. Marquez is an open source metadata service for the collection, aggregation, and visualization of a data ecosystem’s metadata. It maintains the provenance of how datasets are consumed and produced, provides global visibility into job runtime and frequency of dataset access, centralization of da
manboubird 2019/04/27
weWork

metadataManagement

dataManagement

marquez

visualization
Link
Databook: Turning Big Data into Knowledge with Metadata at Uber
From driver and rider locations and destinations, to restaurant orders and payment transactions, every interaction on Uber’s transportation platform is driven by data. Data powers Uber’s global marketplace, enabling more reliable and seamless user experiences across our
manboubird 2019/02/10
databook

dataDiscovery

uber

dataLineage

dataManagement

metadataManagement

metadata

schemaManagement
Link
Metacat: Making Big Data Discoverable and Meaningful at Netflix
by Ajoy Majumdar, Zhen Li. Most large companies have numerous data sources with different data formats and large data volumes. These data stores are accessed and analyzed by many people throughout the enterprise. At Netflix, our data warehouse consists of a large number of data sets stored in Amazon S3 (via Hive), Druid, Elasticsearch, Redshift, Snowflake and MySql. Our platform supports Spark, Pre
manboubird 2018/06/18
metacat

netflix

dataManagement

schemaManagement

metadataManagement

dataLineage

metadata
Link