[B! dataManagement][dataPlatform] manboubirdのブックマーク

manboubird id:manboubird

dataManagementとdataPlatformに関するmanboubirdのブックマーク (21)

GOのデータ・AIを活用する「組織」を30分で紹介
GO TechTalk #25 で発表した資料です。 ■ YouTube https://www.youtube.com/live/lH0z49oHRWI?feature=shared&t=98 ■ connpass https://jtx.connpass.com/event/306325/
manboubird 2024/03/03
dataPlatform

dataManagement

slide

goInc

mlOps

kpi
リンク
Data Mesh — A Data Movement and Processing Platform @ Netflix
By Bo Lei, Guilherme Pires, James Shao, Kasturi Chatterjee, Sujay Jain, Vlad Sydorenko BackgroundRealtime processing techno logies (A.K.A stream processing) is one of the key factors that enable Netflix to maintain its leading position in the competition of entertaining our users. Our previous generation of streaming pipeline solution Keystone has a proven track record of serving multiple of our ke
manboubird 2022/10/09
dataPlatform

dataMesh

dataManagement
リンク
Introduction to Data Mesh | A paradigm shift in managing analytical data - KGC 2021 - Knowledge Graph Conference
manboubird 2022/06/17
Introduction to Data Mesh | A paradigm shift in managing analytical data

dataMesh

dataManagement

dataPlatform

video
リンク
LINEの大規模なData PlatformにData Lineageを導入した話
LINE株式会社は、2023年10月1日にLINEヤフー株式会社になりました。LINEヤフー株式会社の新しいブログはこちらです。 LINEヤフー Tech Blog はじめにこんにちは、Data Platform室IU Devチームの島村です。 Data Platform室では、約400ペタバイトのデータ分析基盤を運用しております。このData Platformは、「Information Universe」(以下、IU) と呼ばれており、LINEの様々なアプリケーションから生成されるデータをLINE社員が活用できるように、データの収集、処理、分析、可視化を提供しています。私が所属するIU Devチームでは、「IU Web」を開発しています　IU Webは、IUのデータを安全にかつ効率的に活用できるようにするData Catalog機能を提供しており、LINEグループのあらゆるサービスか
manboubird 2022/06/05
dataLineage

dataManagement

dataPlatform
リンク
エムスリーのデータ基盤を支える設計パターン - エムスリーテックブログ
こんにちは、エムスリーエンジニアリンググループの鳥山 (@to_lz1)です。ソフトウェアエンジニアとして製薬企業向けプラットフォームチーム / 電子カルテチームを兼任しています。ソフトウェアエンジニアという肩書きではありますが、私は製薬企業向けプラットフォームチームで長らくデータ基盤の整備・改善といったいわゆる "データエンジニア" が行う業務にも取り組んできました。本日はその設計時に考えていること / 考えてきたことをデータ基盤の設計パターンという形でご紹介しようかと思います。多くの企業で必要性が認識されるようになって久しい "データ基盤" ですが、まだまだ確立された知見の少ない領域かと思います。少しでもデータエンジニアリングを行う方の業務の参考になれば幸いです。データ基盤の全体像収集部分の構成 RDBデータログデータ活用部分の構成データマートの実例「データ基
manboubird 2021/10/17
m3

architecture

dataManagement

dataPlatform
リンク
レガシー化したData Pipelineの廃止 ― メルカリのData Architectのお仕事例｜Mercari Analytics Blog
Analytics Infra チームの@hizaです。この記事ではメルカリの分析環境を改善した事例を紹介します。今回は「運用に課題があってリプレースしたいが、業務への影響が大きすぎてリプレースできない」そんな板挟みな状況を解決した事例です。また、その紹介を通じてメルカリのData Architectがどんな仕事をしているのかその一部を感じてもらえる記事をめざしました。メルカリのデータ活用の現状メルカリには様々な職種でデータを活用する文化があります。 AnalystやML Engineerの他にも、PdMやCustomer Supportなども業務にデータを活用しています。結果として社内のBigQueryユーザー数は月間800名を超えるほどになりました。こういった環境ではデータが良く整備されている事が事業の成果に大きく影響しえます。例えば、使いやすいDWHがあれば多数の社員の業
manboubird 2021/09/16
mercari

architecture

dataManagement

dataEngineering

dataPlatform
リンク
Accelerate the Development of AI Applications | Scale AI
With Your DataMake the best models with the best data. Scale Data Engine powers nearly every major foundation model, and with Scale GenAI Platform, leverages your enterprise data to unlock the value of AI.
manboubird 2021/03/27
dataset

scale

startup

artificialIntelligence

dataManagement

dataPlatform
リンク
just4fun.fm - #4 データ基盤、データの民主化と文化革命 with @syou6162
manboubird 2021/01/06
podcast

dataManagement

dataPlatform

mercari
リンク
大量のユーザーデータを横断的に使うために　LINEのデータサイエンティストが気をつけているいくつかのこと
2020年11月25〜27日の3日間、LINE株式会社が主催するエンジニア向け技術カンファレンス「LINE DEVELOPER DAY 2020」がオンラインで開催されました。そこで LINEのフェローであり、Data Science and Engineeringセンターに所属する並川淳氏が、「LINEではどのようにサービス横断でのデータ活用を実現しているのか」というテーマで、LINEにおけるデータの扱い方について共有しました。 LINEにおけるデータ活用の取り組み並川淳氏（以下、並川）：本日は「LINEではどのようにサービス横断でのデータ活用を実現しているのか」というタイトルで、並川が発表いたします。私は、LINEではふだん機械学習に関わる開発全般を担当しています。ですが、今日は機械学習に限らず、LINEにおけるデータ活用の取り組みについて幅広く紹介させてもらえればと思っています。よ
manboubird 2020/12/03
machineLearning

line

slide

mlOps

dataPlatform

dataManagement

management
リンク
Airbnb’s Forecasting Series: Data Platform | by Jerry Chu | Medium
manboubird 2020/11/15
Airbnb

dataPlatform

airflow

dataManagement
リンク
"壊れにくい"データ基盤を構築するためにMackerelチームで実践していること - Hatena Developer Blog
こんにちは。MackerelチームにおいてCRE（Customer Reliability Engineer）をしているid:syou6162です。主にカスタマーサクセスを支えるデータ基盤の構築や、データ分析を担当しています。今回は、壊れにくいデータ基盤を構築するため、Mackerelチームで実践していることを紹介します。なぜ壊れにくいデータ基盤を構築するのかデータ基盤が“壊れている”とはどういうことか壊れてないだけでなく、壊れたら気付ける前提とするシステム構成壊れたことに気付けるよう監視する 1. バッチジョブが失敗したことに気付く 2. 投入されたデータの性質を監視する 3. ビューが壊れてないかを監視する 4. 利用状況を監視するそもそも壊れてない状態を保つ 1. データリネージを元に修正できるようにする 2. 使われていないテーブルやビューは定期的に掃除おわりに参
manboubird 2020/10/25
dataManagement

bigQuery

monitoring

validation

mackerel

dataQuality

dataPlatform
リンク
Uber’s Data Platform in 2019: Transforming Information to Intelligence
You’re seeing information for Japan . To see local features and services for another location, select a different city. Show more Uber’s busy 2019 included our billionth delivery of an Uber Eats order, 24 million miles covered by bike and scooter riders on our platform, and trips to top destinations such as the Empire State Building, the Eiffel Tower, and the Golden Gate Bridge. Behind the scenes
manboubird 2020/09/28
uber

dataPlatform

dataManagement
リンク
マイクロサービスのための分散データ〜イベントソーシング vs チェンジデータキャプチャ - 赤帽エンジニアブログ
インテグレーションのためのミドルウェア製品のテクニカルサポートを担当している山下です。今回はレッドハットのシニアアーキテクトである Eric Murphy さんによる「マイクロサービスのための分散データ〜イベントソーシング vs チェンジデータキャプチャ(CDC)」の翻訳記事です。この記事では、イベントソーシング、CDC、CDC + Outboxパターン、CQRSをそれぞれ簡単に説明しながら、それらの特性の違いを比較します。また、イベントソーシングとCQRSの簡易な説明がなされている他、あまり明確に語られることが少ないもののソフトウェアの設計に大きな影響をおよぼすドメインイベントとチェンジイベントの違いにも触れられています。 [原文] Distributed Data for Microservices — Event Sourcing vs. Change Data Captur
manboubird 2020/04/26
changeDataCapture

microServiceArchitecture

eventSourcing

outbox

debezium

designPattern

dataPlatform

dataManagement

redhat
リンク
Secure Knowledge Graph for Trusted AI | Fluree
Intelligent Database
manboubird 2020/04/14
fluree

dataManagement

dataEngineering

dataPlatform
リンク
データ基盤開発ひとり - Qiita Advent Calendar 2019 - Qiita
The Qiita Advent Calendar 2019 is supported by the following companies, organizations, and services.
manboubird 2020/02/17
dataManagement

dataPlatform
リンク
安心して使えるデータ基盤を作る
2020/01/24 大規模データ集積/分析基盤 Meet-up! の発表資料です。
manboubird 2020/02/13
slide

dataPlatform

dataManagement

dataQuality
リンク
Big Data, Big Decisions: Finding the Right Technology for Interactive Analytics at Salesforce - Salesforce Engineering Blog
Big Data, Big Decisions: Finding the Right Techno logy for Interactive Analytics at Salesforce written Ram Sangireddy and Kartik Chandrayana, Product Management, Big Data Platform @ Salesforce, with contributions from our colleagues at Salesforce: Andrew Torson, William Earl, Vincent Poon, and Lars Hofhansl The world has come a long way from the business needs around data and supporting techno logie
manboubird 2019/12/22
analytics

dataScience

salesforce

dataPlatform

dataManagement
リンク
株式会社フライウィール - FLYWHEEL, Inc.
「DXがわからない」「部門間でデータが連携されていない」「データ活用する目的がわからない」「人の手に頼った業務が多い」
manboubird 2019/11/20
flywheel

startup

dataPlatform

artificialIntelligence

machineLearning

dataManagement
リンク
https://dl.acm.org/doi/10.1145/3299869.3314050
manboubird 2019/07/27
paper

dataManagement

sigmod

dataPlatform

machineLearning

apple
リンク
Uber’s Big Data Platform: 100+ Petabytes with Minute Latency
Data / ML, EngineeringUber’s Big Data Platform: 100+ Petabytes with Minute LatencyOctober 17, 2018 / Global Uber is committed to delivering safer and more reliable transportation across our global markets. To accomplish this, Uber relies heavily on making data-driven decisions at every level, from forecasting rider demand during high traffic events to identifying and addressing bottlenecks in our
manboubird 2018/10/22
uber

dataPlatform

hadoop

schemaManagement

dataManagement

standardization

dataModeling

dataQuality
リンク
1 2 次のページ

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx