kimutanskのブックマーク / 2016年9月6日

kimutansk id:kimutansk

2016年9月6日のブックマーク (3件)

Microservices in the Apache Kafka Ecosystem
The document provides an introduction and overview of Apache Kafka presented by Jeff Holoman. It begins with an agenda and background on the presenter. It then covers basic Kafka concepts like topics, partitions, producers, consumers and consumer groups. It discusses efficiency and delivery guarantees. Finally, it presents some use cases for Kafka and positioning around when it may or may not be a
kimutansk 2016/09/06
様々な構成パターンが示されているのは面白いですが、What is Distributed Log? > Kafkaはプロダクト名的に微妙ｗ

stream

kafka
リンク
Next generation tools for data science
By DAVID ADAMS Since inception, this blog has defined “data science” as inference derived from data too big to fit on a single computer. Thus the ability to manipulate big data is essential to our notion of data science. While MapReduce rem ains a fundamental tool, many interesting analyses require more than it can offer. For instance, the well-known Mantel-Haenszel estimator cannot be implemented
kimutansk 2016/09/06
Sparkはグラフを遅延評価するがDataflowは実行前に構築。結果SparkはREPL等を用いた対話処理に向くがDataflowは実行部を分離して様々な基盤上で動作可能になる？この視点の比較は面白い。

stream

spark

dataflow
リンク
NoSQLデータベース：調査と決定のガイダンス（その1） | POSTD
(訳注：2016/10/1、頂きましたフィードバックを元に記事を修正いたしました。) 先月、ハンブルク大学の同僚たちと一緒に SummerSOC 2016 でNoSQLの状況についての講演をしました。メンバーは、 Felix Gessert 、 Wolfram Wingerath 、 Steffen Friedrich 、 Norbert Ritter でした。今回はその講演の要点を記事にまとめました。 Baqend を設立して集めた、私たちのNoSQLの濃厚な知識が皆さんに伝われば幸いです。要約現在、データはかつてない規模で生み出され、消費されています。増え続けるデータ量とリクエスト負荷に対応するために、「NoSQL」データベースシステムという用語で包括されるスケーラブルなデータマネジメントの新しい方法が産み出されてきました。しかし、数多く存在するシステムは、不均一で多様性があるので
kimutansk 2016/09/06
PACELCは性質としては知っていたもののきちんとした記事は読んだことなかった。今度読んでおきましょう。

NoSQL

distributed
リンク
- 2016年9月9日
- 2016年9月6日
- 2016年9月5日