[B! algorithm] [5ページ] manboubirdのブックマーク

manboubird id:manboubird

algorithmに関するmanboubirdのブックマーク (202)

Eyeo 2014 - Mike Bostock
manboubird 2015/01/22
eyeo

visualization

algorithm
リンク
Dimension Independent Matrix Square using MapReduce
manboubird 2014/10/26
paper

mapreduce

algorithm

recommendation

twitter

dimsum
リンク
All-pairs similarity via DIMSUM
We are often interested in finding users, hashtags and ads that are very similar to one another, so they may be recommended and shown to users and advertisers. To do this, we must consider many pairs of it ems, and evaluate how “similar” they are to one another. We call this the “all-pairs similarity” probl em, sometimes known as a “similarity join.” We have developed a new efficient algorithm to so
manboubird 2014/10/26
recommendation

dimsum

scalding

algorithm
リンク
Data Algorithms
What Is MapReduce?Simple Explanation of MapReduceWhen to Use MapReduceWhat MapReduce Isn’tWhy Use MapReduce?Hadoop and SparkWhat Is in This Book?What Is the Focus of This Book?Who Is This Book For?Online ResourcesWhat Software Is Used in This Book?Conventions Used in This BookUsing Code ExamplesSafari® Books OnlineHow to Contact UsAcknowledgmentsComments and Questions for This Book Solutions to th
manboubird 2014/09/18
book

algorithm

dataStructure

oreilly

bigData

recommendation

machineLearning
リンク
Top-k文書列挙問題 - DO++
いろいろとありまして去年読んだ論文で面白かったものランキングとか書けなかったのが残念ですが、もしあげるとしたら次の論文は入れると思います（知ったのは年明けだったけど）。 "Space-Efficient Framework for Top-k String Retrieval Probl ems", FOCS 2009, Wing Kai Hon, Rahul Shah and Jeffrey Scott Vitter (pdf) 扱っているのは次のような問題です（説明のため本来のと言い換えています） n個の葉からなる木が入力として与えられ，各葉には色（1以上d以下の整数とします）が与えられています．この時、木中の任意の節点と正整数kがクエリとして与えられたときに、その節点の子孫の中で出現回数が大きい色を順にk個答えよという問題です。簡単に思いつくのは，各節点に適当な個数(d)の答えをあ
manboubird 2014/08/27
topK

algorithm

textClassification
リンク
乱択データ構造の最新事情－MinHash と HyperLogLog の最近の進歩－
MinHash, b-bit MinHash, HyperLogLog, Odd Sketch, HIP Estimator の解説です．
manboubird 2014/08/27
hyperLogLog

slide

algorithm
リンク
Spotify: 曲をシャッフルするのは単純にランダムではいけない - ワザノバ | wazanova
http://labs.spotify.com/2014/02/28/how-to-shuffle-songs/ 1 comment | 0 points | by WazanovaNews ■ comment by Jshiike | 約4時間前 SpotifyのLukáš Poláčekがプレイリストをシャッフルするロジックを改善した取り組みを紹介しています。以前のロジックランダムアルゴリズムには、Fisher-Yates shuffleを利用。順次再生する曲を選ぶロジック同士には依存関係がなく、完全にランダムに選択される。よって、同じアーティストとの曲が連続して再生されることも可能性としてはある。これはギャンブラーの誤謬と呼ばれる現象。例えば、コイントスで表が連続してでると、次は裏が出ると思いがちであるが、常に確率は1/2である。従前の結果が次の結果に影響を与えると考えてし
manboubird 2014/08/24
spotify

video

algorithm
リンク
Ashish Goel
manboubird 2014/08/10
researcher

stanford

twitter

recommendation

algorithm
リンク
AMDM: Algorithms for Modern Data Models
MS&E 317: Algorithms for Modern Data Models Spring 2014, Stanford University Mon, Wed 2:15 PM - 3:30 PM at Meyer 143 Instructors: Ashish Goel | Reza Zadeh We traditionally think of algorithms as running on data available in a single location, typically main memory. In many modern applications including web analytics, search and data mining, computational biology, finance, and scientific computing,
manboubird 2014/08/10
algorithm

dataStructure

ucBerkeley

lecture

Spark
リンク
Redis bitmaps - Fast, easy, realtime metrics -
At Spool, we calculate our key metrics in real time. Traditionally, metrics are performed by a batch job (running hourly, daily, etc.). Redis backed bit maps allow us to perform such calculations in realtime and are extremely space efficient. In a simulation of 128 million users, a typical metric such as “daily unique users” takes less than 50 ms on a MacBook Pro and only takes 16 MB of memory. Spo
manboubird 2014/08/06
redis

bitMap

realtime

algorithm

dataStructure
リンク
http://raftconsensus.github.io/
manboubird 2014/06/24
raft

algorithm

paxos
リンク
Dimension Independent Similarity Computation (DISCO)
MapReduce is a programming model for processing large data sets, typically used to do distributed computing on clusters of commodity computers. With large amount of processing power at hand, it’s very tempting to solve probl ems by brute force. However, we often combine clever sampling techniques with the power of MapReduce to extend its utility. Consider the probl em of finding all pairs of similar
manboubird 2014/06/15
Dimension Independent Similarity Computation

algorithm

scalding

optimization

matrix

sampling
リンク
Scalable image search 2: bag of words | Image search and other things
manboubird 2014/04/07
imageSearch

algorithm
リンク
Scalable image search 1: using the k-d tree | Image search and other things
manboubird 2014/04/07
imageSearch

algorithm
リンク
Image search algorithm: a toy example. | Image search and other things
manboubird 2014/04/07
imageSearch

algorithm
リンク
Tim Roughgarden's Homepage
Professor of Computer Science and member of the Data Science Institute at Columbia University. Head of Research at a16z crypto. Research interests: Design, analysis, applications, and limitations of algorithms. Game theory and microeconomics, especially as applied to networks, auctions, and blockchains/web3. Address: Department of Computer Science Columbia University 500 West 120th Street, Room 45
manboubird 2014/03/16
researcher

algorithmicGameTheory

stanford

lecture

video

algorithm

gameTheory
リンク
Eyetracking Study Reveals What People Actually Look At When Shopping Online
manboubird 2014/01/26
eyeTracking

userInteraction

ec

prediction

algorithm
リンク
広告と機械学習 - Qiita
Machine Learning Advent Calendar向けの記事です。普段はGunosyという会社で社長業をしながら社長をしています。ざっくりいうと結論だけ知りたい人はここだけ広告における機械学習の応用の多くはCTR予測や運用の最適化のため(クエリー予測とか)の予測問題今後は「CVRの予測」や「アクティブなユーザーの予測」がホットな話題になる(加えてその運用をどう最適化するかといった話題も) 現在は検索エンジンの応用例が多い。今後はディスプレイ広告やタイムライン広告への応用が増えていく個人のユーザー属性を集めることが今まで以上にメディアのビジネス的に重要になる広告や推薦エンジンに限らずドメイン知識は非常に重要。ドメイン知識と機械学習の知識を持ったエンジニアが意思決定に携わる会社は今後大きくのびる(と思う) 広告について最近はもっぱら広告の開発をしており、広告分野で
manboubird 2014/01/13
gunosy

ad

machineLearning

algorithm
リンク
https://tech.nextroll.com/media/hllminhash.pdf
manboubird 2013/09/28
HyperLogLog and MinHash A Union for Intersections, Andrew Pascoe

hyperLogLog

minhash

algorithm

paper

adRoll
リンク
All In on Real-time: Hokusai adds a temporal component to Count-Min Sketch
Introduced in 2003 by Cormode and Muthukrishnan, the Count-Min sketch is a popular and simple algorithm for summarizing 1 data streams. In...
manboubird 2013/07/31
countSketch

algorithm

aggregation
リンク
前のページ 1 2 3 4 5 6 7 8 9 10 次のページ

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx