[B! algorithm] [8ページ] manboubirdのブックマーク

manboubird id:manboubird

algorithmに関するmanboubirdのブックマーク (202)

http://www.cognotics.com/opencv/servo_2007_series/part_2/sidebar.html
manboubird 2011/01/09
faceDetection

tutorial

algorithm

opencv
リンク
Sign in - Google Accounts
Not your computer? Use a private browsing window to sign in. Learn more about using Guest mode
manboubird 2011/01/01
algorithm

suffixarray

java

c++
リンク
Spam Detection
manboubird 2010/12/29
machineLearning

spam

mail

algorithm

NaiveBayes
リンク
Sorting Values With JavaScript | Envato Tuts+
If you already understand the basics of JavaScript arrays, it's time to take your skills to the next level with more advanced topics. In this series of tutorials, you'll explore intermediate-level topics for programming with arrays in JavaScript. Sorting arrays is one of the most common tasks you'll do when programming in JavaScript. Therefore, as a JavaScript programmer, it’s essential that you l
manboubird 2010/12/26
javascript

algorithm
リンク
ymuto109の日記 - AdaBoostに関する調査
/* * Yahoo! Japan ディベロッパーネットワーク：テキスト解析：形態素解析サービスの利用 * http://developer.yahoo.co.jp/webapi/jlp/ma/v1/parse.html * 2015.11.18 */ import java.net.HttpURLConnection; import java.net.URL; import java.net.URLEncoder; import java.io.InputStream; import javax.xml.parsers.DocumentBuilder; import javax.xml.parsers.DocumentBuilderFactory; import org.w3c.dom.Document; import org.w3c.dom.NodeList; import java
manboubird 2010/12/18
machineLearning

algorithm
リンク
MapReduce デザインパターン (6) - めもめも
"Graph algorithms" を考えます。下図の S, A, B, C, D は 5 つの Web ページで、矢印は、Web リンクを表しているとします。（このような関係は、『有向グラフ』と捉えることができます。）このとき、ページ S からスタートして、最低、何クリックで、A 〜 D の各ページに到達するかを計算します。 ___________ ___________ ｜　　　　↓| ↓ Ｓ → Ａ → Ｂ → Ｃ ← Ｄ ↑____|↑__________|____↑ 再帰的な計算をごにょごにょすれば、何とかなる気がしますが（普通は、ダイクストラのアルゴリズムを使います）、MapReduce の場合は、基本的には『端から順番になめていく』タイプの計算しかできないので、次のように考えます。 1. 各ページの求める答えを（最初は不明なので）『？』と定義します。（S 自身は、不明で
manboubird 2010/12/14
graph

algorithm

mapreduce

hadoop
リンク
Evaluating the efficiency of Internet banner advertisements - ScienceDirect
manboubird 2010/12/11
adServing

comparison

algorithm

paper
リンク
Ad serving kdd2008_2
manboubird 2010/12/09
ad

kdd

slide

algorithm
リンク
https://dl.acm.org/citation.cfm?id=1402010
manboubird 2010/12/09
kdd

algorithm

ad

comparison

paper
リンク
Google Answers: categorization
manboubird 2010/12/09
mahout

machineLearning

algorithm

feature
リンク
Log In - Apache Software Foundation
Powered by a free Atlassian Confluence Open Source Project License granted to Apache Software Foundation. Evaluate Confluence today. Čeština Dansk Deutsch Eesti English (UK) English (US) Español Français Íslenska Italiano Magyar Nederlands Norsk Polski Português Română Slovenčina Suomi Svenska Русский 中文日本語 한국어 Powered by Atlassian Confluence 7.19.30 Printed by Atlassian Confluence 7.19.30 Report
manboubird 2010/12/09
ad

algorithm

logisticRegression

mahout

predictiveAnalysis
リンク
大規模データで単語の数を数える - ny23の日記
大規模データから one-pass で it em（n-gram など）の頻度を数える手法に関するメモ．ここ数年，毎年のように超大規模な n-gram の統計情報を空間／時間効率良く利用するための手法が提案されている．最近だと， Storing the Web in Memory: Space Efficient Language Models with Constant Time Retrieval (EM NLP 2010) とか．この論文では，最小完全ハッシュ関数や power-law を考慮した頻度表現の圧縮など，細かい技術を丁寧に組み上げており，これぐらい工夫が細かくなってくるとlog-frequency Bloom filter (ACL 2007) ぐらいからから始まった n-gram 頻度情報の圧縮の研究もそろそろ収束したかという印象（ちょうど論文を読む直前に，この論文の7節の
manboubird 2010/12/05
ngram

scallability

topK

algorithm
リンク
Kazuho@Cybozu Labs: アクセスログからアテンション（注目情報）をデータマイニングする手法について
多数のユーザーの行動記録からアテンション情報（注目されているデータが何か）をデータマイニングしたいというのは、大量のデータを扱っているウェブサイトにおいては自然と出てくる要求です。そこで、先月末にサービスを終了したサービス「パストラック」において使用していた、アクセスログから注目度（人気度）の高いウェブページや人名等のキーワードを抽出するためのアルゴリズムを紹介しておきたいと思います。たとえばはてなブックマークのような、ユーザーの能動的な行為（「ブックマークする」という作業）から注目情報を抽出するのは決して難しいことではありません。それは、直近の一定期間内のブックマーク数＝注目度、という前提が上手に機能するからです。現に、はてなブックマークの人気エントリーは、最近24時間程度の期間内にブックマークしたユーザー数の多い URL を降順で並べているように見受けられます。しかし、アクセスログ
manboubird 2010/12/04
algorithm

ranking

cybozu
リンク
自然言語処理勉強会＠東京第1回の資料 - 木曜不足
本日の tokyotextmining こと自然言語処理勉強会＠東京第1回で話す「Webページの本文抽出 using CRF」の資料(自己紹介は除く)です。以前、Ruby で作った本文抽出モジュールを機械学習の技術を使って作り直してみたら、というお話。 CRF は Conditional Random Fields の略。 Web本文抽出 using crf from Shuyo Nakatani 実装はこのあたり。 http://github.com/shuyo/iir/blob/master/sequence/crf.py http://github.com/shuyo/iir/blob/master/sequence/pg.py http://github.com/shuyo/iir/blob/master/extractcontent/webextract.py 【追記】
manboubird 2010/12/04
informationExtraction

CRF

algorithm

slide
リンク
That's close to the current version, but a little out of date. Here's the code r... | Hacker News
manboubird 2010/12/02
ranking

algorithm

hackerNews
リンク
How Reddit ranking algorithms work - amix.dk
This is a follow up post to How Hacker News ranking algorithm works. This time around I will examine how Reddit's default story and comment rankings work. Reddit's algorithms are fairly simple to understand and to implement and in this post I'll dig deeper into them. The first part of this post will focus on story ranking, i.e. how are Reddit stories ranked? The second part of this post will focus
manboubird 2010/12/02
ranking

algorithm

reddit
リンク
How Hacker News ranking algorithm works - amix.dk
In this post I'll try to explain how Hacker News ranking algorithm works and how you can reuse it in your own applications. It's a very simple ranking algorithm and works surprising well when you want to highlight hot or new stuff. Digging into news.arc code Hacker News is implemented in Arc, a Lisp dialect coded by Paul Graham. Hacker News is opensource and the code can be found at arclanguage.or
manboubird 2010/12/02
def calculate_score(votes, item_hour_age, gravity=1.8): return (votes - 1) / pow((item_hour_age+2), gravity)

ranking

algorithm

hackerNews
リンク
アマゾン、440億円の大型買収へ。ベゾスも恐れるダイパーズ、驚異的成長の秘密：In the looop：オルタナティブ・ブログ
Amazonが、Quidsiを買収すると複数のブログメディアが報じた。買収額は540百万ドル(約440億円)だ。 Quidsi(以下、運用サイトDiapers[ダイパーズ]と略する)は、創業2005年。ベビー用品コマース Diapers.com を中心に、Soap.com、BeautyBar.com を運営している急成長ベンチャーだ。特にDiapers.comの成長は驚異的で、わずか創業4年にもかかわらず、2009年売上で180百万ドル(約146億円)、2010年売上は300百万ドル(約244億円)は達すると見込まれている。【急成長ベビー用品コマース　Diapers.com】ちなみに、Amazonの大型買収は、900百万ドル(約732億円)を投入したZappos以来だ。・アマゾンが800億かけても買収したかった「ザッポスの奇跡」 (12/7) このZapposとDiapersは、巨
manboubird 2010/11/21
diapers

amazon

M&A

algorithm
リンク
PPJoinとPPJoin+の検証｜社内NEET宣言
社内NEET宣言文学部出身なのにIT企業で研究開発をすることになった社員のブログです。 PR Profile [ルームを見る|なうを見る] ニックネーム：just do neet 性別：男性誕生日：さだまさしがソロデビューしたあたり出身地：神奈川県自己紹介： NEETは豊かさの象徴だから進んでNEETになるべきです。ブログジャンル：エンジニア/ニートメッセージを送るアメンバーになるプレゼントを贈る [Publish] Calendar <<August>> S M T W T F S 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Theme ブログ ( 25 ) Java ( 29 ) 職場の風景 ( 30 ) インターネット ( 17 ) Unix系O
manboubird 2010/10/31
similarityJoin

algorithm
リンク
ダイクストラ法（最短経路問題）
ダイクストラ法 (Dijkstra's Algorithm) は最短経路問題を効率的に解くグラフ理論におけるアルゴリズムです。スタートノードからゴールノードまでの最短距離とその経路を求めることができます。アルゴリズム以下のグラフを例にダイクストラのアルゴリズムを解説します。円がノード，線がエッジで，sがスタートノード，gがゴールノードを表しています。エッジの近くに書かれている数字はそのエッジを通るのに必要なコスト（たいてい距離または時間）です。ここではエッジに向きが存在しない（＝どちらからでも通れる）無向グラフだとして扱っていますが，ダイクストラ法の場合はそれほど無向グラフと有向グラフを区別して考える必要はありません。ダイクストラ法はDP（動的計画法）的なアルゴリズムです。つまり，「手近で明らかなことから順次確定していき，その確定した情報をもとにさらに遠くまで確定していく
manboubird 2010/10/23
Dijkstra

algorithm

graph
リンク
前のページ 2 3 4 5 6 7 8 9 10 11 次のページ