[B! dimension][machinelearning] yassのブックマーク

yass id:yass

dimensionとmachinelearningに関するyassのブックマーク (12)

Feature Selection with R / in JP
7 R packages that might be helpful in selecting important features. Slides are in Japanese.Read less
yass 2014/06/29
feature selection

dimension

machinelearning

r
リンク
今年のSIGKDDベストペーパーを実装・公開してみました - Preferred Networks Research & Development
毎日暑いですね。比戸です。ちょうど今週シカゴで開かれていたSIGKDD2013でBest research paperに選ばれたEdo Liberty氏 (Yahoo! Haifa Labs)の”Simple and Deterministic Matrix Sketching”のアルゴリズムを実装して公開してみました。元論文PDFは著者サイトから、私が書いたPythonコードはGithubからそれぞれ入手できます。 SIGKDD (ACM SIGKDD Conference on Knowledge Discovery and Data Mining)はACM主催で行われる、知識発見＆データマイニングにおけるトップ会議です。最近は機械学習との境目が曖昧になってきましたが、査読時には理論的な新しさだけでなく、実データ（特に大規模データ）を使った実験での評価が必要とされるのが特徴です。
yass 2013/08/16
"SIGKDD (ACM SIGKDD Conference on Knowledge Discovery and Data Mining)はACM主催で行われる、知識発見＆データマイニングにおけるトップ会議/Matrix sketchとは簡単に言うと、元の大きなNxM行列Aを、はるかに小さなℓxM行列B（N >> ℓ）で近似"

matrix

k-means

machinelearning

sketch

PCA

dimension
リンク
Vol.27 No.3 (2012/05) Latent Topic Model (潜在的トピックモデル) | 人工知能学会 (The Japanese Society for Artificial Intelligence)
私のブックマーク Latent Topic Model (潜在的トピックモデル)東京大学情報基盤センター助教佐藤一誠 (Issei Sato) URL: http://www.r.dl.itc.u-tokyo.ac.jp/~sato/ 1.はじめに近年、Topic modelと呼ばれる確率的潜在変数モデルが、機械学習とデータマイニングの境界分野で盛んに研究されています。また、Topic modelは、自然言語処理、画像処理、Web解析など様々な応用分野でも多くの適用例が報告されています。ここでは、Topic modelの研究に関する情報を紹介します。 2.国際会議機械学習およびデータマイニングでは、主に国際会議で最先端の議論がされているため、主要国際会議を把握しておくことが重要です。Topic modelの研究では、主に以下の国際会議が重要視されています。 Neural Info
yass 2013/05/09
topic

LDA

dimension

machinelearning
リンク
What is the "hashing trick"? - MetaOptimize Q+A
Let's say we want to design a function v = phi(x), which from a d-dimensional vector x = (x(1), x(2), ..., x(d)) outputs a new m-dimensional vector v, with m either greater or smaller than d. In other words, phi can be used either for reducing dimensionality of x (d > m) or for sparsifying x (m > d). One way to do so is to use a hash function h to map x(1) to v(h(1)), x(2) to v(h(2)), ..., x(d) to
yass 2013/04/29
hash

dimension

hashing trick

machinelearning
リンク
H24:Introduction to Statistical Topic Models
統計数理研究所 H24年度公開講座「確率的トピックモデル」サポートページ講師: 持橋大地 (統数研), 石黒勝彦 (NTTコミュニケーション科学基礎研究所) 講義スライド持橋分 (2013/1/15) [PDF] (12MB) 石黒分 (2013/1/16) [PDF] ソフトウェア UM (Unigram Mixtures) um-0.1.tar.gz DM (Dirichlet Mixtures) dm-0.1.tar.gz, dm-0.2.tar.gz PLSI (Probabilistic Latent Semantic Indexing) plsi-0.03.tar.gz (外部サイト) LDA (Latent Dirichlet Allocation) lda-0.1.tar.gz 参考文献「私のブックマーク: Latent Topic Model (潜在的トピックモデ
yass 2013/04/29
LDA

machinelearning

topic

dimension
リンク
Speeding up Latent Dirichlet Allocation
The code to our LDA implementation on Hadoop is released on Github under the Mozilla Public License. It’s seriously fast and scales very well to 1000 machines or more (don’t worry, it runs on a single machine, too). We believe that at present this is the fastest implementation you can find, in particular if you want to have a) 1000s of topics, b) a large dictionary, c) a large number of documents,
yass 2013/04/29
LDA

machinelearning

dimension
リンク
Dimensionality reduction for sparse binary data - FastML
yass 2013/04/29
machinelearning

dimension

LDA

LSI

PCA

gensim
リンク
Hashing for Collaborative Filtering
This is a follow-up on the hashing for linear functions post. It’s based on the HashCoFi paper that Markus Weimer, Alexandros Karatzoglou and I wrote for AISTATS'10. It deals with the issue of running out of memory when you want to use collaborative filtering for very large probl ems. Here’s the setting: Assume you want to do Netflix-style collaborative filtering, i.e. you want to estimate entries
yass 2013/04/29
machinelearning

dimension

hashing trick

collaborative filtering

recommend
リンク
Google Code Archive - Long-term storage for Google Code Project Hosting.
Code Archive Skip to content Google About Google Privacy Terms
yass 2013/04/17
machinelearning

LSI

dimension
リンク
Dimensionality reduction for sparse binary data - an overview - FastML
yass 2013/03/29
" Overall, for big data LSI in Gensim might be a good first choice. It’s online and reasonably fast. For even bigger data, probably LDA in VW. Vowpal Wabbit has built-in super-fast LDA, which is interesting. "

machinelearning

dimension

LDA

TF-IDF

ICA
リンク
Latent Dirichlet Allocation ゆるふわ入門 - あらびき日記
この記事は abicky.net の Latent Dirichlet Allocation (LDA) ゆるふわ入門に移行しました
yass 2013/03/13
LDA

MachineLearning

dimension
リンク
潜在的意味インデキシング（LSI）徹底入門 - あらびき日記
この記事は abicky.net の潜在的意味インデキシング（LSI）徹底入門に移行しました
yass 2012/03/26
LSI

machinelearning

matrix

nlp

dimension
リンク
1

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx