umitanuki's bookmarks - Hatena Bookmark

Ultimate Guide to Funnel Optimization

umitanuki 2015/04/15

Modern SQL in PostgreSQL

SQL has gone out of fashion lately—partly due to the NoSQL movement, but mostly because SQL is often still used like 20 years ago. As a matter of fact, the SQL standard continued to evolve during the past decades resulting in the current release of 2016. In this session, we will go through the most important additions since the widely known SQL-92. We will cover common table expressions and window

umitanuki 2015/02/14

PFI Seminar 2012/03/15: Machine Learning with Kernels and Hashing

Direct feedback alignment provides learning in Deep Neural Networks

umitanuki 2014/12/09

I Want to Compress Floating-Point Numbers (IEEE 754) @dsirnlp#4

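Presentation material from GTC Japan 2015, held September 18, 2015. Shinya Morino (森野慎也), Senior CUDA Engineer, Platform Business Division, NVIDIA Japan. The CUDA Toolkit ships with development tools such as Nsight and Visual Profiler as standard. This session explains the basic operations for debugging and profiling with these tools. Using case studies, it also introduces efficient debugging techniques and the basic points to check when profiling. Both Windows and Linux platforms are covered.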

umitanuki 2014/05/12

Leadership Without Management: Scaling Organizations by Scaling Engineers

My talk at Surge 2013. Video is at http://www.youtube.com/watch?v=bGkVM1B5NuI Caution: Should not be consumed by stack-ranking six-sigma black belts with fragile constitutions.

umitanuki 2013/10/04

Ad Click-Through Rate Prediction for KDD'12 Track 2 Using Hive/Pig

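1. Ad Click-Through Rate Prediction for KDD'12 Track 2 Using Hive/Pig. Makoto Yui (油井誠), m.yui@aist.go.jp, Information Technology Research Institute, National Institute of Advanced Industrial Science and Technology (AIST). Twitter ID: @myui. Slides: http://www.slideshare.net/myui/dsirnlp-myuilt http://goo.gl/Ulf3A 2. KDDcup 2012 track2 • A task to estimate the click-through rate (CTR) of search engine ads from search logs – Real data from soso.com, one of China's three major search engines • Search terms and similar fields are all converted to numbers, for example by hashing – Training data (approx. 10GB+2.2GB, 1.5 billion records) – Test data (approx. 1.3GB, 200 million records) • The evaluation dataset amounts to 13.3% of the training data –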

umitanuki 2013/06/16

machine learning

What's New and Upcoming in HDFS - the Hadoop Distributed File System

Todd Lipcon shares at the Federal Big Data Forum what is new and upcoming in HDFS (Hadoop Distributed File System).

umitanuki 2013/02/24

hadoop

Analytical Queries with Hive: SQL Windowing and Table Functions

Hive Query Language (HQL) is excellent for productivity and enables reuse of SQL skills, but falls short in advanced analytic queries. Hive's Map & Reduce scripts mechanism lacks the simplicity of SQL, and specifying new analysis is cumbersome. We developed SQLWindowing for Hive (SQW) to overcome these issues. SQW introduces both Window

umitanuki 2012/11/05

hadoop
sql

Machine Learning with Hadoop

Sangchul Song and Thu Kyaw discuss machine learning at AOL, and the challenges and solutions they encountered when trying to train a large number of machine learning models using Hadoop. Algorithms including SVM and packages like Mahout are discussed. Finally, they discuss their analytics pipeline, which includes some custom components used to interoperate with a range of machine learning librarie

umitanuki 2012/10/28

machine learning

Large-Scale Image Recognition and Related Topics

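2. Contents • Examples of what large-scale image data makes possible • Introduction to generic object recognition • The trend toward larger scale and recent methods • Large-scale generic object recognition competitions • Areas that overlap with other fields 3. The era of large-scale image data • Posting images to web services is part of everyday life • Flickr: 6 billion images (2011) • Facebook: 3 billion images posted every year • Youtube: about 8 years' worth of video uploaded every day • Some kind of metadata is often attached • Tags, comments, EXIF, location information, ... • Using this large volume of data, a variety of previously unimaginable applications have appeared 4. Image completion • Scene completion using millions of photographs [Hays et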

umitanuki 2012/04/24

machine learning

Continuous Integration with Jenkins@EC2

This document discusses using Jenkins to run continuous integration jobs on Amazon EC2 instances. It describes how to launch a Jenkins slave node on an EC2 instance using the EC2 API tools and SSH. The Jenkins slave node runs jobs and reports back to the Jenkins master, and the EC2 instance can be automatically started before jobs and stopped afterwards to avoid costs when not in use.

umitanuki 2012/04/20

Apache Mahout - Random Forests - #TokyoWebmining #8

The document discusses social media, social graphs, personality modeling, data mining, machine learning, and random forests. It references social media, how individuals connect through social graphs, modeling personality objectively, extracting patterns from data through data mining and machine learning techniques, and the random forests algorithm developed by Leo Breiman in 2001.

umitanuki 2012/03/30

machine learning

Introduction to Eye-grep (目grep) + Commentary

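2. Excuses 1 • For reasons I don't really understand, it was apparently well received • But – "This makes no sense lol" – "The second half is incomprehensible" – "That isn't even what eye-grep means" • ...were the kinds of retorts I got • Apparently people liked that it was crazy? – It's not crazy at all! • Still, some people commented "This is useful material!!"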

umitanuki 2012/03/17

Real-Time Distributed Recommendation with Jubatus @TokyoNLP#9

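1. Real-Time Distributed Recommendation with Jubatus, 2012/02/25 @TokyoNLP. Preferred Infrastructure, Inc. Yuya Unno (海野裕也, @unnonouno) 2. Self-introduction • Yuya Unno (@unnonouno) • unno/no/uno • Preferred Infrastructure, Inc., Research and Development Division • Development of the search and recommendation engine Sedue, among other things • Specialties • Natural language processing • Text mining • Jubatus developer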

umitanuki 2012/02/26

machine learning

Hadoop Source Code Reading #8 / Trying Out MapR

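1. MapR & Multi-tenancy (including Mesos verification). Hadoop Source Code Reading #8, 2012/02/08 (Wed). Speakers: 中野猛 (RECRUIT) @tf0054, 高林貴仁 (RECRUIT) @tatakaba, 大坪正典 (NSSOL) @tsubo0423. Agenda: 1. Performance verification 2. Functional verification (multi-tenancy verification) 3. Evaluation of MapR at Recruit. 3. 1. Performance verification: summary of the tests. The batch workload consists of three jobs that were actually run on a used-car site, ported to Hive and measured in two configurations: non-partitioned + uncompressed, and partitioned + compressed. VCA01 – From 20 tables, 5 Tem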

umitanuki 2012/02/24

lsh

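2. (Nearest) neighbor search: in short, searching for similar data in a feature space. Two kinds of problems can be considered. Definition: given a set of points P in the space ℜ^d. Nearest neighbor search: for a query point q, find the point p∈P that minimizes ||p-q||. r-near neighbor search: for a query point q, enumerate the points p∈P such that ||p-q||<r (if any exist). 3. The neighbor search problem: neighbor search algorithms are used in tasks such as the following: instance-based learning (k-nearest neighbors), clustering, data segmentation, database search, shortest-path tree search (Minimum Spanning Tree), data compression, similar-data retrieval. 4. Neighbor search algorithms: the simplest one, for the query point q and all the points p∈P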

umitanuki 2012/01/26

machine learning

I Tried Sending a Patch to Mahout

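3. Tonight is a total lunar eclipse • The best viewing conditions in 11 years; the next chance is apparently January 31, 2030 – Partial eclipse begins: 21:54 – Total eclipse begins: 23:05 – Maximum eclipse: 23:31 – Total eclipse ends: 23:58 4. Outline • Since this is an implementation-oriented study group... • A thorough walk-through of how Mahout implements its algorithms in MapReduce • Supervised learning only • An introduction to the patch I sent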

umitanuki 2011/12/10

machine learning

Large-Scale Distributed Online Machine Learning in Jubatus

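1. Large-Scale Distributed Online Machine Learning in Jubatus, 2011/12/08 @ Large-Scale Data Processing Study Group. Preferred Infrastructure, Inc. Yuya Unno (海野裕也, @unnonouno) 2. Self-introduction • Yuya Unno (@unnonouno) • Researcher, Research and Development Division, Preferred Infrastructure (PFI) • About 20 employees • Development of the search and recommendation engine Sedue, among other things • Specialties • Natural language processing • Text mining • Role in the Jubatus project • Mainly research and development of the feature extraction engine and the machine learning engine 3. Big Data! • Data will keep growing; the fact that it keeps increasing matters more than the fact that it is large • Able to handle changes in data volume, a scalable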

umitanuki 2011/12/09

machine learning

Start Vim script @Ujihisa.vim 2011/11/19

The document discusses Vim script and provides an introduction to writing Vim script. It begins with an overview of Vim script and discusses using :help to learn syntax. It provides an example function and use of :command. The document encourages learning from good Vim scripts and provides some examples. It discusses uses of Vim script including ftplugin, plugins, and libraries.

umitanuki 2011/11/20

x86/x64 Optimization Techniques from the Perspective of Research Trends

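2. Today Agenda: overview. Topics from research on data structures and algorithms that reduce, or completely eliminate, penalties in response to multi-core CPUs and the growth of various penalties. ---- Today's aim is a survey of results since 2000 from Intel Lab. and related researchers. The goals of these slides are: • Understanding the perspectives for performance improvement in the multi-core/many-core era • Understanding the outline of x86/x64 optimization algorithms through concrete examples ⇒ search, integer compression, sorting 3. Today Agenda • Self-introduction • What is Intel Lab.? • Recent research trends • Optimization perspectives in the research field – Reducing cache misses/DTLB misses – Branch elimination – Taking memory bandwidth usage into account • Example 1: Branch elimination in search using SIMD instructions • Example 2: Avoiding pipeline hazards with fixed-length integer compression • Example 3: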

umitanuki 2011/10/02

x86
