[B! java][JAVA][hadoop] Oooのブックマーク

Ooo id:Ooo

javaとJAVAとhadoopに関するOooのブックマーク (14)

IBM Developer
IBM Developer is your one-stop location for getting hands-on training and learning in-demand skills on relevant techno logies such as generative AI, data science, AI, and open source.
Ooo 2011/12/22
apache

mahout

apache mahout

hadoop

java
リンク
Apache Mahoutで機械学習してみるべ - 都元ダイスケ IT-PRESS
Mahoutシリーズ目次（随時更新）非分散レコメンデーション Apache Mahoutで機械学習してみるべ - 都元ダイスケ IT-PRESS （これ）レコメンデーションの簡単な原理を視覚的に把握してから実際に計算してみる - 都元ダイスケ IT-PRESS 機械学習における重大な"仮定"と、アルゴリズムの評価 - 都元ダイスケ IT-PRESS 分散レコメンデーション Mahoutで分散レコメンド(1) - 都元ダイスケ IT-PRESS Mahoutで分散レコメンド(2) - 都元ダイスケ IT-PRESS Mahoutで分散レコメンド(3) - 都元ダイスケ IT-PRESS クラスタリング今度はMahoutでクラスタリング - 都元ダイスケ IT-PRESS 今度はMahoutでクラスタリング(ソース編) - 都元ダイスケ IT-PRESS では、本文いきます。 Apach
Ooo 2011/03/10
hadoop

apache

Mahout

java
リンク
Ashigelコンパイラ勉強会の感想 - ひしだまの変更履歴
ひしだまＨＰの更新履歴。主にＴＲＰＧリプレイの元ネタ集、プログラミング技術メモと自作ソフト、好きなゲームや音楽です。昨日『Ashigelコンパイラの勉強会』に参加してきました。スライド：Inside of Asakusa DSL Togetter：#ashigel Ashigelコンパイラの勉強会自分が書いてきたメモはほとんどスライドの写しなので、今回は感想だけ書きます。（というか全部理解できたとは言いがたいので、感想しか書けないというか(苦笑)）まず「DSL」という言葉について。自分は最近Scalaを勉強していてScalaの本でよく「DSLの例」を見かけていた。Scala上に新しい構文っぽいものを作り上げ、それを使って目的の記述を行う形式。なので漠然とAsakusa DSLもそんな感じかと思っていたんだけど、全然違った＾＾； Asakusa DSLは大きく三層に分かれていて
Ooo 2011/03/01
hadoop

DSL

java

scala
リンク
IBM Developer
IBM Developer is your one-stop location for getting hands-on training and learning in-demand skills on relevant techno logies such as generative AI, data science, AI, and open source.
Ooo 2011/02/18
hadoop

Java

mapreduce
リンク
Hadoop MapReduceプログラムを解剖する
CodeZine編集部では、現場で活躍するデベロッパーをスターにするためのカンファレンス「Developers Summit」や、エンジニアの生きざまをブーストするためのイベント「Developers Boost」など、さまざまなカンファレンスを企画・運営しています。
Ooo 2010/11/30
hadoop

java

mapreduce

apache
リンク
Hadoopのジョブのパフォーマンスチューニング
Hadoop 0.21ではCounterでGCに使っている時間が見れるようになりました。こんな感じです。この例では5秒程度ですが、ジョブによってはもっとGCに時間を使っている場合があり、もっと詳細を調べてチューニング出来ないかという話です。まずはGCのログを取ります。 <name>mapred.child.java.opts</name> <value>-Xloggc:/tmp/hadoop-mikami/@taskid@.gc -Xmx1024m</value> このように-Xloggc で指定した場所にログを取れます。 @taskid@ には attempt_201010311624_0037_m_000000_0 みたいな感じでattempt_id が入ります。以下が先程のジョブのあるMapタスクでのGCログです 0.164: [GC 3072K->416K(889
Ooo 2010/11/05
hadoop

performance

java
リンク
Legacy Communities - IBM Community
If you’re looking for a developerWorks forum — Don't panic! You are in the right place. You are here because specific IBM developerWorks forums, blogs and other Connections content have been decommissioned. This page will help you find the content you are looking for, get answers to your questions, and find a new community to call home. Where am I? You are on the IBM Community area, a collection o
Ooo 2010/08/18
hadoop

IBM

java
リンク
KarmaSphereでおじさんにもMapReduce(Java)できた - masayang's diary
ここ数年Javaからは遠ざかっていた。理由は色々だけど、なんか面倒くさいとか、あの辺が面倒だなとか、annotationsがなんか不気味で面倒っぽいなとか、まあそういうことで。あとコンパイルしてjar作ってとか。なんか昔その物じゃないですか。あ、エディタはフルスクリーンなの？　カード穿孔機は不要なの？　そりゃすごい。そういうこともあって最近遊んでいるMapReduceはPythonでストリーミングのを書くことでほぼ用は足りているのだけど、この先もしかしたらJavaでしか実現できない状況に追い込まれるかもしれん。それをガリガリとコードで書くのかPigとやらで実現しちゃうのかはわからんが、でもまあ原理を突き詰めるためにコードで苦労しておくのは損はないかな、と。その場合はJavaですよやっぱ。でもね、Javaって面倒じゃないですか。あの辺とかその辺とか。そんな自分の脳裏にKarmaSpher
Ooo 2010/07/20
mapreduce

hadoop

Java

eclipse
リンク
GitHub - nathanmarz/cascalog: Data processing on Hadoop without the hassle.
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
Ooo 2010/07/18
hadoop

clojure

Java
リンク
IBM Developer
IBM Developer is your one-stop location for getting hands-on training and learning in-demand skills on relevant techno logies such as generative AI, data science, AI, and open source.
Ooo 2010/07/01
hadoop

Java
リンク
1台構成のHadoopを30分で試してみる(Ubuntu + Cloudera)
(参考) Cloudera社のHadoopパッケージの情報 http://archive.cloudera.com/docs/ 必要なもの・UbuntuやdebianのLinux環境1台(手元ではUbuntu Server 11.04/10.04/9.10/8.04, debian 5あたりで試していますが、他バージョンでも大丈夫だと思います) ・インターネット接続・Sun(Oracle)のJavaパッケージ(aptでインターネットからインストール) ・Cloudera社のCDH3のHadoopパッケージ(aptでインターネットからインストール) 作業手順 1. インストール: Linux環境にて、rootで作業します。 sudo su 1-1. Sun(Oracle)のJavaを入れます。(Sun(Oracle)のものが必要です。) ※ ここで、ubuntu 10や11の人は/etc
Ooo 2010/05/12
hadoop

ubuntu

Java

linux
リンク
Cascading
Please note that all new project news and releases have moved to https://cascading.wensel.net The Cascading Ecosystem is a collection of applications, languages, and APIs for developing data-intensive applications. At the ecosystem core is Cascading, a Java API for defining complex data flows and integrating those flows with back-end systems, and a query planner for mapping and executing logical f
Ooo 2010/05/08
hadoop

Java
リンク
Post by @iteman
Googleの基盤クローン Hadoopについて View more presentations from kzk_mover.
Ooo 2010/04/24
google

hadoop

Java
リンク
Manageability - Open Source Grid and Cluster Computing Frameworks Written in Java
You are here: Home » blog » stuff » Open Source Grid and Cluster Computing Frameworks Written in Java Had a little bit of a conundrum on what title to give this list. I gather that "Grid" and "Cluster" computing would resonate better than "Distributed" and "Parallel". There's been a lot of activity lately in this area, however the one I've keenly interested in are the ones that integrate with Am
Ooo 2010/04/13
grid

cluster

hadoop

java

distributed systems
リンク
1