manboubirdのブックマーク / 2015年7月26日

manboubird id:manboubird

2015年7月26日のブックマーク (22件)

ANNIE on Broadway: Maybe
manboubird 2015/07/26
Money Makes the World Go Round

musical

cabaret
リンク
How to find a startup worth working for – Mattermark
manboubird 2015/07/26
startup
リンク
GitHub - CommBank/grimlock: library for performing data-science and machine learning related data preparation, aggregation, manipulation and querying tasks at scale using Scalding or Spark
Overview The grimlock library can be used for performing data-science and machine learning related data preparation, aggregation, manipulation and querying tasks. It makes it easy to perform such tasks as: Normalisation/Standardisation/Bucketing of numeric variables; Binarisation of categorical variables; Creating indicator variables; Computing statistics in all dimensions; Generating data for a v
manboubird 2015/07/26
commBank

grimlock

scalding

Spark

dataScience

tool
リンク
Distributable and organized dotfiles with homeshick and mr
manboubird 2015/07/26
bash

dotfile

mr
リンク
fish shell
Finally, a command line shell for the 90s fish is a smart and user-friendly command line shell for Linux, macOS, and the rest of the family. fish will delight you with features like tab completions and syntax highlighting that just work. It will make you exclaim "Of course fish does that!".
manboubird 2015/07/26
fishshell
リンク
GitHub - CommBank/humbug: Humbug is a very simple scrooge replacement for Thrift structs with more than 254 fields
manboubird 2015/07/26
humbug

commBank

thrift

scrooge
リンク
GitHub - CommBank/thermometer: A micro-test framework for scalding pipes to make sure you don't get burnt
manboubird 2015/07/26
scalding

commBank

testing

hive
リンク
SQL style guide by Simon Holywell
Overview General Do Avoid Naming conventions General Tables Columns Aliasing or correlations Stored procedures Uniform suffixes Query syntax Reserved words White space Indentation Preferred formalisms Create syntax Choosing data types Specifying default values Constraints and keys Designs to avoid Appendix Reserved keyword reference Column data types SQL style guide Overview You can use this set o
manboubird 2015/07/26
sql

styleGuide
リンク
検索エンジンで上位表示をさせるためのSEO対策が、最近ではただの精神鍛錬になりつつあるという話。長文+追記が現時点では最強です。 - SONOTA
photo by The Logo Smith 最近、検索エンジンで上位表示をさせるためのSEO対策が、ただの精神鍛錬になりつつあるなーという感じがしています。この理由は単純な話で、とにかく文字数の多い長文さえ書ければ上位表示できてしまう実情があるため。記事内容がどんなにつまらなくても、たとえ単調な説明が延々と続くだけの記事であったとしても、更には「そんな情報ほしがってる人いる？」というものが書いてあったとしても、内容に一貫性のある長文であれば検索エンジンに評価されてしまうのです。読んでいて面白い短文：検索に弱い読んでいて面白くない長文：検索に強い長文が検索に強いという例：良い例が下記の情報ですよね（記事中に実証実験があります）。実際にこういう思考回路＆テクニックを駆使さえすれば、検索上位を占拠できてしまう状況があるのです。書籍化＆３０万PV達成したので、ブログのアクセス増やす
manboubird 2015/07/26
seo

google
リンク
「日本軍のインテリジェンス　なぜ情報が活かされないのか」小谷賢著
太平洋戦争における日本は通信を傍受され、暗号を解読され、偽情報に撹乱され、連合軍の兵力を見誤り、情報分析を疎かにして慢心と理想論とで作戦を立てて失敗を繰り返し・・・と情報戦で完敗したが、本書は戦前日本の情報活動はどのようなものだったのか、どこに問題があったのかを概観した一冊である。基本的な用語がおさえてあるのでインテリジェンス入門書として有用だ。生情報やデータが「インフォメーション」、「インフォメーション」を分析・加工した情報が「インテリジェンス」で、「インテリジェンスの本質は、無数のデータから有益な情報を抽出、加工することによって政策決定サイドに『政策を企画・立案及び遂行するための知識』を提供することにある」（P7）。国益・国家戦略に基づく情報要求「リクワイアメント」が政策・作戦サイドから情報収集・分析（インテリジェンス）サイドに出され、これに対してインテリジェンスサイドは多様な情報を
manboubird 2015/07/26
book

review

war

japanese

intelligence
リンク
『権力の終焉』が、競争のルールを書き換える - HONZ
- 36 users
- honz.jp
- 学び
国民国家システムは終焉を迎えつつあり、国家権力は衰退していると多くの書籍は言う。その一方で新聞やネットには、強行採決、右傾化というキーワードが飛び交う。資本主義システムでは経済の格差が広がる一方であることが示される傍らで、政治権力における格差の縮小傾向を示す様々な出来事も起こっている。これらは一体何を意味するのだろうか？スマホやソーシャルメディアといったITの進化によって、権力が大きなものから小さなものへとシフトする。それ自体は、疑いの余地もないことである。だがそれだけで全てを片付けるのは、少し部分的で表面的であるのかもしれない。権力とは、ほかの集団や個人の現在または将来の行動を命令したり、阻んだりする能力のことを指す。誰だって権力を手中に収め、世界を思うがままにしてみたいと願望はあるだろう。しかし、苦労して権力を手にしたところで、その力を行使できる範囲は限定的なものなっているーー
manboubird 2015/07/26
book

review
リンク
Avro Backed Hive Table in CDH5 (or CDH4.5): Encountered AvroSerdeException determining schema
You are using Hive and some of your tables are backed by the Avro Serde. You are seeing a lot of this: WARN avro.AvroSerdeUtils: Encountered AvroSerdeException determining schema. Returning signal schema to indicate probl em org.apache.hadoop.hive.serde2.avro.AvroSerdeException: Neither avro.schema.literal nor avro.schema.url specified, can't determine table schema Sometimes this WARN becomes an ER
manboubird 2015/07/26
avro

hive

troubleshooting
リンク
https://docs.cloudera.com/documentation/enterprise/latest.html?scroll=concept_h5r_mtb_gk_unique_1
manboubird 2015/07/26
avro

cdh
リンク
Cloudera Blog
In an era where artificial intelligence (AI) is reshaping enterprises across the globe—be it in healthcare, finance, or manufacturing—it’s hard to overstate the transf ormation that AI has had on businesses, regardless of industry or size. At Cloudera, we recognize the urgent need for bold steps to harness this potential and dramatically accelerate the time to […] Read blog post
manboubird 2015/07/26
cloudera

avro

recordBreaker
リンク
LiveReload(Python)のshellが便利すぎる
LiveReload-0.15から使えるようになったshellが便利。例えばHTML書くのにSass(Compass)+Jade+CoffeeScriptという組み合わせを使っているとそれぞれwatchしてコンパイルすることになるので端末を3つ無駄にするが、Livereloadでコマンドを定義するのもちょっと面倒。そんな時にはshellを使うといい #!/usr/bin/env python from livereload.task import Task from livereload.compiler import shell Task.add('sass/*.scss', shell('compass compile')) Task.add('index.jade', shell('jade index.jade'))
manboubird 2015/07/26
python

livereload
リンク
Python Livereload を使って Sphinx でドキュメントを書く - Qiita
Sphinx でドキュメントを書いているとソースファイル (reST) を HTML に変換するために make html コマンドを実行することが多いと思います。しばらく書いてみると、ソースファイルを変更する度にコマンドを実行して HTML を確認するためにブラウザをリロードして ... という操作は煩雑に思えてきます。 Python LiveReload を使うとファイルの変更を検知してブラウザを自動的にリロードできるようになります。設定スクリプトは Python で記述しますので、Python に関するドキュメントを書いている場合は頭の切り替えが要らない点がメリットと言えます。バージョンによって設定方法が違っており、各種 API も変わっています。バージョン 2.2.1 からはそろそろ Python 3.x でも使えるようになってきましたので、導入方法をまとめておきます。 dire
manboubird 2015/07/26
python

liveload

sphinx
リンク
Throttle Concurrent Builds
Throttling of runs by one or multiple category Throttling of multiple runs of the same AbstractProject job (not recommended) Throttling of runs by parameter values For each mode it is possible to setup global, label-specific, and node-specific limits for concurrent runs. If multiple throttling categories are defined, each requirement needs to be satisfied in order for the task to be taken off the
manboubird 2015/07/26
jenkins

plugin

lock
リンク
Exclusion
manboubird 2015/07/26
jenkins

plugin

lock
リンク
シェルスクリプト多重起動の防止（flock(2) を用い、極力可搬に） - Qiita
シェルスクリプトにおけるアドバイザリロックの実現方法にはいくつかありますが、ロックファイルの有無では、ファイルの消し損ねがよく起きてしまいます。また、pgrep(1) を用いる方法もよく見られますが、同じパスを含むコマンドが実行中であることは多く（特に、エディタで同スクリプトを編集中である場合など）、これまた確実性に欠きます。最終的にベストの方法は、flock(2) システムコールを使う方法のようです（レンタルサーバなどでは使えないこともあるようですが、まー、今どきならコンテナ環境とか安いですし…）。flock(1) コマンドが使える環境であれば、待ち・ノンブロックの選択、排他ロックの他に参照の共有ロックもできて便利ですが、当座は多重起動を防止したいだけですので、ノンブロックの排他ロックを用います（参考: 「~/bin - using flock to protect critical
manboubird 2015/07/26
flock

lock
リンク
flock(1)でシェルスクリプトの排他制御（Linux） - Qiita
シェルスクリプトで排他制御したいケースがたまにあります。「多重起動の抑制」「二重起動の禁止」などとされることもあります。よくやるのは pid ファイルを作成したりロックファイルを作成したりだと思いますが、「ファイルの有無」とか「プロセスIDのチェック」がどうも不安定に感じていました。ファイルシステムの排他ロックとか簡単に使えればいいのに…！調べてみると、flock(1) なんていう便利なものがあることがわかりました。 http://linux.die.net/man/1/flock http://luna2-linux.blogspot.jp/2011/08/flock.html 第二版あえて、「singleton.sh を source することでスクリプトを排他制御する」という構成にこだわる。 # 利用者はこのファイルを source します _LOCK_FILE=$0.lock
manboubird 2015/07/26
linux

flock

lock

bash
リンク
GitHub - CommBank/parlour: A Cascading-Sqoop Integration
manboubird 2015/07/26
sqoop

cascading
リンク
Discover opportunities at Pinterest
Pinterest is an equal opportunity employer and makes employment decisions on the basis of merit. We want to have the best qualified people in every job. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other characteristic under federal,
manboubird 2015/07/26
secor

pinterest

Kafka
リンク
- 2015年7月27日
- 2015年7月26日
- 2015年7月25日