[B! pig] wyukawaのブックマーク

Hadoop and the Data Scientist

Beginner must-see! A future that can be opened by learning HadoopDataWorks Summit

wyukawa 2012/10/20

このスライドのp16-p20にCUBEの話があるな

リンク

In an era where artificial intelligence (AI) is reshaping enterprises across the globe—be it in healthcare, finance, or manufacturing—it’s hard to overstate the transf ormation that AI has had on businesses, regardless of industry or size. At Cloudera, we recognize the urgent need for bold steps to harness this potential and dramatically accelerate the time to […] Read blog post

wyukawa 2012/10/18

Nested FOREACH and CROSSは地味に重要かも

pig

リンク

[PIG-2353] RANK function like in SQL - ASF JIRA

wyukawa 2012/10/18

RANKオペレーターが実装されそうなのかな

pig

リンク

Introducing the CUBE operator for Apache Pig | Arnab Nandi

Guest post by Prasanth Jayachandran , who has been working on implementing CUBE support for Pig, as part of the large-scale distributed cubing effort. Update: As per Dmitriy’s tweet: …the naive implementation is in. The scala ble count distinct impl is pending 0.11 branching, will go into 0.12. The next version of Apache Pig will support the CUBE operator ( patch available here ). The CUBE operator

wyukawa 2012/10/07

Pig 0.11にCUBE Operatorが入るのね。スケーラブルなCUBE Operatorは0.12からっぽいけど。

pig

リンク

[PIG-2167] CUBE operation in Pig - ASF JIRA

Computing aggregates over a cube of several dimensions is a common operation in data warehousing. The standard SQL syntax is "GROUP relation BY dim1, dim2, dim3 WITH CUBE" – which in addition to all dim1-2-3, produces aggregations for just dim1, just dim1 and dim2, etc. NULL is generally used to represent "all". A presentation by Arnab Nandi describes how one might implement efficient cubing in Ma

wyukawa 2012/10/04

PigにCUBEって入るのかな

pig

リンク

GitHub - LinkedInAttic/datafu: Hadoop library for large-scale data processing, now an Apache Incubator project

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert

wyukawa 2012/10/04

CDH4.1から入ったPigのUDFライブラリのDataFuはLnkedInが作ってるのか。Pigは玄人向けツールなのかも。

pig

リンク

[PIG-1618] Switch to new parser generator technology - ASF JIRA

wyukawa 2012/09/28

0.8まではJavaCCを使っていたけれども0.9からANTLRに変わったのか

pig

リンク

Hive & Pig

Hive & Pig Two ways of doing one thing Or One way of doing two things Ashutosh Chauhan Who am I? • Pig Committer & PMC Member • Hive Committer & PMC Member • Hcatalog Committer & PPMC Member • ASF Member • Software Engineer at HortonWorks Two ways of doing same thing • Both generate map-reduce jobs from a query written in higher level language. • Both frees users from knowing all the little secret

wyukawa 2012/09/18

PDF注意。PigとHiveの比較スライド。バックエンドはPigでフロントエンドがHiveかな。

Hive
Pig

リンク

[PIG-2228] support partial aggregation in map task - ASF JIRA

wyukawa 2012/09/12

pig.exec.mapPartAggをtrueにするとmap aggregationするらしい

pig

リンク

Practical Pig and PigUnit (Michael Noll, Verisign)

This talk was held at the second meeting of the Swiss Big Data User Group on July 16 at ETH Zürich. http://www.bigdata-usergroup.ch/it em/296477

wyukawa 2012/09/08

pigを使った開発フローに関する資料

pig

リンク

Practical Problem Solving with Apache Hadoop & Pig

The document discusses a presentation about practical probl em solving with Hadoop and Pig. It provides an agenda that covers introductions to Hadoop and Pig, including the Hadoop distributed file system, MapReduce, performance tuning, and examples. It discusses how Hadoop is used at Yahoo, including statistics on usage. It also provides examples of how Hadoop has been used for applications like lo

wyukawa 2012/09/06

p158からpigのことが書かれている。最後にはSQLとの対応関係まである。約2年前の資料だけど良い資料ですな。

Hadoop
pig

リンク

Cloudera Blog

Riding the wave of the generative AI revolution, third party large language model (LLM) services like ChatGPT and Bard have swiftly emerged as the talk of the town, converting AI skeptics to evangelists and transf orming the way we interact with techno logy. For proof of this megatrend look no further than the instant success of ChatGPT, […] Read blog post

wyukawa 2012/09/05

Hortonworksにインターンに行った人がPigの性能改善についていろいろやったらしい。Hiveとの性能比較もある。

pig

リンク

GitHub - romainr/PigEditor: Eclipse plugin for Apache Pig

wyukawa 2012/08/31

PigEditorはXtextを使っているようだ

Pig

リンク

PigTools - Apache Pig - Apache Software Foundation

wyukawa 2012/08/31

PigのEclipseプラグインって開発止まっているように見えるな。。。

Pig

リンク

Hadoop Pig の使いどころ - Tech-Sketch

「PigとHive何が違うの？」「Difference between Pig and Hive? Why have both?(PigとHive何が違うの？)」という質問を、先日、StackOverFlowで見かけました。恐らくHadoopを触ると一度は疑問に思う事ではではないでしょうか。 PigとHiveは、共にSQLライクな記法でMapReduceを書けるDSLですが、利用者数においてはHiveに軍配が上がっているようにみえます。一方で、「Pigをもっと早く試せば良かった」というお話を伺うこともあり、有用（かもしれない）ツールであれば、正しく理解しておいた方がよさそうです。というわけで、ここではPigの活用を探ります。 Pigの性能 Pigが今一つ利用されていないのは、SQLとの親和性に加え、性能面で、「Java MapReduce＞Hive＞Pig」という傾向があるからで

wyukawa 2012/08/28

へー、PigってHiveより遅いんだ。ただメタデータが要らないので導入しやすいよな。

リンク

はてなブックマーク

タグ

関連タグで絞り込む (8)

pigに関するwyukawaのブックマーク (16)

お知らせ

今週のはてなブックマーク数ランキング（2024年9月第3週）

今週のはてなブックマーク数ランキング（2024年9月第2週）

月間はてなブックマーク数ランキング（2024年8月）

公式Twitter

キーボードショートカット一覧

はてなブックマーク

公式Twitter

はてなのサービス