no-fluffのブックマーク / 2007年7月15日

no-fluff id:no-fluff

2007年7月15日のブックマーク (14件)

Reading processing instructions and comments with ElementTree
no-fluff 2007/07/15
PIとコメントをElementTreeで扱うには。文書要素の外のPI、コメントにも対応

elementtree

python
リンク
ActiveState Community
At ActiveState, we use the Platform to build not only our popular open source language distributions, but also custom runtimes for our enterprise clients (i.e. builds containing just the language and packages their proje…
no-fluff 2007/07/15
ActiveState XSLT cookbook

xslt

cookbook
リンク
nonstandard XML character entities?
no-fluff 2007/07/15
ElementTreeで定義済み文字参照以外の実体参照を扱うには。htmlentitidefs使用。

python

elementtree
リンク
Beautiful Soup: We called him Tortoise because he taught us.
You didn't write that awful page. You're just trying to get some data out of it. Beautiful Soup is here to help. Since 2004, it's been saving programmers hours or days of work on quick-turnaround screen scraping projects. Beautiful Soup is a Python library designed for quick turnaround projects like screen-scraping. Three features make it powerful: Beautiful Soup provides a few simple methods and
no-fluff 2007/07/15
python

xml

html

beautifulsoup
リンク
98168 - (doe) <xsl:text disable-output-escaping="yes"> not working
Shared components used by Firefox and other Mozilla software, including handling of Web content; Gecko, HTML, CSS, layout, DOM, scripts, images, networking, etc. Issues with web page layout probably go here, while Firefox user interface issues belong in the Firefox product. (More info)
no-fluff 2007/07/15
Firefoxはdisable-output-escapingが使えない

firefox

xsl

xslt
リンク
Python and HTML Processing
Abstract Various Web surfing tasks that I regularly perform could be made much easier, and less tedious, if I could only use Python to fetch the HTML pages and to process them, yielding the information I really need. In this document I attempt to describe HTML processing in Python using readily available tools and libraries. NOTE: This document is not quite finished. I aim to include sections on u
no-fluff 2007/07/15
あとで読む
リンク
The ElementSoup Module
Fredrik Lundh | August 2006 The ElementSoup module is a (slightly experimental) wrapper for Leonard Richardson’s robust BeautifulSoup HTML parser, which turns the BeautifulSoup data structure into an element tree. The resulting combo is similar to ElementTidy, but a lot less picky. And therefore, a lot more practical. Which is good. Code (latest versions):ElementSoup.py [history] BeautifulSoup.py
no-fluff 2007/07/15
ElementTree + BeautifulSoup

python

xml

html

elementtree

beautifulsoup

elementsoup
リンク
Python/XML Libraries
XML package for Python The PyXML package is a collection of libraries to process XML with Python. It contains, among other things xmlproc: a validating XML parser. Expat: a fast non-validating parser. sgmlop: a C helper module that can speed-up xmllib.py and sgmllib.py by a factor of 5. PySAX: SAX 1 and SAX2 libraries with drivers for most of the parsers. 4DOM: A fully compliant DOM Level 2 implem
no-fluff 2007/07/15
あとで読む
リンク
encoding="UTF-16" ってUTF-16で書かれても。
参考書の受け売りでスイマセン。O'Reilly から出ている "XML in a nutshell" というリファレンス本に、XML パーサのエンコーディング想定処理についての記述がありました。以下、拙訳ですが、当該部分の引用です。 -- * -- * -- (略) XML パーサは、文書の最初の数バイトからその文字セットの想定を試みるだろう。パーサによる基本的なチェックは以下の処理を含む：・もし最初の２バイトが #xFEFF の場合、パーサはそのバイト列をユニコードのバイトオーダーマーク(BOM)と認識するであろう。そして、その文章はビッグエンディアンのユニコードの UCS-2 (訳注; UTF-16)で記述されていると想定される。その知識に基づいて、残りの文章を読み進める。・もし最初の２バイトが #xFFFE の場合、パーサはそのバイト列をユニコードのバイトオーダーマーク(BOM)
no-fluff 2007/07/15
UTF-16な文書のXML宣言はUTF-16で書く？それともUS-ASCII？

xml

encoding
リンク
紫藤のページ
紫藤のページにようこそ！ Lisp, Haskell, Python, Sather などの関数型言語の解説、小品プログラムを載せています。ここでは関数型プログラミング言語をものすごく広い意味で使っており、手続きをデータとして扱える言語を指します。（Python, Sather などは普通はオブジェクト指向言語に分類されます。）また、xyzzy のマクロに関するメモもあります。面白そうな割りに、web 上に文書が少ない題材を取り上げています。リンクはご自由にお張りください。更新ログ [Sep 15, 2014](Python) Set を用いたライフゲームの実装を作成 [Jul 21, 2014]デコレータを用いた bottle.py のアクセスコントロールを作成 [Jul 13, 2014]Maildir のバックアップを作成 [Jun 28, 2014]自宅サーバの
no-fluff 2007/07/15
リンク
ElementTree Overview
In this post, we have explained in detail how to deal with XML files in Python. We will also see some of the complex terms of ElementTree module. What are XML files?Extensible Markup Language (XML) is a file format that is used for the serialization of data, that is, storing, transmitting, and reconstructing arbitrary data, in a format that is both human-readable and machine-readable. As a markup
no-fluff 2007/07/15
python

xml

elementtree
リンク
XML 1.0
拡張可能マークアップ言語 1.0 この文書は、W3Cにより作成されW3C勧告として公開されている "Extensible Markup Language (XML) 1.0" (http://www.w3.org/TR/1998/REC-xml-19980210) を、どら猫本舗が翻訳したものです。最新版の仕様書は http://www.w3.org/TR/REC-xml にあります。正式な仕様書はW3Cサイトにある英語版です。この日本語版は参考にすぎません。この文書には翻訳上の誤りがあるかもしれません。どら猫本舗は翻訳の正確性を保証しません。あくまでご自身の責任でご利用ください。お気付きの点がありましたらどら猫本舗までお知らせください。 REC-xml-19980210 拡張可能マークアップ言語 (Extensible Markup Language) 1.0 W3C勧告 199
no-fluff 2007/07/15
リンク
XSLT 1.0 目次 | 標準情報(TR) TR X 0048:2001
まえがき序文 1 導入 1.0 適用範囲 1.1 概要 2 スタイルシート構造 2.1 XSLT 名前空間 2.2 スタイルシート要素 2.3 スタイルシートとしてのリテラル結果要素 2.4 修飾された名前 2.5 前方互換処理 2.6 スタイルシートの結合 2.6.1 スタイルシートの取込み 2.6.2 スタイルシートのインポート 2.7 スタイルシートの埋込み 3 データモデル 3.1 根ノードの子供 3.2 基底URI 3.3 解析対象外実体 3.4 空白の削除 4 式 5 テンプレート規則 5.1 処理モデル 5.2 パタン 5.3 テンプレート規則の定義 5.4 テンプレート規則の適用 5.5 テンプレート規則の競合解決 5.6 テンプレート規則の上書き 5.7 モード 5.8 組込みテンプレート規則 6 名前付きテンプレート 7 結果木の生成 7.1 要素及び属性の生成 7.
no-fluff 2007/07/15
リンク
XML Path Language 1.0 (XPath 日本語訳)
9月21日に開催された大和インベスター・リレーションズ株式会社主催の「個人投資家向け会社説明会 in 名古屋」のアーカイブ動画と説明資料を掲載しました
no-fluff 2007/07/15
リンク
- 2007年7月16日
- 2007年7月15日
- 2007年7月13日