[B! character-code] nabinnoのブックマーク

nabinno id:nabinno

character-codeに関するnabinnoのブックマーク (14)

GitHub - jagracey/Awesome-Unicode: :joy: A curated list of delightful Unicode tidbits, packages and resources.
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
nabinno 2016/06/17
github

unicode

character-code
リンク
Locale (computer software) - Wikipedia
In computing, a locale is a set of parameters that defines the user's language, region and any special variant preferences that the user wants to see in their user interface. Usually a locale identifier consists of at least a language code and a country/region code. Locale is an important aspect of i18n. General locale settings[edit] These settings usually include the following display (output) fo
nabinno 2016/06/14
locale

posix

character-code
リンク
Why does modern Perl avoid UTF-8 by default?
I wonder why most modern solutions built using Perl don't enable UTF-8 by default. I understand there are many legacy probl ems for core Perl scripts, where it may break things. But, from my point of view, in the 21st century, big new projects (or projects with a big perspective) should make their software UTF-8 proof from scratch. Still I don't see it happening. For example, Moose enables strict a
nabinno 2016/05/01
stack-overflow

perl

character-code

utf-8

perl_unicode
リンク
文字列の表記揺れをUnicode正規化で簡単に解決する方法 - Qiita
ユニコード正規化をすると、半角英数字や機種依存文字などの表記が統一できます。表記ブレが吸収されることで検索性が高まったり、データの比較なども行いやすくなります。正規化の手法にはNFD, NFC, NFKD, NFKCがありますが、その中でもNFKCという次のような正規化を行う方法をコードを交えて紹介します。ウ゛ェ → ヴェＡＢＣ → ABC ① → 1 ㊤ → 上 Ⅲ → III ㌶ → ヘクタールﾊﾝｶｸｶﾅ → ハンカクカナ ﹣ → - ※ 左辺はU+FE63 Small Hyphen-Minus: 小さいハイフンマイナス－ → - ※ 左辺はU+FF0D Fullwidth Hyphen-Minus: 全角ハイフンマイナス動作環境
nabinno 2016/04/28
qiita

ruby

unicode

character-code
リンク
UTF-8-MAC は文字コードかな… | ものかの
nabinno 2016/04/28
hfs-plus

utf-8-mac

character-code

perl

ruby

iconv

normalization-form-composition
リンク
Index
Automatic redirect: http://home.unicode.org/
nabinno 2016/04/27
unicode

character-code
リンク
UAX #15: Unicode Normalization Forms
Summary This annex describes normalization forms for Unicode text. When implementations keep strings in a normalized form, they can be assured that equivalent strings have a unique binary representation. This annex also provides examples, additional specifications regarding normalization of Unicode text, and information about conformance testing for Unicode normalization forms. Status This documen
nabinno 2016/04/27
unicode

unicode-8.0.0

normalization-form-composition

normalization-form-decomposition

character-code
リンク
小形克宏の「文字の海、ビットの舟」 JIS X 0213の改正は、文字コードにどんな未来をもたらすか（8）　番外編：改正JIS X 0213とUnicodeの等価属性／正規化について（下）
小形克宏の「文字の海、ビットの舟」――文字コードが私たちに問いかけるもの特別編25 JIS X 0213の改正は、文字コードにどんな未来をもたらすか（8）　番外編：改正JIS X 0213とUnicodeの等価属性／正規化について（下） ■符号表だけではないUnicodeの存在理由前回は等価属性と正規化について説明したが、これを通してわかることは、Unicodeとは、けっして文字と符号位置の対応を規定しただけのものではないということだ。Unicodeがもくろんでいるのは世界中の文字の符号化だ。しかし本当にそれができたとしても、実装する、つまり仕事に使うためには、符号表だけではソリューションたりえない。なぜなら文字とは、語の中での位置、文脈の種類、書き進める方向等々、さまざまな状況に応じて、思いもかけない変化をするものだからだ。だからこそ文字は生きていると言えるのだし、そうした振る舞い
nabinno 2016/04/27
impress

katsuhiro-ogata

macos

normalization-form-decomposition

hfs-plus

character-code
リンク
でらうま倶楽部 : Emacs24.4の文字コード設定を見直す
2014年10月29日16:09 カテゴリEmacs Emacs24.4の文字コード設定を見直すコード書いとる?? 自分にとって手に馴染んだテキストエディタEmacsが24.4になって一週間。色々環境整備を続けとります。ずっと引き継いでる初期設定ファイルが秘伝のタレみたいになってるのはいくない!! クリーンな環境だいじ!!!! というわけで、今回はEmacs24.4の言語設定に関して整理した内容のメモ。ちなみにEmacs24.4はこちらで解説されているパッチを当ててビルドしたのを使ってます。まずは言語設定の一覧。 set-language-environment ;; 実行環境(OSX Emacs24.4ではJapanese) set-default-coding-systems ;; 以下の値を変更 ;; buffer-file-coding-system ;; default-
nabinno 2016/04/24
emacs

emacs-24.4

utf-8-hfs

utf-8

character-code

hfs-plus
リンク
Which coding system should I use in Emacs?
I have just tried to save a simple *.rtf file with some websites and tips on how to use emacs and I got These default coding systems were tried to encode text in the buffer `notes.rtf': (iso-latin-1-dos (315 . 8216) (338 . 8217) (1514 . 8220) (1525 . 8221)) However, each of them encountered characters it couldn't encode: iso-latin-1-dos cannot encode these: ‘ ’ “ ” .... etc, etc, etc Now what is t
nabinno 2016/04/24
stack-overflow

emacs

coding-system

character-code
リンク
Unicode正規化
正しい並び替えでは、表示は(A)のままですが、間違った並び替えでは、正規結合クラスが互いに等しいMACRONとACUTEを並び替えたため、表示は(B)のように、eの上のアクセント記号の位置が入れ替わってしまいます。正規分解・互換分解ある文字列の正規分解 (Canonical Decomposition) を得るには、まず、それぞれの文字を正規マッピングによって再帰的に、可能な限り、分解します。すなわち、1回分解した後に現れた文字がなおも分解可能であればさらに分解します。分解マッピングがその文字自身である場合は、分解不可能なので、そのままです。しかし、分解しただけでは必ずしも正しい結果が得られません。つまり、結合文字の順序の一意性を保証するため、分解後の文字列に対して正規順序アルゴリズムを適用しなければなりません。このように、正規マッピングによる再帰的分解と、正規順序アルゴリズムによる
nabinno 2016/04/24
character-code

unicode

normalization-form-decomposition

normalization-form-composition
リンク
キャリッジ・リターン - Wikipedia
機械式タイプライター。左にキャリッジ・リターンを行うレバーが付いている。この項目には、一部のコンピュータや閲覧ソフトで表示できない文字（リターン記号）が含まれています（詳細）。キャリッジ・リターン（carriage return）は本来、テレタイプ端末の Baudot Code における制御文字を指す用語で、行末から行頭に戻す復帰コードであって、改行コードを含まない。その後、タイプライターで一行打ち込んだ後で紙を固定するシリンダー（キャリッジ）を次の行の先頭にタイプできるように戻し（リターン）改行する機構（またはその機構を操作するレバー）を「キャリッジ・リターン」と呼んだ。キャリッジ・リターンが電動で力を入れずに操作できるようになったのは、1960年代のスミス・コロナ製電動タイプライターが最初である。電動タイプライターでその操作を行うキーには "carriage return" または
nabinno 2016/03/28
carriage-return

newline

character-code

character
リンク
Carriage return - Wikipedia
nabinno 2014/08/31
carriage-return

newline
リンク
Shift JIS 2バイト目が0x5C等に成りうることによる問題 - Wikipedia
Shift_JIS（シフトジス）は、コンピュータ上で日本語を含む文字列を表現するために用いられる文字コードの一つ。シフトJIS（シフトジス）と表記されることもある[1]。かつてはベンダーによる独自拡張を含む文字コード群を指した曖昧な名称であったが、1997年にJIS X 0208で標準化された。
nabinno 2012/03/17
shift-jis

mojibake

character-encoding

language

japan
リンク
1

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx