pythonで疎な文書ベクトルの類似度や距離を計算をするメモ Scipyで疎行列を使う際の基本的な操作について 書いたのが昔なので、どっか間違ってるかも import scipy.sparse as sp import numpy as np a = sp.lil_matrix((1, 10000)) # 1*10000の疎行列が作成される b = sp.lil_matrix((1, 10000)) # a.shape => (1, 10000) for i in xrange(a.shape[1]): r = np.random.rand() if r < 0.9: r = 0.0 a[0, i] = r # aの各要素にrandomで数値を格納した a # => <1x10000 sparse matrix of type '<type 'numpy.float64'>' with 9
![Scipyでの疎行列の扱い - Qiita](https://cdn-ak-scissors.b.st-hatena.com/image/square/10b67e7c057f8c5fd210537ed50645142782cd12/height=288;version=1;width=512/https%3A%2F%2Fqiita-user-contents.imgix.net%2Fhttps%253A%252F%252Fcdn.qiita.com%252Fassets%252Fpublic%252Farticle-ogp-background-9f5428127621718a910c8b63951390ad.png%3Fixlib%3Drb-4.0.0%26w%3D1200%26mark64%3DaHR0cHM6Ly9xaWl0YS11c2VyLWNvbnRlbnRzLmltZ2l4Lm5ldC9-dGV4dD9peGxpYj1yYi00LjAuMCZ3PTkxNiZoPTMzNiZ0eHQ9U2NpcHklRTMlODElQTclRTMlODElQUUlRTclOTYlOEUlRTglQTElOEMlRTUlODglOTclRTMlODElQUUlRTYlODklQjElRTMlODElODQmdHh0LWNvbG9yPSUyMzIxMjEyMSZ0eHQtZm9udD1IaXJhZ2lubyUyMFNhbnMlMjBXNiZ0eHQtc2l6ZT01NiZ0eHQtY2xpcD1lbGxpcHNpcyZ0eHQtYWxpZ249bGVmdCUyQ3RvcCZzPTM3MGJkMTc1YmJjMmY2MThiOTQwYTY4YmFjNzhhM2Mz%26mark-x%3D142%26mark-y%3D112%26blend64%3DaHR0cHM6Ly9xaWl0YS11c2VyLWNvbnRlbnRzLmltZ2l4Lm5ldC9-dGV4dD9peGxpYj1yYi00LjAuMCZ3PTYxNiZ0eHQ9JTQwcGV0aXR2aW9sZXQmdHh0LWNvbG9yPSUyMzIxMjEyMSZ0eHQtZm9udD1IaXJhZ2lubyUyMFNhbnMlMjBXNiZ0eHQtc2l6ZT0zNiZ0eHQtYWxpZ249bGVmdCUyQ3RvcCZzPThlNjI5ZDA4M2JlYmMxZDYwNzRlYzcxMDFlM2FiZTdk%26blend-x%3D142%26blend-y%3D491%26blend-mode%3Dnormal%26s%3D4b45572eaf44404db34143c18a8423c3)