[B! scikit-learn][python][analytics] [11ページ] nabinnoのブックマーク

nabinno id:nabinno

scikit-learnとpythonとanalyticsに関するnabinnoのブックマーク (315)

GitHub - scikit-learn-contrib/lightning: Large-scale linear classification, regression and ranking in Python
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
nabinno 2017/08/28
github

scikit-learn

python

analytics
リンク
python sklearn でロジスティック回帰。fit して predict、引数 C で正則化 | コード７区
簡単なデータでロジスティク回帰を試してみます。やるのは２クラスの分類ですが、理論的なことはとりあえず置いといて、 python の scikit-learnライブラリを使ってみます。このライブラリの LogisticRegression というクラスがロジスティック回帰です。メソッド fit、predict、score、属性 coef_、intercept_、パラメータ C を使ってみました。 from sklearn.cross_validation import train_test_split from sklearn.preprocessing import StandardScaler import matplotlib.pyplot as plt import numpy as np np.random.seed(seed=0) X_0 = np.random.multiv
nabinno 2017/08/25
scikit-learn

python

analytics
リンク
scikit-learnとflaskで簡単な機械学習✕Webアプリ - Qiita
簡単な機械学習とWebアプリ Webサイトからの入力に対して、機械学習で作った判定モデルで結果を画面に表示する簡単なデモアプリを作ってみました。アヤメのがく片と花びらの大きさ（縦、横）を入力すると、品種を判定して表示します。コードはこちらにあります。 https://github.com/shibuiwilliam/mlweb 全体像こんな感じです。がく片や花びらを入力するフロントエンドと判定して結果を返すバックエンドという構成です。スーパーシンプルです。言語はPython3.6で、Webにはflaskとwtform、機械学習にはscikit-learnを使っています。使い方開発環境はCentOS7.3とPython3.6です。ライブラリとしてflask、wtform、scikit-learn、Jupyter Notebookを入れています。この辺は概ねAnaconda
nabinno 2017/08/20
qiita

scikit-learn

python

analytics

flask
リンク
KDTree
KDTree# class sklearn.neighbors.KDTree# KDTree for fast generalized N-point probl ems Read more in the User Guide. Parameters: Xarray-like of shape (n_samples, n_features)n_samples is the number of points in the data set, and n_features is the dimension of the parameter space. Note: if X is a C-contiguous array of doubles then data will not be copied. Otherwise, an internal copy will be made. leaf_
nabinno 2017/08/19
scikit-learn

python

analytics
リンク
scikit-learnのk-meansで独自の距離関数を定義する - Qiita
この記事で紹介すること kmeansの距離関数って本当にユークリッド距離大正義なの？ scikit-learnのk-meansって何が動いているの？ scikit-learnのk-meansで距離関数を変更するにはどうするの？ k-means法といえば、みなさんご存知、ぐにょぐにょっとデータを分割してくれるアレですね。教科書にもよく出てくるアレなので、細かい説明は省きますが、いまだに使われてるのは、（禁則事項です）ということですよね。あ、ちなみにこういう動画とかこういうツールとか使うと、k-meansの動きがよくわかってナイスだと思います。で、k-meansではセントロイドと各データとの距離を測るために、「距離関数」が必要になります。教科書的には「ユークリッド距離」一択のことが多いのですが、それって本当に正しいのでしょうか？この論文, Anna Huang, "Similarit
nabinno 2017/08/12
qiita

scikit-learn

python

analytics
リンク
Sklearnのpreprocessingの全メソッドを解説 | 自調自考の旅
概要Pythonで機械学習をするときに使うscikit-learn（恐らく、サイキットラーンと呼びます）には前処理用のライブラリであるpreprocessingが組み込まれています。前処理とは、データの欠損保管や正規化、カテゴリデータの数値化等のことです。このpreprocessingの全メソッドの効果を検証したのが今回の記事です。irisのデータを題材にして説明していきます。 #preprocessingはspという名前で使うこととする import sklearn.preprocessing as sp #お決まりのライブラリをインポート import pandas as pd import numpy as np import matplotlib.pyplot as plt if __name__ == "__main__": #irisデータをdfに格納 df = pd.read
nabinno 2017/08/09
scikit-learn

python

analytics
リンク
機械学習超入門 - Qiita
概要社内の勉強会用に「機械学習って何？どうやって使うの？」というテーマでまとめたものです。この記事の内容が他の方の役に立てたら嬉しいです。機械学習＝ AI？機械学習は人工知能の一分野で、ディープラーニングは機械学習の一分野です。ルールベース多重If文や探査により多彩なパターンを網羅して、複雑な条件でも適切な出力がされるようにプログラムされているもの機械学習データのパターンや特徴を学習し、それをもとに未知のデータに対して何かしらの予測を出力するディープラーニングデータの特徴となる要素の取捨選択を自動で行うことができる、機械学習の手法の一つ強化学習ある環境において、　エージェントが状況を観測しながら行動することを繰り返し試行し、目的を達成するための最適な意思決定を学習するもの Point! ルールベースだと、例外が発生すると人が手でルールを書き換える必要があり、どんどん
nabinno 2017/08/02
qiita

scikit-learn

python

analytics

guide
リンク
LocalOutlierFactor
LocalOutlierFactor# class sklearn.neighbors.LocalOutlierFactor(n_neighbors=20, *, algorithm='auto', leaf_size=30, metric='minkowski', p=2, metric_params=None, contamination='auto', novelty=False, n_jobs=None)[source]# Unsupervised Outlier Detection using the Local Outlier Factor (LOF). The anomaly score of each sample is called the Local Outlier Factor. It measures the local deviation of the densi
nabinno 2017/07/21
scikit-learn

python

analytics

scikit-learn-0.20
リンク
Imputing missing values before building an estimator — scikit-learn 0.18.2 documentation
This documentation is for scikit-learn version 0.18.2 — Other versions If you use the software, please consider citing scikit-learn . Imputing missing values before building an estimator Imputing missing values before building an estimator ¶ This example shows that imputing the missing values can give better results than discarding the samples containing any missing value. Imputing does not always
nabinno 2017/07/03
scikit-learn

python

analytics
リンク
Outlier detection with several methods. — scikit-learn 0.19.2 documentation
Outlier detection with several methods.¶ When the amount of contamination is known, this example illustrates three different ways of performing Novelty and Outlier Detection: based on a robust estimator of covariance, which is assuming that the data are Gaussian distributed and performs better than the One-Class SVM in that case. using the One-Class SVM and its ability to capture the shape of the
nabinno 2017/06/28
scikit-learn

python

analytics

scikit-learn-0.18.2
リンク
One-class SVM with non-linear kernel (RBF)
Note Go to the end to download the full example code. or to run this example in your browser via JupyterLite or Binder One-class SVM with non-linear kernel (RBF)# An example using a one-class SVM for novelty detection. One-class SVM is an unsupervised algorithm that learns a decision function for novelty detection: classifying new data as similar or different to the training set. import numpy as n
nabinno 2017/06/28
scikit-learn

python

analytics

scikit-learn-0.18.2

support-vector-machine
リンク
Comparison of Calibration of Classifiers
Note Go to the end to download the full example code. or to run this example in your browser via JupyterLite or Binder Comparison of Calibration of Classifiers# Well calibrated classifiers are probabilistic classifiers for which the output of predict_proba can be directly interpreted as a confidence level. For instance, a well calibrated (binary) classifier should classify the samples such that fo
nabinno 2017/06/15
scikit-learn

python

analytics

scikit-learn-0.18.1

functional-comparison
リンク
DictVectorizer
DictVectorizer# class sklearn.feature_extraction.DictVectorizer(*, dtype=<class 'numpy.float64'>, separator='=', sparse=True, sort=True)[source]# Transf orms lists of feature-value mappings to vectors. This transf ormer turns lists of mappings (dict-like objects) of feature names to feature values into Numpy arrays or scipy.sparse matrices for use with scikit-learn estimators. When feature values ar
nabinno 2017/06/14
scikit-learn

python

analytics
リンク
Density Estimation for a Gaussian mixture
Note Go to the end to download the full example code. or to run this example in your browser via JupyterLite or Binder Density Estimation for a Gaussian mixture# Plot the density estimation of a mixture of two Gaussians. Data is generated from two Gaussians with different centers and covariance matrices. # Authors: The scikit-learn developers # SPDX-License-Identifier: BSD-3-Clause import matplotl
nabinno 2017/06/14
scikit-learn

python

analytics
リンク
1.12. Multiclass and multioutput algorithms
1.12. Multiclass and multioutput algorithms# This section of the user guide covers functionality related to multi-learning probl ems, including multiclass, multilabel, and multioutput classification and regression. The modules in this section implement meta-estimators, which require a base estimator to be provided in their constructor. Meta-estimators extend the functionality of the base estimator
nabinno 2017/06/14
scikit-learn

python

analytics
リンク
cross_val_score
cross_val_score# sklearn.model_selection.cross_val_score(estimator, X, y=None, *, groups=None, scoring=None, cv=None, n_jobs=None, verbose=0, params=None, pre_dispatch='2*n_jobs', error_score=nan)[source]# Evaluate a score by cross-validation. Read more in the User Guide. Parameters: estimatorestimator object implementing ‘fit’The object to use to fit the data. X{array-like, sparse matrix} of shap
nabinno 2017/06/13
scikit-learn

python

analytics

scikit-learn-0.18.1

sklearn.model_selection
リンク
scikit-learnでtf-idf | mwSoft
scikit-learnでtf-idf 概要 tf-idfを出す用事があったので、scikit-learnで実行してみる。例として宮沢賢治の作品から8作品ほどを青空文庫より取得し、それぞれの作品に対してtf-idf上位10件のワードを抽出する。 Pythonは3.5を利用。mecab-python3が入っていること。 @CretedDate 2016/08/04 @Versions python3.5, mecab0.996, scikit-learn0.17.1 MeCabで名詞だけ分かち書き「オツベルと象」「風の又三郎」「銀河鉄道の夜」「グスコーブドリの伝記」「セロ弾きのゴーシュ」「注文の多い料理店」「よだかの星」「シグナルとシグナレス」の8作品を使う。落としてきたファイルは事前に解凍してUTF-8に変換しておく。下記とかで。 $ find . -name '*.zip' -ex
nabinno 2017/06/03
scikit-learn

python

analytics
リンク
Introduction to Machine Learning in Python with scikit-learn (video series)
Introduction to Machine Learning in Python with scikit-learn (video series) Update from 2021: This video series is now available as a free online course that includes updated content, quizzes, and a certificate of completion. 👉 Click here to enroll! 👈 In the data science course that I teach for General Assem bly, we spend a lot of time using scikit-learn, Python's library for Machine Learning. I
nabinno 2017/06/02
scikit-learn

python

analytics

e-learning
リンク
sklearn.preprocessing.Imputer — scikit-learn 0.21.3 documentation
class sklearn.preprocessing.Imputer(*args, **kwargs)[source]¶ Imputation transf ormer for completing missing values. Read more in the User Guide. missing_values : integer or “NaN”, optional (default=”NaN”) The placeholder for the missing values. All occurrences of missing_values will be imputed. For missing values encoded as np.nan, use the string value “NaN”. strategy : string, optional (default=”
nabinno 2017/06/01
scikit-learn

python

analytics

scikit-learn-0.18.1

sklearn.preprocessing.lmputer

sklearn.preprocessing
リンク
sklearn.decomposition.RandomizedPCA — scikit-learn 0.19.2 documentation
class sklearn.decomposition.RandomizedPCA(*args, **kwargs)[source]¶ Principal component analysis (PCA) using randomized SVD Deprecated since version 0.18: This class will be removed in 0.20. Use PCA with parameter svd_solver ‘randomized’ instead. The new implementation DOES NOT store whiten components_. Apply transf orm to get them. Linear dimensionality reduction using approximated Singular Value
nabinno 2017/05/29
scikit-learn

python

analytics

scikit-learn-0.18.1

sklearn.decomposition.randomized

sklearn.decomposition
リンク
前のページ 7 8 9 10 11 12 13 14 15 16 次のページ

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx