[B! scikit-learn][statistics] nabinnoのブックマーク

nabinno id:nabinno

scikit-learnとstatisticsに関するnabinnoのブックマーク (22)

load_iris
nabinno 2020/11/08
scikit-learn

python

sklearn.datasets

data-set

sklearn.datasets.load_iris

statistics
リンク
Amazon.co.jp: scikit-learn データ分析実践ハンドブック (Pythonライブラリ定番セレクション): 拓也,毛利, 廣野,北川, 千代子,澤田, 一徳,谷: 本
nabinno 2020/09/07
takuya-mori

scikit-learn

python

statistics

books
リンク
GitHub - drivendataorg/box-plots-sklearn: An implementation of some of the tools used by the winner of the box plots competition using scikit-learn.
nabinno 2020/08/27
github

box-plots-sklearn

scikit-learn

python

statistics
リンク
Plot different SVM classifiers in the iris dataset
nabinno 2020/08/09
scikit-learn

sklearn.svm

svm

decision-boundary

statistical-classification

statistics
リンク
Lasso
Lasso# class sklearn.linear_model.Lasso(alpha=1.0, *, fit_intercept=True, precompute=False, copy_X=True, max_iter=1000, tol=0.0001, warm_start=False, positive=False, random_state=None, selection='cyclic')[source]# Linear Model trained with L1 prior as regularizer (aka the Lasso). The optimization objective for Lasso is: Technically the Lasso model is optimizing the same objective function as the E
nabinno 2020/07/24
scikit-learn

python

sklearn.linear_model

lasso

regression-analysis

statistics
リンク
cross_validate
cross_validate# sklearn.model_selection.cross_validate(estimator, X, y=None, *, groups=None, scoring=None, cv=None, n_jobs=None, verbose=0, fit_params=None, params=None, pre_dispatch='2*n_jobs', return_train_score=False, return_estimator=False, return_indices=False, error_score=nan)[source]# Evaluate metric(s) by cross-validation and also record fit/score times. Read more in the User Guide. Parame
nabinno 2020/07/24
scikit-learn

python

sklearn.model_selection

cross-validation

data-validation

statistics
リンク
cross_val_scoreはもうやめよう！一発で交差検証するにはcross_validateを使う - 静かなる名辞
はじめに scikit-learnで交差検証を行い、評価指標を算出する方法としては、cross_val_scoreがよくオススメされています。実際、「sklearn 交差検証」みたいな検索キーワードでググるとこの関数がよく出てきます。しかし、この関数は複数の評価指標を算出することができず、一つのスコアしか出力してくれません。これでどういうとき困るかというと、Accuracy, Precision, Recall, F1をすべて出したい・・・というとき、困ります。基本的にこれらはぜんぶ出して評価するものという考え方のもと検証しようとすると、うまくいかないのです。その辺りを柔軟に制御するために、これまで私は自分で交差検証のコードを書いてきました。しかし、そんな必要はありませんでした。cross_validateという関数を使えばいいのです。・・・と、大げさに書いてみましたが、実はこの関数
nabinno 2020/07/24
scikit-learn

python

sklearn.model_selection

cross_val_score

cross_validate

cross-validation

data-validation

statistics
リンク
scikit-learnのcross_val_scoreを使って交差検証(Cross Validation)をする方法 - ランダムの森
機械学習モデルを作る時、与えられたデータを全て用いてモデルの学習・精度向上を行うと、そのデータに対してのみ精度の良いモデル(理想のモデルに近づけていない。)が出来上がってしまい、未知のデータに対して適用できなくなってしまいます。そのため通常、データをあらかじめ学習用と検証用に分けておき、学習用データでモデル作成→検証用データでモデルの精度を確かめるという手順でモデリングを進めていきます。さて、上記のように学習データ内で精度の良いモデルを作るのですが、こちらも学習データに特化したモデルを作ってしまうと、いつまでたっても精度の良いモデルができません。(特に学習データが少ない場合。) この問題を解決する手法が交差検証(Cross Validation)です。今回は交差検証の中でも、K-分割交差検証(k-Fold cross validation)について説明します。 K-分割交差検証では学習
nabinno 2020/07/24
scikit-learn

python

sklearn.model_selection

cross_val_score

cross-validation

data-validation

statistics
リンク
mean_squared_error
nabinno 2020/07/23
scikit-learn

python

sklearn.metrics

mean_squared_error

mean-suqared-error

loss-function

mathematical-optimization

decision-theory

statistics
リンク
roc_curve
nabinno 2020/05/08
scikit-learn

python

sklearn.metrics

roc_curve

roc

statistics
リンク
DecisionTreeClassifier
DecisionTreeClassifier# class sklearn.tree.DecisionTreeClassifier(*, criterion='gini', splitter='best', max_depth=None, min_samples_split=2, min_samples_leaf=1, min_weight_fraction_leaf=0.0, max_features=None, random_state=None, max_leaf_nodes=None, min_impurity_decrease=0.0, class_weight=None, ccp_alpha=0.0, monotonic_cst=None)[source]# A decision tree classifier. Read more in the User Guide. Par
nabinno 2020/05/06
scikit-learn

python

sklearn.tree

decisiontreeclassifier

decision-tree

decision-tree-learning

statistical-classification

machine-learning

statistics

analytics
リンク
Python機械学習プログラミング達人データサイエンティストによる理論と実践 (impress top gear) : Sebastian Raschka, 株式会社クイープ, 福島真太朗 : 本 : Amazon.co.jp
nabinno 2019/11/17
sebastian-raschka

python

statistics

machine-learning

analytics

scikit-learn

matplotlib

jupyter-notebook

books
リンク
f1_score
nabinno 2017/12/07
scikit-learn

python

analytics
リンク
cross_val_score
cross_val_score# sklearn.model_selection.cross_val_score(estimator, X, y=None, *, groups=None, scoring=None, cv=None, n_jobs=None, verbose=0, params=None, pre_dispatch='2*n_jobs', error_score=nan)[source]# Evaluate a score by cross-validation. Read more in the User Guide. Parameters: estimatorestimator object implementing ‘fit’The object to use to fit the data. X{array-like, sparse matrix} of shap
nabinno 2017/06/13
scikit-learn

python

analytics

scikit-learn-0.18.1

sklearn.model_selection
リンク
Building Decision Tree Algorithm in Python with scikit learn - Dataaspirant
Decision Tree Algorithm implementation with scikit learn One of the cutest and lovable supervised algorithms is Decision Tree Algorithm. It can be used for both the classification as well as regression purposes also. As in the previous article how the decision tree algorithm works we have given the enough introduction to the working aspects of decision tree algorithm. In this article, we are going
nabinno 2017/06/02
scikit-learn

decision-tree

supervised-learning

cluster-analysis

statistics

analytics

algorithm

machine-learning
リンク
RandomForest is allocating a lot of memory when using bootstrap option. · Issue #4774 · scikit-learn/scikit-learn
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
nabinno 2017/05/21
github

scikit-learn

python

analytics

random-forest

machine-learning

statistics
リンク
Generalized additive models (GAMs)? · Issue #3482 · scikit-learn/scikit-learn
Hello there, Thanks for making this fantastic library. I use it every day in my bioinformatics research. We're developing a toolkit for single-cell RNA-seq analysis (http://github.com/yeolab/flotilla) and want to add all current state-of-the-art analyses. Unfortunately, most of these are in R. I can reimplem ement some of them, but they rely on certain R packages, in particular VGAM, aka Vector Gen
nabinno 2017/05/21
github

scikit-learn

python

analytics

generalized-additive-model

linear-algebra

statistics
リンク
[MRG+1]: Add decision tree plotting. by nvdv · Pull Request #6380 · scikit-learn/scikit-learn
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
nabinno 2017/05/21
github

scikit-learn

python

analytics

decision-tree

chart

data-visualization

statistics
リンク
https://qiita.com/gash717/items/cea49826f9bd023f2184
- 5 users
- qiita.com
- 学び
nabinno 2017/05/21
qiita

scikit-learn

python

analytics

sparse-matrix

matrix

linear-algebra

statistics
リンク
scikit-learn で最低限の自作推定器(Estimator)を実装する - Qiita
やりたいこと scikit-learn はPythonのほぼデファクトの機械学習ライブラリです．scikit-learnの利点としては多くのアルゴリズムが実装されていることもそうですが，一貫した形で設計されており様々なアルゴリズムを共通したかたちで扱えることです．scikit-learnにないアルゴリズムを新たに実装したり，他のライブラリを使用するときにsciki-learnの他の推定器と同様に扱えるよう実装すれば，もともと実装されている推定器同様にクロスバリデーションで性能を評価したりグリッドサーチでパラメータを最適化したりできます．ここでは最低限の推定器の実装を示します．ここでは識別器または回帰器をターゲットとして考えます(クラスタリングとか教師なし学習とかは考えない)．べたな実装 from sklearn.base import BaseEstimator class MyEsti
nabinno 2017/05/21
qiita

scikit-learn

python

estimator

statistics

analytics
リンク
1 2 次のページ

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx