[B! machinelearning] fcicq's bookmarks

GitHub - mosaicml/composer: Supercharge Your Model Training

fcicq 2022/06/19

faster training. see also bolt

machinelearning


Universal Speech Enhancement With Score-based Diffusion

Universal Speech Enhancement With Score-based Diffusion This is the companion page of UNIVERSE, the universal speech enhancer described in the paper “Universal Speech Enhancement With Score-based Diffusion” by Joan Serrà, Santiago Pascual, Jordi Pons, R. Oguz Araz, and Davide Scaini. To access the paper, click here. In this page you will find basic information about the paper, three sets of speech

fcicq 2022/06/16

arxiv 2206.03065

machinelearning


VOSK Offline Speech Recognition API

Vosk is a speech recognition toolkit. The best things in Vosk are: Supports 20+ languages and dialects - English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino, Ukrainian, Kazakh, Swedish, Japanese, Esperanto, Hindi, Czech, Polish, Uzbek, Korean, Breton, Gujarati. More to come. Works offlin

fcicq 2022/05/16

machinelearning


GitHub - xinntao/Real-ESRGAN: Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

🔥 AnimeVideo-v3 model (small anime video model). Please see [anime video models] and [comparisons] 🔥 RealESRGAN_x4plus_anime_6B for anime images (anime illustration model). Please see [anime_model] 💥 Update online Replicate demo: Online Colab demo for Real-ESRGAN: | Online Colab demo for Real-ESRGAN (anime videos): Portable Windows / Linux / MacOS executable files for Intel/AMD/Nvidia GPU. You can find more information here

fcicq 2022/05/08

machinelearning


GitHub - magenta/mt3: MT3: Multi-Task Multitrack Music Transcription


fcicq 2022/01/03

machinelearning


google-research/scann at master · google-research/google-research

ScaNN (Scalable Nearest Neighbors) is a method for efficient vector similarity search at scale. This code release implements [1], which includes search space pruning and quantization for Maximum Inner Product Search and also supports other distance functions such as Euclidean distance. The implementation is designed for x86 processors with AVX2 support. ScaNN achieves state-of-the-art performance

fcicq 2021/12/25

commercial ver: Vertex AI Matching Engine. image search: mobilenet v2 embedding


OCR and related technologies in the Deep Learning era

Event: [SenseTime Japan × Sansan] Image Processing Study Group https://sansan.connpass.com/event/230636/ Talk title: OCR and related technologies in the Deep Learning era. Speaker: 宮本優一, R&D researcher, DSOC, Technology Division. Twitter: https://twitter.com/SansanRandD

fcicq 2021/12/04


GitHub - CorentinJ/Real-Time-Voice-Cloning: Clone a voice in 5 seconds to generate arbitrary speech in real-time


fcicq 2021/10/30

machinelearning


GitHub - tensorflow/similarity: TensorFlow Similarity is a python package focused on making similarity learning quick and easy.

Tensorflow Similarity offers state-of-the-art algorithms for metric learning along with all the necessary components to research, train, evaluate, and serve similarity and contrastive based models. These components include models, losses, metrics, samplers, visualizers, and indexing subsystems to make this quick and easy. With Tensorflow Similarity you can train two main types of models: Self-supe

fcicq 2021/09/16

machinelearning


GitHub - openai/CLIP: CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image


fcicq 2021/08/19

machinelearning


GitHub - AsuharietYgvar/AppleNeuralHash2ONNX: Convert Apple NeuralHash model for CSAM Detection to ONNX.


fcicq 2021/08/18

MobileNetV3 based https://github.com/KhaosT/nhcalc see also imagehash, ball tree (scikit)


RAMS-Trans: Recurrent Attention Multi-scale Transformer for Fine-grained Image Recognition

fcicq 2021/07/31

machinelearning


NN-512

fcicq 2021/07/18

avx512 required. in golang but compiles to c99. fastest on cpu


Using PyTorch + NumPy? You're making a mistake.

Update: The problem is now fixed in Pytorch but can still happen in tensorflow-keras. Discussion on reddit. Bugs in ML code are notoriously hard to fix - they don’t cause compile errors but silently regress accuracy. Once you have endured the pain and fixed one of these, the lesson is forever etched into your brain, right? Wrong. Recently, an old foe made a comeback - a familiar bug bit me again!

fcicq 2021/04/16

initialize random generator with worker id and epoch
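A minimal sketch of the note above, assuming the goal is a distinct, reproducible seed for every (epoch, worker) pair. The names `worker_seed`, `make_worker_rng`, `base_seed`, and `num_workers` are hypothetical, and the standard-library `random` module stands in for NumPy so the example has no dependencies. In a PyTorch setup the derived seed would typically be applied inside a `worker_init_fn`, with the epoch supplied by the training loop.

```python
import random


def worker_seed(base_seed: int, worker_id: int, epoch: int, num_workers: int) -> int:
    """Derive a 32-bit seed that differs for every (epoch, worker) pair.

    Assumes 0 <= worker_id < num_workers, so epoch * num_workers + worker_id
    is a unique index for each pair.
    """
    return (base_seed + epoch * num_workers + worker_id) % (2 ** 32)


def make_worker_rng(base_seed: int, worker_id: int, epoch: int, num_workers: int) -> random.Random:
    """Build an independent generator for one worker in one epoch."""
    return random.Random(worker_seed(base_seed, worker_id, epoch, num_workers))


if __name__ == "__main__":
    # Hypothetical configuration: 4 workers, 2 epochs.
    for epoch in range(2):
        for worker_id in range(4):
            rng = make_worker_rng(1234, worker_id, epoch, 4)
            print(epoch, worker_id, rng.randint(0, 9))
```

Including the epoch in the seed keeps the augmentation stream from repeating across epochs, while including the worker id keeps workers from drawing identical values within an epoch.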


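Will it be the strongest of 2021!? An explanation of the latest image recognition model, EfficientNetV2 - Qiita

In addition, the number of layers is also included in the search space. Here, the expansion ratio is the factor by which the first Conv in MBConv multiplies the number of channels; it is explained in more detail here. The search maximizes $A\cdot S^w\cdot P^v$, using accuracy $A$, training time per step $S$, and parameter size $P$. Here $w=-0.07, v=-0.05$, and these values were determined experimentally. 1.3.2 EfficientNetV2 architecture: The table below shows the S-size EfficientNetV2 model. Image: "EfficientNetV2: Smaller Models and Faster Training", Tan, M., Le, Q., (2021) For comparison, the architecture of EfficientNet-B0 (i.e. V1) is also shown below. Image: "Ef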

fcicq 2021/04/14

machinelearning
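The search objective quoted in the EfficientNetV2 excerpt above, A·S^w·P^v with w = -0.07 and v = -0.05, can be sketched as a small function. The function name `search_reward` and the sample inputs are hypothetical; the excerpt does not state units for S or P, so the values below are only illustrative.

```python
def search_reward(accuracy: float, step_time: float, param_size: float,
                  w: float = -0.07, v: float = -0.05) -> float:
    """Compute A * S**w * P**v, the quantity the search maximizes.

    Negative exponents mean slower steps and larger models lower the reward.
    """
    return accuracy * step_time ** w * param_size ** v


if __name__ == "__main__":
    # Two hypothetical candidates with equal accuracy and size;
    # the second takes twice as long per training step.
    fast = search_reward(0.80, step_time=1.0, param_size=20.0)
    slow = search_reward(0.80, step_time=2.0, param_size=20.0)
    print(fast > slow)
```

Because the exponents are small in magnitude, accuracy dominates the reward and step time and parameter size act as mild penalties.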


Non-Local Musical Statistics as Guides for Audio-to-Score Piano Transcription

fcicq 2021/03/18

audio2score

machinelearning


NVIDIA L4T Base | NVIDIA NGC

fcicq 2021/01/22

Identifying Similar Images with TensorFlow

fcicq 2021/01/11

machinelearning


GitHub - elcorto/imagecluster: Cluster images based on image content using a pre-trained deep neural network, optional time distance scaling and hierarchical clustering.

fcicq 2021/01/11

VGG16 to 4096 features


Easily converting PyTorch, ONNX, Caffe, OpenVINO (NCHW) models to Tensorflow / TensorflowLite (NHWC) - Qiita

Tags: DeepLearning, Caffe, TensorFlow, PyTorch, ONNX. 1. Introduction: I always churn out slightly niche, tongue-in-cheek articles that aim for the left-center gap. Following the steps in this article, you can ultimately convert U^2-Net, a high-accuracy Semantic Segmentation model built in PyTorch, to TensorFlow Lite. It looks like the figure below. TensorFlow is extremely hard to work with. The latest, most interesting models published every day are almost all implemented in PyTorch, and I constantly think, why won't they implement it in TensorFlow!! The papers' benchma

fcicq 2020/12/06

machinelearning



fcicq's bookmarks on machinelearning (259)
