[DeepLearning]Tools, Frameworks, Examples & Papers

keywords: DeepLearning, Tools, Frameworks, Examples & Papers

Architecture

RegNet

Nonrigid image registration using multi-scale 3D convolutional neural networks
https://github.com/hsokooti/RegNet

Codebase for Image Classification Research, written in PyTorch.
https://github.com/facebookresearch/pycls

Pytorch implementation of network design paradigm described in the paper “Designing Network Design Spaces”
https://github.com/signatrix/regnet

Facebook AI RegNet Models Outperform EfficientNet Models, Run 5x Faster on GPUs
https://medium.com/syncedreview/facebook-ai-regnet-models-outperform-efficientnet-models-run-5x-faster-on-gpus-7bdc3ea577ae

Platform

PyTorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration.
https://github.com/pytorch/pytorch

Fine-tune pretrained Convolutional Neural Networks with PyTorch
https://github.com/creafz/pytorch-cnn-finetune

Colossal

Colossal-AI: A Unified Deep Learning System for Large-Scale Parallel Training
https://github.com/hpcaitech/ColossalAI

Sources

Comprehensive Library

Open Source Computer Vision Library
https://github.com/opencv/opencv

A collection of various deep learning architectures, models, and tips
https://github.com/rasbt/deeplearning-models

Machine learning, in numpy
https://github.com/ddbourgin/numpy-ml

Virginia Tech Vision and Learning Lab
https://github.com/vt-vl-lab

scikit-learn: machine learning in Python
https://scikit-learn.org/
https://github.com/scikit-learn/scikit-learn

Cross-platform, customizable ML solutions for live and streaming media. Face Detection, Face Mesh, Iris, Hands, Pose, Holistic.
https://github.com/google/mediapipe

Management Tools

Fastest unstructured dataset management for TensorFlow/PyTorch. Stream data real-time & version-control it.
https://github.com/activeloopai/Hub

This repository includes optimized deep learning models and a set of demos to expedite development of high-performance deep learning inference applications. Use these free pre-trained models instead of training your own models to speed-up the development and production deployment process.
https://github.com/openvinotoolkit/open_model_zoo

This toolkit allows developers to deploy pre-trained deep learning models through a high-level C++ Inference Engine API integrated with application logic.
https://github.com/openvinotoolkit/openvino

Recommender System

An open source recommender system service written in Go.
https://github.com/zhenghaoz/gorse

Shape Representation

Learning Continuous Signed Distance Functions for Shape Representation
https://github.com/facebookresearch/DeepSDF

This repository contains the code for the paper “Occupancy Networks - Learning 3D Reconstruction in Function Space”
https://github.com/autonomousvision/occupancy_networks

3D Data

PyTorch3D is FAIR’s library of reusable components for deep learning with 3D data
https://github.com/facebookresearch/pytorch3d

Object Detection

Detectron2 is FAIR’s next-generation research platform for object detection and segmentation.
https://github.com/facebookresearch/detectron2

This project provides the implementation for DetNAS: Backbone Search for Object Detection.
https://github.com/megvii-model/DetNAS

Face Recognition

Deepfakes Software For All
https://github.com/deepfakes/faceswap

An open source library for face detection in images. The face detection speed can reach 1500FPS.
https://github.com/ShiqiYu/libfacedetection

DeepFaceLab is the leading software for creating deepfakes.
https://github.com/iperov/DeepFaceLab

Image Recognition

Deep learning model for recognizing puzzle patterns in The Witness.
https://github.com/wandb/witness
I trained a robot to play The Witness
https://www.wandb.com/articles/i-trained-a-robot-to-play-the-witness
Is a Realistic Honey Simulation Possible?
https://www.youtube.com/watch?v=7SM816P5G9s

Fake Detector

FaceForensics++: Learning to Detect Manipulated Facial Images
https://github.com/ondyari/FaceForensics
https://arxiv.org/pdf/1901.08971.pdf

CenterFace(size of 7.3MB) is a practical anchor-free face detection and alignment method for edge devices.
https://github.com/Star-Clouds/CenterFace

Image Augmentation

Image augmentation for machine learning experiments.
https://github.com/aleju/imgaug

Depth-Aware video frame INterpolation (DAIN)

Depth-Aware Video Frame Interpolation (CVPR 2019)
https://github.com/baowenbo/DAIN

Gesture Recognizer

Gesture recognition via CNN. Implemented in Keras + Tensorflow/Theano + OpenCV
https://github.com/asingh33/CNNGestureRecognizer

基于卷积神经网络的数字手势识别安卓APP，识别数字手势0-10（The number gestures recognition Android APP based on convolutional neural network(CNN), which can recognize the gestures corresponding number 0 to 10）
https://github.com/tz28/Chinese-number-gestures-recognition

Real-time Hand Gesture Recognition with PyTorch on EgoGesture, NvGesture and Jester
https://github.com/ahmetgunduz/Real-time-GesRec

Image & Video Classification

An end-to-end PyTorch framework for image and video classification
https://github.com/facebookresearch/ClassyVision

Turi Create simplifies the development of custom machine learning models.
https://github.com/apple/turicreate

Image Generataion

Progressive Growing of GANs for Improved Quality, Stability, and Variation
https://github.com/tkarras/progressive_growing_of_gans

Hacking

A small course on exploiting and defending neural networks
https://github.com/Kayzaks/HackingNeuralNetworks

AI Path Tracer Denoiser

This project is to create a CUDA accelerated Deep learning approach to denoise renders from a path tracer
https://github.com/Black-Phoenix/Ai-Path-Tracer-Denoiser

Inpainting

[CVPR 2020] 3D Photography using Context-aware Layered Depth Inpainting
https://github.com/vt-vl-lab/3d-photo-inpainting

NVIDIA Merlin is a framework for building high-performance, deep learning-based recommender systems.
https://developer.nvidia.com/nvidia-merlin
Announcing NVIDIA Merlin: An Application Framework for Deep Recommender Systems
https://devblogs.nvidia.com/announcing-nvidia-merlin-application-framework-for-deep-recommender-systems/

Pose Estimation

COCO-WholeBody dataset is the first large-scale benchmark for whole-body pose estimation.
https://github.com/jin-s13/COCO-WholeBody

GCN (Graph Convolutional Networks)

Graph Convolutional Networks in PyTorch
https://github.com/tkipf/pygcn

Graph Convolution Network for PyTorch
https://github.com/dragen1860/GCN-PyTorch

Hierarchical Task Network (HTN)

Hierarchical task network
https://en.wikipedia.org/wiki/Hierarchical_task_network

Data Loading

The NVIDIA Data Loading Library (DALI) is a GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
https://github.com/NVIDIA/DALI

Papers

Shape Representation

DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation
http://openaccess.thecvf.com/content_CVPR_2019/html/Park_DeepSDF_Learning_Continuous_Signed_Distance_Functions_for_Shape_Representation_CVPR_2019_paper.html

Occupancy Networks: Learning 3D Reconstruction in Function Space
https://avg.is.tuebingen.mpg.de/publications/occupancy-networks

Physical Animation

Try W&B to track machine learning experiments and visualize results
https://www.wandb.com/papers
AI Learns To Compute Game Physics In Microseconds
https://www.youtube.com/watch?v=atcKO15YVD8

Hardware Architecture

Large-scale Deep Unsupervised Learning using Graphics Processors
http://robotics.stanford.edu/~ang/papers/icml09-LargeScaleUnsupervisedDeepLearningGPU.pdf

Fast Support Vector Machine Training and Classification on Graphics Processors
https://www2.eecs.berkeley.edu/Pubs/TechRpts/2008/EECS-2008-11.pdf

HeteroSpark: A Heterogeneous CPU/GPU Spark Platform for Machine Learning Algorithms
https://www.researchgate.net/profile/Peilong_Li/publication/282865411_HeteroSpark_A_Heterogeneous_CPUGPU_Spark_Platform_for_Machine_Learning_Algorithms/links/56205bd908aed8dd19404816/HeteroSpark-A-Heterogeneous-CPU-GPU-Spark-Platform-for-Machine-Learning-Algorithms.pdf

Transformations

Self-learning Transformations for Improving Gaze and Head Redirection
https://ait.ethz.ch/projects/2020/STED-gaze/
https://arxiv.org/abs/2010.12307
https://arxiv.org/pdf/2010.12307.pdf

3D Scenes

Pose2Room: Understanding 3D Scenes from Human Activities
https://yinyunie.github.io/pose2room-page/

Racing Game

Game AI: Simulating Car Racing Game by Applying Pathfinding Algorithms
http://www.ijmlc.org/papers/82-A1090.pdf

Reinforcement Learning for a Simple Racing Game
https://web.stanford.edu/class/aa228/reports/2018/final150.pdf

Blogs

Comprehensive

Research in human-centered AI, deep learning, autonomous vehicles & robotics at MIT and beyond.
https://lexfridman.com/

Math

Using neural networks to solve advanced mathematics equations
https://ai.facebook.com/blog/using-neural-networks-to-solve-advanced-mathematics-equations/

Tutorials

Youtube Tutorials

Neural Network Architectures
https://www.youtube.com/watch?v=oJNHXPs0XDk

MindsDB is a predictive AI layer for existing databases, making it easy for organizations to apply machine learning to their own data.
https://www.youtube.com/channel/UC5_wBOLCWath6q1iTgPPD5A
Machine Learning in one line of code
https://github.com/mindsdb/mindsdb

OpenAI Plays Hide and Seek…and Breaks The Game!
https://www.youtube.com/watch?v=Lu56xVlZ40M

Books

PyTorch

Deep Learning for Coders with fastai and PyTorch: AI Applications Without a PhD 1st Edition (July 14, 2020)
https://www.amazon.com/Deep-Learning-Coders-fastai-PyTorch/dp/1492045527

Deep Learning with PyTorch 1st Edition (June 9, 2020)
https://www.amazon.com/Deep-Learning-PyTorch-Eli-Stevens/dp/1617295264

Commercial Product

Code Review

Write better code with the knowledge of the global development community.
https://www.deepcode.ai/

Voice

Instantly Transform Any Text Into A 100% Human-Sounding Voiceover with only 3 clicks!
https://speechelo.com/

Visual

An A.I. agent for visual tasks
https://deepai.org/zendo

Digital Creative

Runway is a new kind of creative suite. One where AI is a collaborator and anything you can imagine can be created.
https://runwayml.com/

Stable Diffusion is a deep learning, text-to-image model released in 2022. https://stability.ai/

Completions

AI Completions. Never Code Alone.
https://www.tabnine.com/

Twitter Accounts

Research Scientist at @GoogleAI in the Brain Team. Deep Learning with Graphs.
https://twitter.com/thomaskipf

maths, visualisations, conversational AI. lead scientist @Poly_AI, previously @GoogleAI, PhD @Cambridge_Uni. living in Singapore.
https://twitter.com/matthen2

Dataset

Dance Motion

This repo contains starter code for using the AIST++ dataset.
https://github.com/google/aistplusplus_api
https://google.github.io/aistplusplus_dataset/

It was inevitable: the scent of bitter almonds always reminded him of the fate of unrequited love. ― Gabriel Garcia Marquez

[DeepLearning]Tools, Frameworks, Examples & Papers

Architecture

RegNet

Platform

PyTorch

Colossal

Sources

Comprehensive Library

Management Tools

Deployment Related

Recommender System

Shape Representation

3D Data

Object Detection

Face Recognition

Image Recognition

Fake Detector

Image Augmentation

Depth-Aware video frame INterpolation (DAIN)

Gesture Recognizer

Image & Video Classification

Image Generataion

Hacking

AI Path Tracer Denoiser

Inpainting

Pose Estimation

GCN (Graph Convolutional Networks)

Hierarchical Task Network (HTN)

Data Loading

Papers

Shape Representation

Physical Animation

Hardware Architecture

Transformations

3D Scenes

Racing Game

Blogs

Comprehensive

Math

Tutorials

Youtube Tutorials

Books

PyTorch

Commercial Product

Code Review

Voice

Visual

Digital Creative

Completions

Social Accounts

Twitter Accounts

Dataset

Dance Motion

Wang Aiguo