《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
-
Updated
Jul 30, 2024 - Python
Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
Ultralytics YOLO26, YOLO11, YOLOv8 — object detection, instance segmentation, semantic segmentation, image classification, pose estimation, object tracking
We write your reusable computer vision tools. 💜
Learn it. Build it. Ship it for others.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Image-to-Image Translation in PyTorch
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
Datasets, Transforms and Models specific to Computer Vision
Computer Vision Annotation Tool (CVAT) is a leading platform for building high-quality visual datasets for vision AI. It offers open-source, cloud, and enterprise products, as well as labeling services, for image, video, and 3D annotation with AI-assisted labeling, quality assurance, team collaboration, analytics, and developer APIs.
Image annotation with Python. Supports polygon, rectangle, circle, line, point, and AI-assisted annotation.
Face recognition using Tensorflow
An open source implementation of CLIP.
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Node-based Visual Programming Toolbox
A MNIST-like fashion product database. Benchmark 👇
A collaboration friendly studio for NeRFs
Low-code framework for building custom LLMs, neural networks, and other AI models
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.