Releases: google-ai-edge/mediapipe
Releases · google-ai-edge/mediapipe
Experimental new framework APIs
- Released experimental new framework APIs
- Updated MediaPipe Objectron to support simultaneous detection of multiple objects
- Bug fixes
- Fixed a crash in the MediaPipe Iris iOS example app
- Fixed memory leak in Android example apps (in ExternalTextureConverter)
- Fixed a file read mode issue in pose_landmark_model_loader.pbtxt.
MediaPipe Holistic, and MediaPipe in Python and JavaScript
- MediaPipe Holistic for real-time simultaneous tracking of human pose, face and hand landmarks
- MediaPipe in Python
- Ready-to-use Python Solution APIs for MediaPipe Face Mesh, Hands, Pose and Holistic
- Try it in MediaPipe on Google Colab
- MediaPipe in JavaScript
- Ready-to-use JavaScript Solution APIs for MediaPipe Face Mesh, Hands, Pose and Holistic
- Try it in MediaPipe on CodePen
- Switch build from C++14 to C++17
Python Solution APIs
- MediaPipe in Python
- Released Python Solution API for MediaPipe Face Mesh and MediaPipe Hands
- Updated Python Solution API for MediaPipe Pose
- Also released usage examples as Google Colabs
- MediaPipe Objectron
- Released a faster two-stage pipeline
- Added support for more object classes: shoe, chair, cup and camera
- Released training dataset, (to be) announced in a Google AI Blog post
- New Tensor class
- Released as a multi-dimensional tensor data container, supporting multiple backends like CPU, Metal buffer, GL buffer and GL texture 2D
- Added new Tensor-based pre-processing, inference and post-processing calculators in mediapipe/calculators/tensor, branched from existing calculators in mediapipe/calculators/tflite
- Most of the (sub-)graph in mediapipe/modules have been updated to use Tensor and the associated calculators. The plan is to fully migrate all in the repo (and deprecate mediapipe/calculators/tflite by end of 2020.
- MediaPipe Hands
- Refactored graphs to depend on the new palm_detection and hand_landmark module.
- Improved model speed for both palm detection and hand landmark.
- Extended the main hand tracking example apps to support multiple hands, to replace the separate multi-hand tracking examples.
- MediaPipe Face Detection
- Refactored graphs to depend on the face_detection module.
3D Face Transform in MediaPipe Face Mesh
- Added 3D Face Transform to MediaPipe Face Mesh
- Enables Face Effect Rendering
- Demonstrated with an Face Effect Example App
- More info in Google Developers Blog post
- Added MediaPipe Models and Model Cards doc
MediaPipe Instant Motion Tracking
- MediaPipe Instant Motion Tracking that performs AR tracking without initialization or calibration.
- MediaPipe BlazePose in Python now uses
pip install mediapipe
instead of building from source. Also published a Colab example. - MediaPipe Iris updated to output a set of 478 3D landmarks, including 468 face landmarks from MediaPipe Face Mesh, with those around the eyes further refined, and 10 additional iris landmarks appended at the end.
MediaPipe Pose
- MediaPipe Pose for upper-body pose tracking
- Preview of Python support that runs MediaPipe Pose in Python interpreter
MediaPipe Iris - Iris tracking and depth estimation
- MediaPipe Iris for iris tracking and single-image depth-from-iris
- Fixed mirrored text rendering in iOS example apps
- Google AI blog post on MediaPipe Iris
- Iris tracking web demo
- Depth estimation from Iris web demo
Automatic provisioning for iOS examples
- Added support for automatic provisioning for building iOS examples
- Fixed sample trace file (#849)
Changes in Z coordinate normalization in 3D landmarks
- TfLiteTensorsToLandmarksCalculator
- It now always normalizes Z coordinates by image width (as for X) when producing NORM_LANDMARKS output, assuming a weak perspective projection camera model.
- The normalize_z option can be used to further normalize Z coordinates by an additional factor. For instance, normalize_z: 0.4 is now used in hand tracking to better account for the Z coordinate distribution in the training data.
- Bug fixes
- Fixes issues on Ubuntu 20.04, resolving #820.
- Documentation update
- Added instructions for building for Nvidia Jetson and Raspberry Pi devices with ARM Ubuntu.
MediaPipe Hand AutoFlip updated, Windows support, Tracing/profiling
- MediaPipe Hand
* Updated model, documentation and pipeline
* Improved hand tracking quality in various scenarios
* Added handedness prediction, e.g. left or right hand.
* See tweet for visualization - MediaPipe Face Detection
* Updated model for front-facing camera
* Added model for back-facing camera, which better handles smaller faces - MediaPipe AutoFlip
* Updated to use face detection model for back-facing camera - Support for running MediaPipe on Windows - experimental
* MediaPipe TFLite CPU desktop live and offline demos only. No TFLite GPU and TensorFlow support yet.
* Compiled by MSVC through Bazel. No MinGW and Cygwin support yet.
* MediaPipe Android apps are NOT buildable on native Windows yet. - Tracing and profiling MediaPipe graphs - experimental
* Support for collecting trace logs on Android, iOS, Linux
* To analyze the trace logs, MediaPipe Visualizer - Community related
* Awesome MediaPipe: curation of code related to MediaPipe
* Slack community for MediaPipe users