[Framework] Vision framework. The application I want to make requires… | by XCoder

[Framework] Vision framework. The application I want to make requires… | by XCoder | Jun, 2023

The Tech Guy June 17, 2023 2 min read

The application I want to make requires computer vision technology, so I searched for it and
found out about the Vision framework, so I briefly summarized it.

The Vision Framework is a framework that was announced along with coreML at WWDC in 2017.

face detection
Ability to track facial landmarks
Like iPhone’s LiveText, the ability to find text in a photo
General image registration, tracking function

Computer vision algorithms can be applied to an image or video to perform various functions such as the above.

openCV is also one of the very popular computer vision frameworks.

It supports several languages, but unfortunately swift is not supported.
However, since the vision framework was developed by Apple, it supports swift.
Also, it’s my guess, but since it was developed by Apple, isn’t it more compatible with swift? 🤔
OpenCV is an external framework, so it needs to be installed separately, but the Vision framework is
import Visionfinished with just one code.

Pipelines are made up of subparts

request

handler

VNImageRequestHandler: Used when working with still images
VNSequenceRequestHandler: Note the sequence. Used when working on image sequences (videos)

result

VNRequestA request made with a subclass of is passed to the handler.
handler can be of VNImageRequestHandlertypeVNSequenceRequestHandler
The processed result VNObservationis returned as a subclass of .

For example, if a type VNRequestthat is a subclass of is received as a request, a type that is a subclass of is returned as a return.VNDetectFaceRectanglesRequestVNObservationVNFaceObservation

For another example, it is a format VNDetectContoursRequestwhere is received as a request and VNContoursObservationis returned as a return.

By XCoder| LinkedIn | Medium | GitHub

Source link