91
WebAR development and deployment: cloud-based or serverless?
WebAR development and deployment: cloud-based or serverless?
Ruslan Timchenko Photo
Ruslan Timchenko CV engineer @It-Jim

Enhancing the physical world with virtual content, connecting real life with the digital world, and making that interaction an immersive experience are the reasons for many businesses to turn to extensive usage of augmented reality (AR). In many cases, however, installation of a specific mobile application is required. Would it not be easier and less time-consuming for a user to have AR directly i...

165
Automatic Floor Segmentation Using Computer Vision
Automatic Floor Segmentation Using Computer Vision
Yurii Chyrka Photo
Yurii Chyrka Head of ML @It-Jim

Automatic floor segmentation can serve many interesting purposes including mixed reality (MR) applications, interior design, entertainment, computation of available space in a room, or indoor robot navigation. In this project, we have been solving a problem of scene understanding and, in particular, determining which pixels of the image belong to the floor.   The problem of floor segmentation is...

210
Becoming a Computer Vision Engineer in 2021
Becoming a Computer Vision Engineer in 2021
Daryna Pesina Photo
Daryna Pesina COO @It-Jim

If you want to dig into Computer Vision (CV) but have no idea where to start, this beginner guide is for you. Here we recommend some sources which will come in handy for learning and understanding both the computer vision and deep learning basics.  When you search for a position of computer vision engineer, you’re likely to see that companies are looking for a candidate with: digital image ...

550
iPhone’s 12 PRO LiDAR: How to Get and Interpret Data
iPhone’s 12 PRO LiDAR: How to Get and Interpret Data
Ruslan Timchenko Photo
Ruslan Timchenko CV engineer @It-Jim

Apple events always amaze the entire world and 2020 was not the exception. Apple presented the first mobile devices equipped with LiDAR: iPad Pro 11 and iPhone 12 Pro (and PRO max version). This active sensor measures physical distances to the objects on a spatial two-dimensional grid. Nowadays it is widespread in the automotive area for object detection and collision avoidance. How can developers...

355
4 Ways How Computer Vision Is Deepening the Fashion Industry
4 Ways How Computer Vision Is Deepening the Fashion Industry
Daryna Pesina Photo
Daryna Pesina COO @It-Jim

What is your first thought when you hear about computer vision (CV) in fashion? Or, what is the first thing that pops into your head when you hear about deep learning fashion? Let us guess – online clothing shopping or virtual try-on applications? Well, this might be surprising but deep fashion is not a far future anymore. What’s more, fashionably speaking, the usage of deep learning in the fa...

159
Computer Vision in Healthcare
Computer Vision in Healthcare
Daryna Pesina Photo
Daryna Pesina COO @It-Jim

Artificial intelligence (AI) and machine learning (ML) are being progressively used across different sectors including healthcare. One of the AI-powered tools is computer vision (CV), the ability to recognize, interpret, and process visual data. Thus, potential applications of computer vision in the medical field are multifold, from image processing and predictive analysis to automated health reco...

127
Applications of Artificial Intelligence in Automotive Industry
Applications of Artificial Intelligence in Automotive Industry
Daryna Pesina Photo
Daryna Pesina COO @It-Jim

A century ago, the very thought of machines being able to think, make complicated calculations, and come up with effective solutions to pressing problems was more of a figment of science fiction writer’s fantasy rather than a foreseeable reality. Still, as we move into the third decade of the 21st century, we cannot imagine our life without manufacturing robots, marketing and stock trading bots,...

225
Practical Aspects of Real-Time Video Pipelines
Practical Aspects of Real-Time Video Pipelines
Oleksiy Grechnyev Photo
Oleksiy Grechnyev CV/ML engineer @It-Jim

Video is an extremely popular way to represent information. Indeed, sometimes it is enough to watch a short clip instead of long listening or reading about complicated technical concepts. From a user’s point of view, a video is just a sequence of images followed one-by-one with a very short inter-frame interval. Typically it has around 30 frames per second (FPS). However, many things are lef...

550
Embedded and Single-Board Computer Vision: Running Deep Neural Nets
Embedded and Single-Board Computer Vision: Running Deep Neural Nets
Oleksiy Grechnyev Photo
Oleksiy Grechnyev CV/ML engineer @It-Jim

Deep learning (DL) and neural networks are extremely widespread in different computer vision (CV) applications. Indeed, many typical problems (like object recognition or semantic segmentation) are effectively solved by the convolutional neural networks (CNNs). In this article, we are going to discuss how to utilize CNNs on embedded devices. Article outline: Neural networks, training and inference ...

888
Embedded and Single-Board Computer Vision: Introduction
Embedded and Single-Board Computer Vision: Introduction
Oleksiy Grechnyev Photo
Oleksiy Grechnyev CV/ML engineer @It-Jim

Computer vision (CV) and machine learning (ML) algorithms solve a tremendous amount of problems. However many businesses often do not understand what hardware to choose for running your favorite neural net or some advanced image and video processing pipelines. With this blog post, we start a series of articles about embedded vision and specific practical things you need to know before making your ...

342
Binary Marker Recognition on Raspberry
Binary Marker Recognition on Raspberry
Ievgen Gorovyi Photo
Ievgen Gorovyi CEO @It-Jim

Fiducial markers are widely used in various applications like robot navigation, logistics, augmented reality. Fig. 1. Applications of fiducial markers Advantages are obvious High contrast Simple code generation Resistance to extremal angles However, when we deal with a large number of markers, real-time recognition becomes challenging, especially on embedded devices with low power CPUs on-board....

1178
Watch your steps: a brief review of step detection using mobile sensors
Watch your steps: a brief review of step detection using mobile sensors
Daryna Pesina Photo
Daryna Pesina COO @It-Jim

In our swarming world, it is quite hard to imagine someone having no mobile phone in the pockets of his jeans, her dress or their suits. Even the inveterate skeptic has to accept the fact that smartphones entered our life and have become its inalienable part, the part of us. The mobile phones became our assistants in all aspects of our life, like filming the greatest events of our life, scheduling...

135
Biological Cells Segmentation
Biological Cells Segmentation
Daryna Pesina Photo
Daryna Pesina COO @It-Jim

The task of accurate cell segmentation is essential for cellular biology and single-cell analysis, as well as for studying biological processes as a whole. In biomedical image processing, this includes reconstruction of microscopy images, foreground segmentation, cell detection, cellular compartments and organelles segmentation. Despite the tremendous progress in microscopy cell imaging and numero...

1028
Overview of Indoor Navigation Technologies
Overview of Indoor Navigation Technologies
Ievgen Gorovyi Photo
Ievgen Gorovyi CEO @It-Jim

Development of indoor navigation services and algorithms is becoming a popular trend in IT-industry in recent years. Some of the modern buildings, like airports, shopping malls, warehouses have grown enough (Fig.1) to feel a need for their own navigation tools for customers. Closed environment conditions exclude the usage of common satellite-based navigation systems like GPS or GLONASS, so nowaday...

1540
Marker-Based Augmented Reality
Marker-Based Augmented Reality
Dmytro Sharapov Photo
Dmytro Sharapov CV engineer @It-Jim

Augmented Reality (AR) is one of the most popular and challenging fields in computer vision research. It allows to supplement real world with some kind of digital content, for example, virtual 3D objects. The key feature of Augment Reality in comparison to other image processing tools is that virtual objects are moved and rotated in 3D coordinates instead of 2D image coordinates. The main objectiv...

1910
Tesseract library configuration
Tesseract library configuration
Dmytro Sharapov Photo
Dmytro Sharapov CV engineer @It-Jim

You’ve undoubtedly seen it before… It’s widely used to process everything from scanned documents to the handwritten scribbles on your tablet PC and Google Translate. And today you’ll create your first app for text recognition. Optical Character Recognition, or OCR, is the process of electronically extracting text from images and reusing it in a variety of ways such as document editing, fre...

1359
Automatic number plate recognition (ANPR) systems
Automatic number plate recognition (ANPR) systems
Ievgen Gorovyi Photo
Ievgen Gorovyi CEO @It-Jim

Currently, the number of cars in the world is well over 1 billion. It is no wonder that one of the most common computer vision tasks is the effective control of these vehicles through automatic number plate recognition (ANPR) systems. The applications of automatic vehicle number plate detection and recognition vary depending on the area of use and include, among others, border control, stolen car ...