The development of modern neural networks has brought about a revolution in the field of image generation. One such example is the text-to-image neural network, DALL-E 2, which can generate beautiful art when supplied with good text descriptions (typically referenced as “prompts”). The quality of images generated by DALL-E 2 heavily depends on the proper structure of inputs.
Augmented reality has already proven its positive impact on many businesses. One of the latest trends is so-called WebAR. Indeed, what can be easier than just opening the web page for instant immersive experience? The goal of this project was to develop and optimize the image detection and tracking algorithm for AR applications. The main challenge was to make it work directly in the mobile web front-end with all computations done on the edge.
Our client’s goal was to enhance various printed media (magazines, posters, banners, etc.) with interactive experience using augmented reality. With AR, certain areas on the reading materials can be overlayed with digital information of a different kind: from videos, images, and 3D models to weather information and buttons that bring additional functionality, etc.
The task of automatic document analysis and recognition is very common in everyday life. Basically, every time when a user needs to automatically parse and recognize some content from a picture captured with a mobile phone/tablet or a scanned document – for example, text, tables, links, etc., automatic document recognition and text analysis come to the stage.
Object recognition is an important computer vision and machine learning problem. A specific case is automatic target recognition (ATR) on radar images. ATR can be effectively used for border security, safety systems to identify either man-made objects (such as buildings, ground and air vehicles) or people, as well as for target surveillance. In other words, with ATR one can obtain any visual information about the ground and objects without direct physical contact.
Indoor positioning systems are becoming popular nowadays. Indeed, there is plenty of opportunities for real-time user navigation in GPS-denied environments. An interesting use cases are as follows: Fig. 1. Indoor navigation use cases There are several options for hardware (see It-Jim blog post). We have developed the positioning algorithm based on cheap Bluetooth beacons and built-in IMU sensors on a mobile device.
Synthetic aperture radar (SAR) systems are very popular instrument for high-resolution image of ground surface. Unlike to optical systems, SAR can be used in all weather and lighting conditions. A basic idea of SAR technique is coherent processing of received signals on a moving platform (aircraft or satellite). The main challenge is to perform very precise measurements of platform position at each moment of time.
Our task was to develop the algorithm for the automatic road detection in radar images. The challenge was that the radar images are a bit different from the optical ones. In particular, in the case of synthetic aperture radar (SAR), the image formation process is accomplished via coherent processing of the received signals backscattered from the Earth surface. As a result, the multiplicative speckle noise appears in the SAR images.
Systems and applications for detecting and tracking moving objects, whether people, vehicles, or anything else, are currently in heavy demand. Such tracking is widely used in security and surveillance, military, entertainment, sports, medical imaging, as well as for augmented reality and robotics. In the case of people trackers, many businesses need AI-powered systems tailored for locating, monitoring, counting and analyzing human flows and behavior.