Calibração de Sistemas de Visão Catadióptricos em Robótica de Futebol

Atualizado em:20/02/2025

Informações do documento

Idioma	Portuguese
Número de páginas	104
Formato	\| PDF
Tamanho	1.25 MB

Resumo

I.Calibration Process

This document proposes a method for calibrating catadioptric systems used in robotic soccer. The proposed method is non-parametric and can be applied to any type of catadioptric system without requiring a unique viewpoint or prior knowledge of the mirror and camera parameters. Instead, the method utilizes the structured environment of the soccer field to extract information from field lines for the calibration process. Multiple images are acquired under specific conditions to obtain a complete calibration.

1. Processo de Calibração

O processo proposto de calibração não paramétrica para sistemas catadióptricos genéricos, independentemente do tipo de espelho e da restrição imposta pela existência de um único ponto de vista, o que é bastante limitativo em relação à precisão da montagem do sistema. O método criado usa o ambiente estruturado desses robôs para o processo de calibração, sem precisar de um padrão externo de calibração. Desta maneira, usa a informação, embora incompleta, que as linhas do campo fornecem, usando várias imagens para realizar a calibração, imagens essas adquiridas sobre certas condições específicas.

2. Análise das Imagens e Extração de Características

Foi apresentado uma abordagem de detecção de linhas usando o pressuposto de que, mesmo em sistemas de visão omnidirecionais, as linhas retilíneas são localmente lineares na imagem. Com esta abordagem, a técnica base consiste em selecionar um pixel de linha e marcá-lo como um canto (início da linha) e, em seguida, procurar o pixel mais próximo correspondente a uma linha branca. Apenas os pixels adjacentes que pertencem a uma linha de campo branca são considerados vizinhos. Além disso, salvamos na memória a última direção obtida - posição do último pixel menos posição do anterior, normalizada para ter valor absoluto de um. Usando as últimas k direções salvas, calculamos a média para estimar a direção local da linha. Isso pode ser usado para prever a posição do pixel seguinte da linha. Não encontrando um pixel na posição esperada, outros vizinhos do último pixel são pesquisados e então o mais próximo da direção estimada e correspondente a uma linha branca é selecionado.

3. Identificação das Linhas

Para identificar as linhas extraídas, é proposto uma abordagem que analisa a direção local da linha para detectar cantos. Essa abordagem estima a direção média da linha, e a diferença de direção das extremidades de cada linha. Com isso, é possível diferenciar entre linhas sem cantos e linhas com cantos, permitindo a identificação correta das linhas do campo de futebol.

4. Extração da Estrutura do Campo de Futebol

Para extrair a estrutura do campo de futebol, o algoritmo proposto usa o conhecimento da estrutura do campo para identificar as linhas principais e os cantos. Ele começa pela detecção do círculo central, assumindo que ele é observável de todas as posições possíveis do robô no campo. Em seguida, usando as linhas do círculo central, o algoritmo extrai as linhas da lateral, linha do meio e áreas de pênalti e gol. Para isso, o algoritmo usa uma combinação de métodos de correspondência de linhas e análise de cantos para identificar as linhas corretamente.

5. Processo de Otimização

O processo de otimização estima uma função que mapeia pixels para pontos do campo, usando uma abordagem de mínimos quadrados lineares. Com isso, é possível calibrar o sistema de visão para obter uma correspondência precisa entre pixels da imagem e pontos do mundo real. O algoritmo proposto usa múltiplas imagens com diferentes posições do robô no campo para melhorar a precisão da calibração e compensar as distorções introduzidas pelo sistema de visão catadióptrico.

6. Avaliação dos Resultados

A avaliação dos resultados mostrou que o algoritmo proposto é capaz de calibrar com sucesso o sistema de visão catadióptrico, obtendo erros médios abaixo de 2 cm. Além disso, o algoritmo conseguiu calibrar bem a região central da imagem, mesmo com poucas imagens e uma função de baixa ordem. No entanto, para obter melhores resultados em áreas mais distantes da imagem, é sugerido o uso de uma função mais complexa ou a combinação de várias funções.

II.Literature Review

SVP-Constrained Approaches

Existing SVP-constrained approaches utilize a unified theory for central projection systems calibration. These approaches map 3D world points to the image plane and can describe various SVP vision systems, including combinations of orthographic cameras and parabolic mirrors, or perspective cameras with hyperbolic, elliptical, or planar mirrors. Camera and mirror alignment is essential in these systems, with specific requirements for orthographic cameras and perspective cameras.

Non-SVP Approaches

Non-SVP sensors overcome alignment constraints and offer flexibility in mirror shape and position. They enable the use of multi-part mirrors and single-camera stereo-vision. However, images captured by non-SVP systems cannot be rectified, and there is no exact closed-form function to transform omnidirectional images into perspective images.

1.2 Literature Review

Main approaches to the calibration of catadioptric systems:

Parametric: Estimate camera and mirror parameters as well as a mapping function
Non-parametric: Estimate only a mapping function

Constrained approaches:

Single View Point (SVP): Single point of view with mirror and camera axes aligned
Non-SVP: Overcomes alignment constraint, allowing multi-part mirrors and stereo-vision

Representative methods:

Unified projection model for central projection systems: Provides a model that maps points in 3D world to the image plane, allowing calibration of various SVP systems (e.g., parabolic mirror with orthographic camera, hyperbolic/elliptical/planar mirror with perspective camera)
Distance map around the robot's center: Calculated using back-propagation ray-tracing, compensating for misalignments
Planar grids for omnidirectional vision system calibration: Generalizes the typical grid-based calibration method for perspective cameras, using an extra distortion function to model misalignment and lens imperfections

Key considerations for non-parametric approaches:

Type of function for pixel-to-ground coordinates mapping: Polynomial approximation, splines, etc.
Calibration process constraints: Robot positions and image line selection

Proposed approach:

Non-parametric approach, using planar grids to calibrate the system
Utilizes soccer field lines for calibration, avoiding the need for external setup
Uses a polynomial function to fit the pixel to ground coordinates mapping
Optimizes the function parameters using a linear optimization method

1.3 Applied Computer Vision Techniques

Pinhole camera model:

Simple model that does not include lenses or their effects (e.g., geometric distortions or blurring)

Homogeneous transformations:

Used to model camera-to-mirror coordinate transformation, composed of a rotation matrix and a translation vector
Rotation matrix describes extrinsic rotations (yaw, pitch, roll) around fixed reference frame axes

Ray tracing:

Method to calculate the intersection of a ray with a mirror and the ground, enabling the determination of world points corresponding to image pixels
Considers mirror shape, misalignments, and camera model

Image processing techniques:

Color segmentation: Used to detect white field lines
Line tracking: Follows lines to detect their entire length, assuming local linearity in the image

III.Non Parametric Calibration

The proposed non-parametric calibration method involves optimizing a function that maps image pixels to world points (ground coordinates). This approach is particularly suitable for soccer applications, where a mapping between pixels and world points is necessary for localization. The intrinsic parameters of the camera, mirror, and their relative positions are irrelevant in this context. However, identifying the appropriate function and constraining the calibration process are crucial.

1. Introdução do Trabalho

Os sistemas de visão catadióptrica são úteis em muitas aplicações, como os robôs de futebol, geralmente requerendo uma boa calibração feita anteriormente. Os métodos paramétricos, por sua vez, não tentam estimar os parâmetros da câmera e do espelho, mas apenas uma função de mapeamento entre pontos de imagem e pontos do mundo. Nesse contexto, este trabalho propõe um método de calibração não paramétrica para qualquer tipo de sistema catadióptrico usado no futebol robótico, independentemente do tipo de espelho ou restrição de um único ponto de vista, o que limita bastante a precisão da montagem do sistema.

2. O Estado da Arte da Calibração

As abordagens com restrição de um único ponto de vista (SVP) e abordagens não-SVP são discutidas, bem como os métodos automáticos de calibração, as grades e os métodos não paramétricos, que não tentam estimar os parâmetros da câmera e do espelho, mas apenas uma função de mapeamento entre pontos de imagem e pontos do mundo.

3. Método de Calibração Não Paramétrica

O método de calibração proposto usa o ambiente estruturado dos robôs de futebol para o processo de calibração, sem precisar de um padrão externo de calibração. Dessa forma, usa a informação que as linhas do campo fornecem, embora incompleta, usando várias imagens para realizar a calibração, adquiridas sob certas condições específicas.

4. Experimentação e Resultados

Para avaliar o desempenho do método de calibração não paramétrica proposto, foram realizados experimentos de simulação e com imagens reais. Os resultados mostram que o método proposto pode obter uma calibração precisa com uma variedade de tipos de sistemas catadióptricos e condições de imagem.

5. Conclusão e Trabalho Futuro

O método de calibração não paramétrica é uma abordagem promissora para calibrar sistemas de visão catadióptrica em robôs de futebol. O método alcançou resultados de calibração precisos em experimentos de simulação e com imagens reais. Como trabalho futuro, os autores pretendem explorar ainda mais o uso de técnicas de aprendizado de máquina para melhorar a precisão e a robustez do método de calibração.

IV.Image Processing and Line Extraction

To extract field lines for calibration, image processing techniques are employed. Color segmentation is used to identify white field lines against the green background. Additionally, a line detection method is developed to handle incomplete lines and dashed lines, assuming that field lines are locally linear in the image. A tracking technique is used to follow and detect entire lines, even when temporarily obscured due to line breaks.

Post-processing Methods:

Once individual lines are detected, post-processing methods are applied to improve line identification.

Connecting unidentified line points: Lines are merged if they share similar characteristics and are likely part of the same line.
Expanding lines in both directions: Lines are extended from their extremities to account for occlusions or missing segments.
Merging intersecting lines: Lines that intersect without crossing are merged to create a single line.
Removing invisible lines: Lines consisting entirely of invisible points (due to occlusion or low resolution) are discarded.

1. Image Processing and Line Extraction

In this section, we propose a method to solve the issues of parametric calibration approaches for catadioptric systems: misalignments between camera and mirror and limited field of view. Using this approach, estimates are mapped between pixels and world points (ground) for use in the localization process.

In contrast to parametric approaches, non-parametric approaches estimate the parameters of a function that best fits the mapping, which is ideal for soccer applications that do not require knowledge of camera, mirror, or their relative positions. We can then find the type of function that best fits the mapping and the calibration process itself, namely its constraints.

Our approach is to first obtain calibration images from the robot's vision system, which we then use to extract field lines (using color segmentation) and identify their structure to retrieve information from the lines' position. Finally, we apply an optimization method that fits a function to the pixel to world coordinates mapping.

The advantages of our approach include its ability to be used with any type of camera and mirror, its independence of the robot's position, and its use of only field lines for calibration, which eliminates the need for external setup and allows for calibration even after mechanical changes to the robot.

We evaluate our approach using simulated images and compare it to existing parametric methods. Our results show that our approach can achieve comparable accuracy to parametric methods, even with strong misalignments and a limited field of view. Also, our approach is more robust to noise than parametric methods and can be used with any type of camera and mirror, making it a more versatile and practical solution for catadioptric system calibration.

V.Line Identification and Matching

To identify and match field lines, specific features and techniques are employed:

Center Circle Detection:

Assuming the center circle is visible from all robot positions, detection algorithms are used to identify it.

Line Adjacency Matrix:

A matrix is constructed to represent the adjacency relationships between detected lines.
This matrix is used to match lines to their corresponding field lines based on their relative positions and orientations.

Corner Detection:

Line corners are detected using a technique that analyzes local direction changes along the line.

Line Classification:

Lines are classified into different types (e.g., goal lines, penalty area lines) based on their adjacency relationships and corner positions.

1. Line Identification and Matching

The line identification and matching process consists of several steps:

Detection of the center circle: This is important for calibrating the central and middle radial regions of the image. The algorithm assumes it can see the center circle from any position in the field.
Extraction of other lines: Once the center circle is identified, the algorithm proceeds to extract other lines, such as the halfway line, side lines, goal lines, goal area lines, and penalty area lines.
Matching lines to the soccer field: The algorithm uses its knowledge of the soccer field structure to match the detected lines to their corresponding lines on the field, such as the halfway line, side lines, and goal lines.
Estimation of calibration points: For each detected line, the algorithm estimates the calibration point corresponding to the corners of the line. This information is used in the calibration process.

VI.Optimization and Calibration

Optimization Approach:

A linear least squares optimization is used to estimate the parameters of the mapping function that relates image pixels to world points.
To account for different robot positions in multiple calibration images, extrinsic coordinates are used to transform ground point correspondences to a common coordinate system.

Calibration Results:

Calibration results show that using multiple calibration images and an appropriate order for the mapping function leads to accurate calibration with low errors.
The calibration method can handle significant misalignments between the camera and mirror.
To ensure real-time performance, a lookup table is suggested to store the mapping function values for quick retrieval.

1. Optimization and Calibration

In this section we describe the proposed calibration procedure, implemented using Matlab. We use a non-parametric approach, as we only want to calibrate mapping between pixels and world points, while ignoring the intrinsic and extrinsic parameters of the system. This is specially useful in robot soccer, because this calibration process will be applied in the localization system, in which only that information is relevant. As our vision system was not designed for calibration purposes, it introduces some difficulties on the calibration process. One of them is the limited range of robot positions in the field, as not every possible robot position can be achieved, particularly the ones near the exterior boundary of the field. This problem becomes specially noticeable with calibration methods that use a predefined set of field points to optimize the extrinsic parameters, thus not being able to cope with the problem of having a non - uniformly distributed set of calibration points. In our case, as we are only interested in the mapping between pixels and ground points, we do not have this problem, since a non-parametric algorithm just tries to fit a function to the mapping from pixels to ground points, independent of the point distribution. We also present a method to detect and extract the lines present in images taken by a commonly used catadioptric vision system in robot soccer. This is due to the fact that the calibrated image will be used in the line detection process, and so it is important to optimize the robustness and quality of the line extraction. Then, considering that in the soccer field white lines are rectilinear, we can assume that locally, and even in catadioptric images, those lines are still locally rectilinear. Finally, the calibration method is presented, which is a least squares optimization that finds the parameters of a polynomial function that best fits the mapping from image pixels to world coordinates.

2. Calibration Procedure

To achieve the proposed calibration goal, it was necessary to develop the technique described in this section. Our approach uses the information provided by the soccer field white lines, in order to obtain a mapping between pixels and world coordinates, and so we only need to assume the camera and mirror are correctly aligned, and that the soccer field has a rectangular shape. We also need a reference image in which the pixel position for a set of points is known, and a test image in which we want to calculate the pixel to world point mapping. First, we will execute the line detection procedure for both images, thus obtaining the information about the soccer field white lines. In the reference image, we already know the pixel position of some field points, so we can calculate the ground point corresponding to that pixels. In the test image we make the same estimation, and finally we calculate the error between the predicted ground point and the real position of the ground points used for calibration, using the sum of squared errors. As we do not have the intrinsic parameters, we cannot apply the transformation from image coordinates to camera coordinates. Therefore, we build a homogeneous transformation to model the relative position between the image and world reference systems, calculated using a linear optimizer. We use a polynomial function to approximate the mapping, with a degree chosen in order to make the error acceptable for the defined field size. In order to use the ground information from different images, we also need to calculate the relative extrinsic transformation between the images, as explained in 5.3. With all that information, we perform a linear optimization to find the parameters of the chosen polynomial function. Therefore, in this calibration method there are no local optima, and so there is no need for iterative methods to estimate the parameters. This is an important advantage, specially if the order of the polynomial function is high. In practice the calibration images are not taken in a controlled environment, and so we cannot force the robot to be placed in a determined position. Consequently, another important issue is to use a world coordinate system applied to the reference image, but in some instances an extrinsic transformation could be different enough so as the world reference system needed to be re-created, thus using a different robot position for calibration. Therefore, to overcome this problem, we added to our calibration method the constrained least squares optimization, a variation of linear optimization that includes equality constraints.

VII.Future Work and Conclusion

Future Work:

Investigate the use of splines to achieve better calibration of both central and peripheral image regions.
Explore other types of non-parametric calibration methods.
Develop a method to estimate a mapping function that relates pixel positions to rays in the world coordinate system.

Conclusion:

The proposed non-parametric calibration method is suitable for catadioptric systems in robotic soccer applications. It provides accurate calibration results for various types of catadioptric systems with significant misalignments. The method does not require a unique viewpoint or prior knowledge of the mirror and camera parameters, and it utilizes the structured environment of the soccer field for calibration. The results demonstrate the effectiveness of the method in obtaining a mapping between image pixels and world points, which is essential for localization in robotic soccer.

1. Future Work

This section outlines potential future research directions for the work presented in the thesis, including:

Extending the method to handle polar coordinates for lines that are significantly curved due to misalignments.
Developing a more robust method for line detection and identification, particularly for images with low resolution and blurred lines due to defocusing in the peripheral regions.
Investigating the use of alternative non-parametric calibration methods, such as estimating a function that relates pixel positions to rays in the world coordinate system.

2. Conclusion

The thesis presents a non-parametric calibration method that is applicable to any catadioptric vision system used in robotic soccer, regardless of the mirror type or the presence of a single viewpoint constraint. The method utilizes the structured environment of robotic soccer fields to perform the calibration process without the need for external calibration patterns. The proposed method extracts information from the field lines and combines multiple images, taken under specific conditions, to achieve calibration. The results demonstrate the effectiveness of the non-parametric approach, showing that it can achieve accurate calibration with a relatively low order of the estimation function. However, further improvements can be made by exploring alternative line detection and identification techniques and investigating more advanced non-parametric calibration methods.

Referência do documento

Calibration of non-svp hyperbolic catadioptric robotic vision systems (B. Cunha, J. Azevedo, and N. Lau)
Calibration method for misaligned catadioptric camera (T. Mashita, Y. Iwai, and M. Yachida)