Thursday, 3 August 2023

Researchers successfully train a machine learning model in outer space for the first time

 For the first time, a project led by the University of Oxford has trained a machine learning model in outer space, on board a satellite. This achievement could revolutionise the capabilities of remote-sensing satellites by enabling real-time monitoring and decision making for a range of applications.

Data collected by remote-sensing satellites is fundamental for many key activities, including aerial mapping, weather prediction, and monitoring deforestation. Currently, most satellites can only passively collect data, since they are not equipped to make decisions or detect changes. Instead, data has to be relayed to Earth to be processed, which typically takes several hours or even days. This limits the ability to identify and respond to rapidly emerging events, such as a natural disaster.

To overcome these restrictions, a group of researchers led by DPhil student Vít Růžička (Department of Computer Science, University of Oxford), took on the challenge of training the first machine learning program in outer space. During 2022, the team successfully pitched their idea to the Dashing through the Stars mission, which had issued an open call for project proposals to be carried out on board the ION SCV004 satellite, launched in January 2022. During the autumn of 2022, the team uplinked the code for the program to the satellite already in orbit.

The researchers trained a simple model to detect changes in cloud cover from aerial images directly onboard the satellite, in contrast to training on the ground. The model was based on an approach called few-shot learning, which enables a model to learn the most important features to look for when it has only a few samples to train from. A key advantage is that the data can be compressed into smaller representations, making the model faster and more efficient.

Vít Růžička explained: 'The model we developed, called RaVAEn, first compresses the large image files into vectors of 128 numbers. During the training phase, the model learns to keep only the informative values in this vector; the ones that relate to the change it is trying to detect (in this case, whether there is a cloud present or not). This results in extremely fast training due to having only a very small classification model to train.'

Whilst the first part of the model, to compress the newly-seen images, was trained on the ground, the second part (which decided whether the image contained clouds or not) was trained directly on the satellite.

Normally, developing a machine learning model would require several rounds of training, using the power of a cluster of linked computers. In contrast, the team's tiny model completed the training phase (using over 1300 images) in around one and a half seconds.

sources-science daily

No comments:

Post a Comment