Subdirector and Lead Data Scientist at the Unidad de Data Science UdeC. I am also a research collaborator at the Millennium Institute of Astrophysics through the ALeRCE broker initiative. I am interested in a range of topics, most of them related to deep learning applied to images and time series.
Keep scrolling for more information about my work :)
In this work we explored the use of different data augmentation and semi-supervised techniques to improve generalization in the semi-supervised domain adaptation (SSDA) context. We show that (i) learning consistent and contrastive feature representations is crucial to extract discriminative features across different domains, and (ii) our model benefits from the use of strong augmentation policies. With these findings, our method achieves state-of-the-art performance on three SSDA benchmark datasets. This work was presented at the NeurIPS 2021 Workshop on Distribution Shifts.
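To make point (i) concrete, here is a minimal PyTorch sketch of a consistency-plus-contrastive objective of the kind described above. The confidence threshold, temperature, and function names are illustrative assumptions, not the exact configuration from the paper.

```python
import torch
import torch.nn.functional as F

def ssda_losses(logits_weak, logits_strong, feats_weak, feats_strong,
                temperature=0.1, threshold=0.9):
    """Consistency + contrastive losses between weak/strong augmented views.

    Hypothetical sketch: pseudo-label the weakly augmented view, then
    enforce (i) prediction consistency on the strongly augmented view
    and (ii) agreement between the two feature embeddings (NT-Xent style).
    """
    # (i) Consistency: confident pseudo-labels from the weak view
    #     supervise the strong view (FixMatch-style training).
    probs = F.softmax(logits_weak.detach(), dim=1)
    conf, pseudo = probs.max(dim=1)
    mask = (conf >= threshold).float()
    consistency = (F.cross_entropy(logits_strong, pseudo,
                                   reduction="none") * mask).mean()

    # (ii) Contrastive: each weak embedding should match its own
    #      strong embedding against all others in the batch.
    z1 = F.normalize(feats_weak, dim=1)
    z2 = F.normalize(feats_strong, dim=1)
    sim = z1 @ z2.t() / temperature            # (B, B) similarity matrix
    targets = torch.arange(z1.size(0), device=z1.device)
    contrastive = F.cross_entropy(sim, targets)

    return consistency, contrastive
```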
[Paper] [Cite]
Deep learning models have proven to be very good at discovering underlying patterns in data. But for these models to work correctly, a large amount of annotated data is usually needed to avoid overfitting and generalize to new data. Unfortunately, in real-life cases, labeling data is costly and time-consuming. Domain adaptation models aim to generalize a model trained on a vast annotated source domain so that it performs well on a sparsely annotated target domain by finding a shared representation between them.
I started this project as my master's thesis, advised by Guillermo Cabrera and Pavlos Protopapas. We tackled the problem using recent advances in semi-supervised learning and generative models, and our model outperformed the state of the art on low-dimensional benchmarks. This work received the Best Student Paper Award at the Astroinformatics 2019 Conference.
[Preprint] [Presentation]
[Cite]
“We are a way for the universe to know itself.”
Carl Sagan
ALeRCE is an inter-institutional and interdisciplinary astronomical broker that uses cloud infrastructure and machine learning to process variable-star alerts from the Zwicky Transient Facility (ZTF) survey. It aims to become a Community Broker for the Vera Rubin Observatory, which will host the largest digital camera ever built for optical astronomy!
In ALeRCE I was responsible for implementing an anomaly detection algorithm using the astronomical light curves coming from the variable-star alerts. The idea is to detect objects that do not belong to ALeRCE's taxonomy and send alerts to astronomers for further follow-up and astrophysical study. Our main goal is to be the first to detect unknown astronomical objects, thereby contributing to a better understanding of new astrophysical phenomena that describe the evolution of our universe.
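As a rough illustration of this kind of pipeline, the sketch below flags outlier light curves from precomputed summary features using scikit-learn's Isolation Forest. The features here are random placeholders, and this is not necessarily the exact detector used in production.

```python
import numpy as np
from sklearn.ensemble import IsolationForest

# Hypothetical feature matrix: one row per light curve, with summary
# statistics (e.g., amplitude, period, skewness) as columns.
rng = np.random.default_rng(0)
features_known = rng.normal(size=(1000, 8))  # objects from the known taxonomy
features_new = rng.normal(size=(50, 8))      # incoming alerts to score

# Fit on objects that match the taxonomy, then score new alerts:
# the lower the score, the more anomalous the light curve.
detector = IsolationForest(n_estimators=300, contamination="auto",
                           random_state=0)
detector.fit(features_known)

scores = detector.score_samples(features_new)
candidates = np.argsort(scores)[:10]  # 10 most anomalous alerts for follow-up
print(candidates)
```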
Paper under review.
[Presentation Paper] [Presentation Video] [Repository]
The wood industry is one of the most important economic sectors in Chile, representing almost one sixth of total exports and placing it as the country's second-largest export sector. In this context, ARAUCO is one of the biggest wood companies in South America and an important competitor globally.
For this sector, information about the number of trees in a field is essential: it helps estimate productivity, evaluate plantation density, and detect errors occurring during the seedling process, opening the possibility of efficient replanting.
ARAUCO Deep-Hub is a software platform that uses cloud infrastructure and deep learning algorithms for labeling, training, and computing statistics from high-dimensional RGB georeferenced images captured by high-resolution drone cameras. For this project I designed and implemented the deep learning algorithms that solve the aforementioned problems. I developed two algorithms based on state-of-the-art work. The first was YOLOv3, a well-known algorithm for real-time object detection in images, which allows us to detect trees and therefore count them. The second was Mask R-CNN, an instance-segmentation algorithm, which detects and segments trees at the same time, allowing us to estimate plantation density. I also designed the algorithms that work as an interface between the deep learning models and the high-dimensional data. Our approach presents a low-cost solution, in contrast to expensive multispectral, hyperspectral, and LiDAR-based solutions. This project won ARAUCO's innovation challenge.
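Here is a minimal sketch of the kind of tiling interface this requires: split the large raster into overlapping patches, run the detector per patch, and map detections back to global pixel coordinates. The `detect` callable stands in for YOLOv3/Mask R-CNN inference and is hypothetical.

```python
import numpy as np

def tile_image(image, tile=1024, overlap=128):
    """Yield (row, col, patch) tiles from a large (H, W, 3) image.

    Overlap keeps trees on tile borders from being cut in half;
    duplicate detections can be merged later with non-max suppression.
    """
    stride = tile - overlap
    h, w = image.shape[:2]
    for r in range(0, max(h - overlap, 1), stride):
        for c in range(0, max(w - overlap, 1), stride):
            yield r, c, image[r:r + tile, c:c + tile]

def count_trees(image, detect):
    """Run a patch-level detector over the whole raster.

    `detect` is a stand-in for the trained model: it takes a patch and
    returns boxes as (x, y, w, h) in patch-pixel coordinates.
    """
    boxes = []
    for r, c, patch in tile_image(image):
        for (x, y, w, h) in detect(patch):
            boxes.append((x + c, y + r, w, h))  # back to global coordinates
    return boxes
```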
Paper published in the International Journal of Digital Earth (IJDE), 2022.
[Paper] [BibTex]
How galaxies evolve through time is one of the key questions of modern astrophysics, and the study of their morphologies represents a fundamental and powerful diagnostic of evolutionary changes in galaxies.
Thus, advised by Guillermo Cabrera, during my undergraduate honors research thesis we created a new catalog of galaxy morphologies for the “Cluster Lensing and Supernova survey with Hubble (CLASH)” using deep neural networks.
One problem with this approach is that the data require a lot of time from expert astronomers to be labeled. Fortunately, it is sometimes possible to find similar or related sources and take advantage of them to make a model trained on our few labeled target data perform better. For this project, we took advantage of a vast labeled source named “The Cosmic Assembly Near-infrared Deep Extragalactic Legacy Survey (CANDELS)”, consisting of images associated with 5 different galaxy morphologies labeled by experts via visual inspection. We used the same taxonomy and created a few eyeball labels. As in some cases there is disagreement between expert opinions, we handled the task as a regression problem. Moreover, we used different deep learning architectures and a transfer learning technique named fine-tuning, and showed the effectiveness of this method. Finally, we released a public catalog of 8,412 galaxies in 25 different photometric filters from CLASH.
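A minimal PyTorch sketch of this fine-tuning setup follows: an ImageNet-pretrained backbone with its classifier replaced by a 5-output regression head trained on expert-vote fractions. The backbone choice and learning rates are illustrative assumptions, not the exact setup from the paper.

```python
import torch
import torch.nn as nn
from torchvision import models

# Start from an ImageNet-pretrained backbone and replace the classifier
# with a 5-output head: one vote fraction per morphology class.
model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
model.fc = nn.Linear(model.fc.in_features, 5)

# Fine-tuning: train the new head with a larger learning rate than the
# pretrained layers instead of freezing them completely.
optimizer = torch.optim.Adam([
    {"params": [p for n, p in model.named_parameters()
                if not n.startswith("fc")], "lr": 1e-5},
    {"params": model.fc.parameters(), "lr": 1e-3},
])
criterion = nn.MSELoss()  # regression over expert-vote fractions

def train_step(images, vote_fractions):
    optimizer.zero_grad()
    loss = criterion(model(images), vote_fractions)
    loss.backward()
    optimizer.step()
    return loss.item()
```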
[Paper] [Repository] [Cite]
* The largest social strike in the history of Chile. More than 1.2 million people gathered in protest in the center of Santiago.
Oct 25th, 2019
The upcoming Chilean constitution represents one of the biggest challenges for the country, especially regarding the analysis and understanding of citizens' requirements and desires. For that reason, we carried out an analysis of the councils and citizen proposals in order to understand the primary requirements of the people. This analysis served as a statistical basis for the discussion around the generation of articles within the constitution. In this project, I led the technical development and analysis team.
This project was requested by Chilean constituents.
[Website]
El Chile que Queremos (The Chile We Want) is an inter-institutional initiative between four Chilean universities (UdeC, UChile, PUC, UDD). Our objective is to investigate and understand the emotions and social needs that caused the 2019 Chilean social outbreak and led to the upcoming change in the Chilean constitution. In this project I led the technical development and analysis team.
This project was requested by the Chilean Ministry of Social Development and Sciences.
[Repository]
Drowsy driving is a major problem in road safety, responsible for a huge number of crashes, injuries, and deaths around the world.
Working at the Biomedical Laboratory of UdeC, I was responsible for implementing a machine learning system for drowsiness detection using electroencephalography (EEG) signals extracted from a 16-channel g.Nautilus EEG headset. I faced several challenges, such as the need for a reduced number of channels given the ergonomic requirements of the system, the high cost of the signal-labeling process, and the need for real-time detection. Thus, we implemented a transfer learning algorithm that learns from vast annotated public sources and uses that knowledge to improve performance on our sparsely annotated signals. In this way, we efficiently reduced the number of channels while keeping the same labeling effort. Moreover, the system runs almost in real time!
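As an illustration of the real-time side, the sketch below slices a streaming EEG buffer into overlapping windows over a reduced channel subset and extracts simple band-power features for a classifier. The sampling rate, channel indices, and window length are hypothetical, not the system's actual configuration.

```python
import numpy as np

FS = 250                 # hypothetical sampling rate (Hz)
WINDOW = 2 * FS          # 2-second analysis window
SELECTED = [0, 3, 7]     # reduced channel subset (hypothetical indices)

def windows(stream, step=FS // 2):
    """Yield overlapping (channels, WINDOW) slices from a (channels, T) buffer."""
    for start in range(0, stream.shape[1] - WINDOW + 1, step):
        yield stream[SELECTED, start:start + WINDOW]

def band_power_features(win):
    """Simple spectral features per channel via the FFT power spectrum."""
    spec = np.abs(np.fft.rfft(win, axis=1)) ** 2
    freqs = np.fft.rfftfreq(win.shape[1], d=1 / FS)
    bands = [(1, 4), (4, 8), (8, 13), (13, 30)]  # delta/theta/alpha/beta
    return np.array([spec[:, (freqs >= lo) & (freqs < hi)].mean(axis=1)
                     for lo, hi in bands]).T.ravel()

# Usage with a pretrained classifier `clf` (e.g., fitted on public EEG data):
# for win in windows(eeg_buffer):
#     if clf.predict([band_power_features(win)])[0] == 1:
#         trigger_alarm()
```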
This project was funded by the "Fund for the Promotion of Scientific and Technological Development" (FONDEF). Unfortunately, the repository will not be publicly released.