Nikolaos (Nikos) Nikolaou

University College London, Physics and Astronomy, Post-Doc

The University of Manchester, School of Computer Science, Visiting Researcher

Followers

Following

Co-authors

Public Views

Senior Research Fellow, UCL

less

InterestsView All (29)

Uploads

Papers by Nikolaos (Nikos) Nikolaou

The Astronomical Journal, Feb 18, 2020

Download

Don't Pay Attention to the Noise: Learning Self-supervised Representations of Light Curves with a Denoising Time Series Transformer

arXiv (Cornell University), Jul 6, 2022

Download

Mapping Mineralogical Distributions on Mars with Unsupervised Machine Learning

Download

Margin Maximization as Lossless Maximal Compression

arXiv (Cornell University), Jan 28, 2020

Download

Better Boosting with Bandits for Online Learning

arXiv (Cornell University), Jan 16, 2020

Download

Ariel x NeurIPS Competition - Inferring Physical Properties of Exoplanets From Next-Generation Telescopes

&lt;p&gt;The study of extra-solar planets, or simply, exoplanets, &amp;#160;planets o... more &lt;p&gt;The study of extra-solar planets, or simply, exoplanets, &amp;#160;planets outside our own Solar System, is fundamentally a grand quest to understand our place in the Universe. Discoveries in the last two decades have re-defined what we know about planets, and helped us comprehend the uniqueness of our very own Earth. In recent years, however, the focus has shifted from planet detection to planet characterisation, where key planetary properties are inferred from telescope observations using Monte Carlo-based methods. However, the efficiency of sampling-based methodologies is put under strain by the high-resolution observational data from next generation telescopes, such as the James Webb Space Telescope and the Ariel Space Mission. We propose to host a regular competition with the goal of identifying a reliable and scalable method to perform planetary characterisation. Depending on the chosen track, participants will provide either quartile estimates or the approximate distribution &amp;#160;of key planetary properties. They will have access to synthetic spectroscopic data generated from the official simulators for the ESA Ariel Space Mission. The aims of the competition are three-fold. 1) To offer a challenging application for comparing and advancing conditional density estimation methods. 2) To provide a valuable contribution towards reliable and efficient analysis of spectroscopic data, enabling astronomers to build a better picture of planetary demographics, and 3) To promote the interaction between ML and exoplanetary science.&lt;/p&gt; &lt;p&gt;The competition is open for all and is expected to run from July to October. We will provide a brief introduction to the competition, its aim and the different tracks available for participants. We will also be sharing preliminary results from the competition in this session.&lt;/p&gt; &lt;p&gt;&amp;#160;&lt;/p&gt;

Correcting Transiting Exoplanet Light Curves for Stellar Spots: A Machine Learning Challenge for the ESA Ariel Space Mission

AAS/Division for Extreme Solar Systems Abstracts, Aug 1, 2019

Download

Peeking inside the Black Box: Interpreting Deep-learning Models for Exoplanet Atmospheric Retrievals

The Astronomical Journal, Oct 13, 2021

Download

Fast Optimization of Non-convex Machine Learning Objectives

Download

Gradient Boosting Models for Photovoltaic Power Estimation Under Partial Shading Conditions

Lecture Notes in Computer Science, 2017

Download

Cost-sensitive boosting algorithms: Do we really need them?

Machine Learning, 2016

Download

ESA-Ariel Data Challenge NeurIPS 2022: Inferring Physical Properties of Exoplanets From Next-Generation Telescopes

Download

The Astronomical Journal, 2020

Download

Information Theoretic Feature Selection in Multi-label Data through Composite Likelihood

Lecture Notes in Computer Science, 2014

Download

Pushing the Limits of Exoplanet Discovery via Direct Imaging with Deep Learning

Machine Learning and Knowledge Discovery in Databases, 2020

Download

Calibrating AdaBoost for asymmetric learning

Abstract. Asymmetric classification problems are characterized by class imbalance or unequal cost... more Abstract. Asymmetric classification problems are characterized by class imbalance or unequal costs for different types of misclassifications. One of the main cited weaknesses of AdaBoost is its perceived inability to handle asymmetric problems. As a result, a multitude of asymmetric versions of AdaBoost have been proposed, mainly as heuristic modifications to the original algorithm. In this paper we challenge this approach and propose instead handling asymmetric tasks by properly calibrating the scores of the original AdaBoost so that they correspond to probability estimates. We then account for the asymmetry using classic decision theoretic ap-proaches. Empirical comparisons of this approach against the most repre-sentative asymmetric Adaboost variants show that it compares favorably. Moreover, it retains the theoretical guarantees of the original AdaBoost and it can easily be adjusted to account for changes in class imbalance or costs without need for retraining.

Download

Cost-sensitive boosting: a unified approach

Download

Better Boosting with Bandits for Online Learning

ArXiv, 2020

Probability estimates generated by boosting ensembles are poorly calibrated because of the margin... more Probability estimates generated by boosting ensembles are poorly calibrated because of the margin maximization nature of the algorithm. The outputs of the ensemble need to be properly calibrated before they can be used as probability estimates. In this work, we demonstrate that online boosting is also prone to producing distorted probability estimates. In batch learning, calibration is achieved by reserving part of the training data for training the calibrator function. In the online setting, a decision needs to be made on each round: shall the new example(s) be used to update the parameters of the ensemble or those of the calibrator. We proceed to resolve this decision with the aid of bandit optimization algorithms. We demonstrate superior performance to uncalibrated and naively-calibrated on-line boosting ensembles in terms of probability estimation. Our proposed mechanism can be easily adapted to other tasks(e.g. cost-sensitive classification) and is robust to the choice of hyper...

Download

Fast Regression of the Tritium Breeding Ratio in Fusion Reactors

ArXiv, 2021

The tritium breeding ratio (TBR) is an essential quantity for the design of modern and next-gener... more The tritium breeding ratio (TBR) is an essential quantity for the design of modern and next-generation D-T fueled nuclear fusion reactors. Representing the ratio between tritium fuel generated in breeding blankets and fuel consumed during reactor runtime, the TBR depends on reactor geometry and material properties in a complex manner. In this work, we explored the training of surrogate models to produce a cheap but high-quality approximation for a Monte Carlo TBR model in use at the UK Atomic Energy Authority. We investigated possibilities for dimensional reduction of its feature space, reviewed 9 families of surrogate models for potential applicability, and performed hyperparameter optimisation. Here we present the performance and scaling properties of these models, the fastest of which, an artificial neural network, demonstrated R =0.985 and a mean prediction time of 0.898μs, representing a relative speedup of 8 · 10 with respect to the expensive MC model. We further present a nov...

Download

Lessons Learned from the 1st ARIEL Machine Learning Challenge: Correcting Transiting Exoplanet Light Curves for Stellar Spots

ArXiv, 2020

The last decade has witnessed a rapid growth of the field of exoplanet discovery and characterisa... more The last decade has witnessed a rapid growth of the field of exoplanet discovery and characterisation. However, several big challenges remain, many of which could be addressed using machine learning methodology. For instance, the most prolific method for detecting exoplanets and inferring several of their characteristics, transit photometry, is very sensitive to the presence of stellar spots. The current practice in the literature is to identify the effects of spots visually and correct for them manually or discard the affected data. This paper explores a first step towards fully automating the efficient and precise derivation of transit depths from transit light curves in the presence of stellar spots. The methods and results we present were obtained in the context of the 1st Machine Learning Challenge organized for the European Space Agency's upcoming Ariel mission. We first present the problem, the simulated Ariel-like data and outline the Challenge while identifying best pra...

Download

The Astronomical Journal, Feb 18, 2020

Download

Don't Pay Attention to the Noise: Learning Self-supervised Representations of Light Curves with a Denoising Time Series Transformer

arXiv (Cornell University), Jul 6, 2022

Download

Mapping Mineralogical Distributions on Mars with Unsupervised Machine Learning

Download

Margin Maximization as Lossless Maximal Compression

arXiv (Cornell University), Jan 28, 2020

Download

Better Boosting with Bandits for Online Learning

arXiv (Cornell University), Jan 16, 2020

Download

Ariel x NeurIPS Competition - Inferring Physical Properties of Exoplanets From Next-Generation Telescopes

Correcting Transiting Exoplanet Light Curves for Stellar Spots: A Machine Learning Challenge for the ESA Ariel Space Mission

AAS/Division for Extreme Solar Systems Abstracts, Aug 1, 2019

Download

Peeking inside the Black Box: Interpreting Deep-learning Models for Exoplanet Atmospheric Retrievals

The Astronomical Journal, Oct 13, 2021

Download

Fast Optimization of Non-convex Machine Learning Objectives

Download

Gradient Boosting Models for Photovoltaic Power Estimation Under Partial Shading Conditions

Lecture Notes in Computer Science, 2017

Download

Cost-sensitive boosting algorithms: Do we really need them?

Machine Learning, 2016

Download

ESA-Ariel Data Challenge NeurIPS 2022: Inferring Physical Properties of Exoplanets From Next-Generation Telescopes

Download

The Astronomical Journal, 2020

Download

Information Theoretic Feature Selection in Multi-label Data through Composite Likelihood

Lecture Notes in Computer Science, 2014

Download

Pushing the Limits of Exoplanet Discovery via Direct Imaging with Deep Learning

Machine Learning and Knowledge Discovery in Databases, 2020

Download

Calibrating AdaBoost for asymmetric learning

Download

Cost-sensitive boosting: a unified approach

Download

Better Boosting with Bandits for Online Learning

ArXiv, 2020

Download

Fast Regression of the Tritium Breeding Ratio in Fusion Reactors

ArXiv, 2021

Download

Lessons Learned from the 1st ARIEL Machine Learning Challenge: Correcting Transiting Exoplanet Light Curves for Stellar Spots

ArXiv, 2020

Download

Cost-Sensitive Boosting: A Unified Approach

PhD Thesis, University of Manchester, 2016

In this thesis we provide a unifying framework for two decades of work in an area of Machine Lear... more In this thesis we provide a unifying framework for two decades of work in an area of Machine Learning known as cost-sensitive Boosting algorithms. This area is concerned with the fact that most real-world prediction problems are asymmetric, in the sense that different types of errors incur different costs.Adaptive Boosting (AdaBoost) is one of the most well-studied and utilised algorithms in the field of Machine Learning, with a rich theoretical depth as well as practical uptake across numerous industries. However, its inability to handle asymmetric tasks has been the subject of much criticism. As a result, numerous cost-sensitive modifications of the original algorithm have been proposed. Each of these has its own motivations, and its own claims to superiority.With a thorough analysis of the literature 1997-2016, we find 15 distinct cost-sensitive Boosting variants - discounting minor variations. We critique the literature using {\em four} powerful theoretical frameworks: Bayesian decision theory, the functional gradient descent view, margin theory, and probabilistic modelling.From each framework, we derive a set of properties which must be obeyed by boosting algorithms. We find that only 3 of the published Adaboost variants are consistent with the rules of all the frameworks - and even they require their outputs to be calibrated to achieve this.Experiments on 18 datasets, across 21 degrees of cost asymmetry, all support the hypothesis - showing that once calibrated, the three variants perform equivalently, outperforming all others.Our final recommendation - based on theoretical soundness, simplicity, flexibility and performance - is to use the original Adaboost algorithm albeit with a shifted decision threshold and calibrated probability estimates. The conclusion is that novel cost-sensitive boosting algorithms are unnecessary if proper calibration is applied to the original

Download

Fast optimization of non-convex Machine Learning objectives

MSc Thesis, University of Edinburgh, 2012

In this project we examined the problem of non-convex optimization in the context of Machine Lear... more In this project we examined the problem of non-convex optimization in the context of Machine Learning, drawing inspiration from the increasing popularity of methods such as Deep Belief Networks, which involve non-convex objectives. We focused on the task of training the Neural Autoregressive Distribution Estimator, a recently proposed variant of the Restricted Boltzmann Machine, in applications to density estimation. The aim of the project was to explore the various stages involved in implementing optimization methods and choosing the appropriate one for a given task. We examined a number of optimization methods, ranging from derivative-free to second order and from batch to stochastic. We experimented with variations of these methods, presenting along the way all the major steps and decisions involved. The challenges of the problem included the relatively large parameter space and the non-convexity of the objective function, the large size of some of the datasets we used, the multitude of hyperparameters and decisions involved in each method, as well as the ever-present danger of overfitting the data. Our results show that second order Quasi-Newton batch methods like L-BFGS and variants of stochastic first order methods like Averaged Stochastic Gradient Descent outshine the rest of the methods we examined.

Download

Music Emotion Classification

M.Eng. Thesis, Technical University of Crete, 2011

In this thesis we focus on the automatic emotion classification of music samples. We extract a se... more In this thesis we focus on the automatic emotion classification of music samples. We extract a set of features from the music signal and examine their discriminatory capability using various classification techniques. Our goal is to determine the features and the classification methods that lead to the best classification of the emotion a music sample conveys. During the course of the thesis, we generated
our own dataset of annotated song samples and we examined two distinct methods of describing an emotion: using clusters consisting of various emotional states, and using a two-dimensional representation of the emotion in the Valence-Activation plane. The latter method was chosen as the most successful. We also tried other approaches of music emotion classification (MEC) as well, such as treating the song sample as an amplitude and frequency modulated (AM-FM) signal, on which we subsequently perform multiband demodulation analysis (MDA) testing various Gabor filter banks (Mel scale-based filter bank, Bark scale-based filter bank, and a number of fractional octave-based filter banks). Statistics of the Frequency Modulation Percentages (FMPs) of each band derived from the demodulation, proved to be quite successful features in the classification of emotion. Finally, we
explored other modalities besides the music sound signal itself, such as a number of features derived from the chords of the song samples, classification of the song samples' lyrics using various techniques and a brief investigation of Electroencephalogram (EEG) data generated by one of the annotators
while performing the annotation of the song samples. Our final feature-pack included a combination of the most successful features among the ones we studied: (i) music-inspired features (features based on music theory and psychoacoustics, derived from either the sound signal or the chords of the sample), (ii) statistics of the FMPs and (iii) statistics of the Mel-frequency cepstral coefficients (MFCCs). This feature-pack proved to be more robust than its three individual components and in the end we achieved results that reached 85.7% correct classification rate in the dimension of Valence and 85.1% correct classification rate in the dimension of Activation. We finally demonstrate that by discarding training samples that are assigned a label too close to the neutral value, our results can improve even further, especially in the dimension of Activation.

Download

Margin Maximisation as Lossless Maximal Compression

The ultimate goal of a supervised learning algorithm is to produce models constructed on the trai... more The ultimate goal of a supervised learning algorithm is to produce models constructed on the training data that can generalize well to new examples. In classifica- tion, functional margin maximization – correctly classifying as many training examples as possible with maximal confidence – has been known to construct models with good generalization guarantees. This work gives an information-theoretic interpretation of a margin maximizing model on a noiseless training dataset as one that achieves lossless maximal compression of said dataset – i.e. extracts from the features all the useful information for predicting the label and no more. The connection offers new insights on generalization in supervised machine learning, showing margin maximization as a special case (that of classification) of a more general principle and explains the success and potential limitations of popular learning algorithms like gradient boosting. We support our observations with theoretical arguments and empirical evidence and identify interesting directions for future work.

Download

Better Boosting with Bandits for Online Learning

by Nikolaos (Nikos) Nikolaou and Joseph Mellor

ArXiV Preprint, 2020

Probability estimates generated by boosting ensembles are poorly calibrated because of the margin... more Probability estimates generated by boosting ensembles are poorly calibrated because of the margin maxi-mization nature of the algorithm. The outputs of the ensemble need to be properly calibrated before they can be used as probability estimates. In this work, we demonstrate that online boosting is also prone to producing distorted probability estimates. In batch learning, calibration is achieved by reserving part of the training data for training the calibrator function. In the online setting, a decision needs to be made on each round: shall the new example(s) be used to update the parameters of the ensemble or those of the calibrator. We proceed to resolve this decision with the aid of bandit optimization algorithms. We demonstrate superior performance to uncalibrated and naively-calibrated on-line boosting ensembles in terms of probability estimation. Our proposed mechanism can be easily adapted to other tasks(e.g. cost-sensitive classification) and is robust to the choice of hyperparameters of both the calibrator and the ensemble.

Download

Pushing the Limits of Exoplanet Discovery via Direct Imaging with Deep Learning

by Nikolaos (Nikos) Nikolaou, Mario Morvan, and Ingo Waldmann

ECML-PKDD, 2019

Further advances in exoplanet detection and characterisa-tion require sampling a diverse populati... more Further advances in exoplanet detection and characterisa-tion require sampling a diverse population of extrasolar planets. One technique to detect these distant worlds is through the direct detection of their thermal emission. The so-called direct imaging technique, is suitable for observing young planets far from their star. These are very low signal-to-noise-ratio (SNR) measurements and limited ground truth hinders the use of supervised learning approaches. In this paper, we combine deep generative and discriminative models to bypass the issues arising when directly training on real data. We use a Generative Adversarial Network to obtain a suitable dataset for training Convolutional Neural Network classifiers to detect and locate planets across a wide range of SNRs. Tested on artificial data, our detectors exhibit good predictive performance and robustness across SNRs. To demonstrate the limits of the detectors, we provide maps of the precision and recall of the model per pixel of the input image. On real data, the models can reconfirm bright source detections.

Download

Nikolaos (Nikos) Nikolaou

Uploads

Papers by Nikolaos (Nikos) Nikolaou

Log In