Extreme learning to handle big data PhD

Updated: 3 days ago
Location: Cranfield, ENGLAND

A funded PhD studentship is available within the Autonomous and Cyber Physical Systems Centre at Cranfield University, Bedfordshire, UK. Read moreRead less

As aerospace platforms go through their service life, gradual performance degradations and unwarranted system failures can occur. There is certain physical information known a priori in such aerospace platform operations. The main research hypothesis to be tested in this research is that it should be possible to significantly improve the performance of extreme learning and assure safe and reliable maintenance operation by integrating this prior knowledge into the learning mechanism.

The integrating should enable to guarantee certain properties of the learned functions, while keep leveraging the strength of the data-driven modelling. Most of, if not all, the traditional statistical methods are not suitable for big data due to their certain characteristics: heterogeneity, statistical biases, noise accumulations, spurious correlation, and incidental endogeneity.  Therefore, big data demands new statistical thinking and methods. As data size increases, each feature and parameter also becomes highly correlated. Then, their relations get highly complicated too and hidden patterns of big data might not be possible to be captured by traditional modelling approaches.

This implies that mathematical modelling of such data is infeasible. The data-driven modelling approach could resolve this issue and we could use obtain data-driven models using machine learning algorithms such as artificial neural networks, reinforcement learning, and deep learning.  A typical caveat of data-driven modelling using learning algorithms as Extreme Learning Machine (ELM) is that training data should cover the entire domain of process parameters to achieve accurate generalization of the trained model to new process configurations. In practice, this might not be possible, that is the sample data could cover only some space, not entire space, of process parameters. Integrating prior knowledge into the learning could enable accurate generalization of the data-driven model even when the space of system parameters is only sampled sparsely.

Consequently, it will improve the performance of the learning. Integration of the prior knowledge of the system into the learning procedure will be quite challenging since the key enabler of its very powers is the universal approximation capabilities.  Sampled data are generally noisy, outliers occur, and there always exist a risk of overfitting corrupted data. Therefore, the learned function may violate a constraint that is present in the ideal function, from which the training data sampled. This PhD study will address this research challenge.

<p">Cranfield is the largest academic centre for postgraduate studies in Science and Technology in the UK. Focused on developing applied research to meet the demands of industry. Ranked in the top five UK universities for commercial income for research, Cranfield is able to invest in and has become home to many world-class, large-scale facilities, which enhance our teaching and research.

Our specialist areas of focus, or Cranfield themes, are where we bring a range of academic disciplines together in order to tackle the grand challenges facing the world within a range of industrial and commercial sectors. These are Water, Agrifood, Energy and Power, Aerospace, Manufacturing, transport systems, Defence and Security and management.

View or Apply

Similar Positions