Unknown

Dataset Information

0

Semi-supervised learning framework for oil and gas pipeline failure detection.


ABSTRACT: Quantifying failure events of oil and gas pipelines in real- or near-real-time facilitates a faster and more appropriate response plan. Developing a data-driven pipeline failure assessment model, however, faces a major challenge; failure history, in the form of incident reports, suffers from limited and missing information, making it difficult to incorporate a persistent input configuration to a supervised machine learning model. The literature falls short on the development of appropriate solutions to utilize incomplete databases and incident reports in the pipeline failure problem. This work proposes a semi-supervised machine learning framework which mines existing oil and gas pipeline failure databases. The proposed cluster-impute-classify (CIC) approach maps a relevant subset of the failure databases through which missing information in the incident report is reconstructed. A classifier is then trained on the fly to learn the functional relationship between the descriptors from a diverse feature set. The proposed approach, presented within an ensemble learning architecture, is easily scalable to various pipeline failure databases. The results show up to 91% detection accuracy and stable generalization ability against increased rate of missing information.

SUBMITTER: Alobaidi MH 

PROVIDER: S-EPMC9374783 | biostudies-literature | 2022 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Semi-supervised learning framework for oil and gas pipeline failure detection.

Alobaidi Mohammad H MH   Meguid Mohamed A MA   Zayed Tarek T  

Scientific reports 20220812 1


Quantifying failure events of oil and gas pipelines in real- or near-real-time facilitates a faster and more appropriate response plan. Developing a data-driven pipeline failure assessment model, however, faces a major challenge; failure history, in the form of incident reports, suffers from limited and missing information, making it difficult to incorporate a persistent input configuration to a supervised machine learning model. The literature falls short on the development of appropriate solut  ...[more]

Similar Datasets

2019-11-13 | GSE140262 | GEO
| S-EPMC5441628 | biostudies-literature
| S-EPMC8058768 | biostudies-literature
| S-EPMC10802372 | biostudies-literature
| S-EPMC8123111 | biostudies-literature
| S-EPMC9731861 | biostudies-literature
| S-EPMC11522217 | biostudies-literature
| S-EPMC8520253 | biostudies-literature
| S-EPMC10131478 | biostudies-literature
| S-EPMC8570780 | biostudies-literature