Proteomics

Dataset Information

0

Combining proteomics and machine learning to identify triple negative breast cancer biomarkers binding telomeric G-Quadruplex


ABSTRACT: Triple-negative breast cancer (TNBC) lacks ER, PR and HER2 expression, represents ~10–20% of invasive breast cancers, and is clinically aggressive with limited targeted treatment options. Its pronounced molecular heterogeneity challenges single-marker diagnostics, motivating the development of robust biomarker panels for early detection and disease monitoring. Aptamers, and particularly guanine-rich DNA sequences forming G-quadruplexes (G4s), provide stable and versatile molecular recognition tools. Telomeric G4 structures are biologically relevant in genome maintenance and can enrich disease-related protein interactors, making them attractive baits for translational biomarker discovery. Here, we employed an overhang human telomere model capable of forming two consecutive G4s (tel46) immobilized on Controlled Pore Glass (CPG) to profile the nuclear G4 interactome in two TNBC cell lines, MDA-MB-231 and BT-549. Using affinity purification–mass spectrometry (AP-MS) with CPG-tel46 combined with quantitative proteomics and stringent background subtraction, we identified tel46-associated proteins consistently upregulated in both tumour models and prioritized 11 candidates supported by downstream bioinformatic validation. To move beyond single-marker evaluation, we implemented a machine-learning framework to assess candidate proteins as a coordinated molecular signature. Regularized models with embedded feature selection and cross-validation were used to identify stable, discriminative combinations while controlling overfitting. This integrative strategy supports G4-based capture as a practical approach to enrich clinically relevant interactors and prioritize diagnostic panels. We propose a five-protein signature (KIF4A, ACIN1, RBM12, FOXK1 and NCAPD2) as a candidate classifier for TNBC early diagnosis, providing a foundation for independent validation in clinical cohorts.

INSTRUMENT(S):

ORGANISM(S): Homo Sapiens (human)

TISSUE(S): Breast

DISEASE(S): Breast Cancer

SUBMITTER: ilaria iacobucci  

LAB HEAD: Maria Monti

PROVIDER: PXD072964 | Pride | 2026-05-29

REPOSITORIES: Pride

Dataset's files

Source:
Action DRS
BT1_REP1_20250714044634.raw Raw
BT1_REP2_20250714061946.raw Raw
BT1_REP3_20250714075300.raw Raw
BT2_REP1_20250714095944.raw Raw
BT2_REP2_20250714113258.raw Raw
Items per page:
1 - 5 of 30
altmetric image

Publications

Integrating proteomics and bioinformatics for the identification of breast cancer biomarkers interacting with telomeric G-quadruplex.

Iacobucci Ilaria I   Cipollone Irene I   Cozzolino Flora F   Gaglione Rosa R   Mentino Maria Rosaria MR   Platella Chiara C   Musumeci Domenica D   Arciello Angela A   Montesarchio Daniela D   Monti Maria M  

Cancer cell international 20250902 1


The identification of reliable biomarkers is essential for improving breast cancer (BC) detection, prognosis, and treatment. This study explores a human telomeric G-quadruplex (G4) model, tel<sub>46</sub>, functionalized on Controlled Pore Glass (CPG) support, as a novel biomarker discovery tool. The oligonucleotide tel<sub>46</sub> mimics multimeric G4 structures in telomeric overhangs. Using affinity purification-mass spectrometry, 93 proteins interacting with tel<sub>46</sub> were identified  ...[more]

Similar Datasets

2026-05-29 | PXD072866 | Pride
2021-07-19 | PXD026830 | Pride
2025-06-05 | PXD063882 | Pride
2026-04-10 | GSE283485 | GEO
2021-04-23 | GSE173074 | GEO
2026-05-15 | GSE288516 | GEO
2026-05-15 | GSE288430 | GEO
2026-05-15 | GSE330557 | GEO
2016-08-05 | E-GEOD-76688 | biostudies-arrayexpress
2025-03-24 | GSE260479 | GEO