<HashMap><database>biostudies-literature</database><scores/><additional><submitter>Jo J</submitter><funding>National Research Foundation of Korea</funding><pagination>e0278570</pagination><full_dataset_link>https://www.ebi.ac.uk/biostudies/studies/S-EPMC9714948</full_dataset_link><repository>biostudies-literature</repository><omics_type>Unknown</omics_type><volume>17(12)</volume><pubmed_abstract>High-dimensional LASSO (Hi-LASSO) is a powerful feature selection tool for high-dimensional data. Our previous study showed that Hi-LASSO outperformed the other state-of-the-art LASSO methods. However, the substantial cost of bootstrapping and the lack of experiments for a parametric statistical test for feature selection have impeded to apply Hi-LASSO for practical applications. In this paper, the Python package and its Spark library are efficiently designed in a parallel manner for practice with real-world problems, as well as providing the capability of the parametric statistical tests for feature selection on high-dimensional data. We demonstrate Hi-LASSO's outperformance with various intensive experiments in a practical manner. Hi-LASSO will be efficiently and easily performed by using the packages for feature selection. Hi-LASSO packages are publicly available at https://github.com/datax-lab/Hi-LASSO under the MIT license. The packages can be easily installed by Python PIP, and additional documentation is available at https://pypi.org/project/hi-lasso and https://pypi.org/project/Hi-LASSO-spark.</pubmed_abstract><journal>PloS one</journal><pubmed_title>Hi-LASSO: High-performance python and apache spark packages for feature selection with high-dimensional data.</pubmed_title><pmcid>PMC9714948</pmcid><funding_grant_id>NRF-2021R1I1A3048029</funding_grant_id><pubmed_authors>Park J</pubmed_authors><pubmed_authors>Kang M</pubmed_authors><pubmed_authors>Kim Y</pubmed_authors><pubmed_authors>Jo J</pubmed_authors><pubmed_authors>Jung S</pubmed_authors></additional><is_claimable>false</is_claimable><name>Hi-LASSO: High-performance python and apache spark packages for feature selection with high-dimensional data.</name><description>High-dimensional LASSO (Hi-LASSO) is a powerful feature selection tool for high-dimensional data. Our previous study showed that Hi-LASSO outperformed the other state-of-the-art LASSO methods. However, the substantial cost of bootstrapping and the lack of experiments for a parametric statistical test for feature selection have impeded to apply Hi-LASSO for practical applications. In this paper, the Python package and its Spark library are efficiently designed in a parallel manner for practice with real-world problems, as well as providing the capability of the parametric statistical tests for feature selection on high-dimensional data. We demonstrate Hi-LASSO's outperformance with various intensive experiments in a practical manner. Hi-LASSO will be efficiently and easily performed by using the packages for feature selection. Hi-LASSO packages are publicly available at https://github.com/datax-lab/Hi-LASSO under the MIT license. The packages can be easily installed by Python PIP, and additional documentation is available at https://pypi.org/project/hi-lasso and https://pypi.org/project/Hi-LASSO-spark.</description><dates><release>2022-01-01T00:00:00Z</release><publication>2022</publication><modification>2025-04-22T01:54:43.962Z</modification><creation>2025-04-05T20:07:28.297Z</creation></dates><accession>S-EPMC9714948</accession><cross_references><pubmed>36455001</pubmed><doi>10.1371/journal.pone.0278570</doi></cross_references></HashMap>