Dataset Information

Mixture of personality improved spiking actor network for efficient multi-agent cooperation.

ABSTRACT: Adaptive multi-agent cooperation with especially unseen partners is becoming more challenging in multi-agent reinforcement learning (MARL) research, whereby conventional deep-learning-based algorithms suffer from the poor new-player-generalization problem, possibly caused by not considering theory-of-mind theory (ToM). Inspired by the ToM personality in cognitive psychology, where a human can easily resolve this problem by predicting others' intuitive personality first before complex actions, we propose a biologically-plausible algorithm named the mixture of personality (MoP) improved spiking actor network (SAN). The MoP module contains a determinantal point process to simulate the formation and integration of different personality types, and the SAN module contains spiking neurons for efficient reinforcement learning. The experimental results on the benchmark cooperative overcooked task showed that the proposed MoP-SAN algorithm could achieve higher performance for the paradigms with (learning) and without (generalization) unseen partners. Furthermore, ablation experiments highlighted the contribution of MoP in SAN learning, and some visualization analysis explained why the proposed algorithm is superior to some counterpart deep actor networks.

SUBMITTER: Li X

PROVIDER: S-EPMC10361619 | biostudies-literature | 2023

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Mixture of personality improved spiking actor network for efficient multi-agent cooperation.

Li Xiyun X Ni Ziyi Z Ruan Jingqing J Meng Linghui L Shi Jing J Zhang Tielin T Xu Bo B

Frontiers in neuroscience 20230706

Adaptive multi-agent cooperation with especially unseen partners is becoming more challenging in multi-agent reinforcement learning (MARL) research, whereby conventional deep-learning-based algorithms suffer from the poor new-player-generalization problem, possibly caused by not considering theory-of-mind theory (ToM). Inspired by the ToM personality in cognitive psychology, where a human can easily resolve this problem by predicting others' intuitive personality first before complex actions, we ...[more]

PMID: 37483340

Similar Datasets

Project description:In this study we propose an extension of the N-mixture family of models that targets an improvement of the statistical properties of rare species abundance estimators when sample sizes are low, yet typical for tropical studies. The proposed method harnesses information from other species in an ecological community to correct each species' estimator. We provide guidance to determine the sample size required to estimate accurately the abundance of rare tropical species when attempting to estimate the abundance of single species.We evaluate the proposed methods using an assumption of 50 m radius plots and perform simulations comprising a broad range of sample sizes, true abundances and detectability values and a complex data generating process. The extension of the N-mixture model is achieved by assuming that the detection probabilities are drawn at random from a beta distribution in a multi-species fashion. This hierarchical model avoids having to specify a single detection probability parameter per species in the targeted community. Parameter estimation is done via Maximum Likelihood.We compared our multi-species approach with previously proposed multi-species N-mixture models, which we show are biased when the true densities of species in the community are less than seven individuals per 100 hectares. The beta N-mixture model proposed here outperforms the traditional Multi-species N-mixture model by allowing the estimation of organisms at lower densities and controlling the bias in the estimation.We illustrate how our methodology can be used to suggest sample sizes required to estimate the abundance of organisms, when these are either rare, common or abundant. When the interest is full communities, we show how the multi-species approaches, and in particular our beta model and estimation methodology, can be used as a practical solution to estimate organism densities from rapid inventory datasets. The statistical inferences done with our model via Maximum Likelihood can also be used to group species in a community according to their detectabilities.

Dataset Information

Mixture of personality improved spiking actor network for efficient multi-agent cooperation.

Publications

Mixture of personality improved spiking actor network for efficient multi-agent cooperation.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets