Major information on this web site is for old version of SiGN-BN. For documentation of new SiGN-BN 2, please visit the web site for INGOR, that is our new implementation of Bayesian network estimation software..
SiGN-BN is gene network estimation software using Bayesian network model and B-spline nonparametric regression. It can estimate regulatory dependencies between genes as gene networks from gene expression data such as individual cell samples, gene knocked-down cell samples, drug-stimulated time series (time course) samples, and so on. For dynamic data such as time series data, SiGN-BN estimates a dynamic Bayesian network which takes dependencies between time points into account. For static data, it estimates a static (ordinal) Bayesian network that assumes each sample is independent from each other and does not consider temporal changes in expression data.
SiGN-BN is an implementation of the gene network estimation method originally developed by Prof. Seiya Imoto, Dr. Takao Goto and Prof. Satoru Miyano (See [1]). It also implements Prof. Tamada's whole genome gene network estimation method called the NNSR algorithm (See [3]), and parallel optimal structure estimation method (See [6]). Generally, because a Bayesian network requires huge computational time to learn its structure fitted to given gene expression data, it is not widely used for large-scale gene regulatory network analyses. Our research group develops several algorithms to overcome this problem using supercomputers. Currently, three algorithms are available depending on the size of gene networks: (a) the greedy hill-climbing (HC) algorithm + bootstrap method applicable to up to 1000 genes, (b) the neighbor node sampling & repeat (NNSR) algorithm applicable to genome-wide (whole genome) gene networks, and (c) parallel optimal structure search (Para-OS) algorithm for estimating the mathematically optimal network structures for small networks consisting of up to 37 genes.
Currently, executable binaries of SiGN-BN HC+BS and NNSR are freely available for Linux x86-64 systems and R-CCS supercomputer Fugaku.
SiGN-BN: Release 2.0
Tamada, Y., Shimamura, T., Yamaguchi, R., Imoto, S., Nagasaki, M., and Miyano, S., SiGN: Large-scale gene network estimation environment for high performance computing, Genome Informatics, 25 (1), 40-52, 2011. (See in Pubmed)If you used the NNSR algorithm, please cite the following paper in addition to the above:
Tamada, Y., Imoto, S., Araki, H., Nagasaki, M., Print, C., Charnock-Jones, D.S., and Miyano, S., Estimating genome-wide gene networks using nonparametric Bayesian network models on massively parallel computers, IEEE/ACM Transactions on Computational Biology and Bioinformatics, 8 (3), 683-697, 2011. (See in Pubmed)If you used the method for sample-specific evaluation of the estimated edges, which we call edge contribution values (ECvs), please cite the following paper:
Tanaka, Y., Tamada, Y., Ikeguchi, M., Yamashita, F., and Okuno, Y. (2020). System-Based Differential Gene Network Analysis for Characterizing a Sample-Specific Subnetwork, Biomolecules, 10(2), 306. (Pubmed, Full text)See SiGN Publication List for the complete list of our publications using SiGN software. For publications specific to SiGN-BN algorithms, see REFERENCE below.
SiGN-BN is based on the software that was originally developed in Laboratory of DNA Information Analysis, Human Genome Center, Institute of Medical Science, The University of Tokyo, that existed until March 2020, and then it was developed as SiGN-BN in the ISLiM (Next-generation integrated simulation of living matter) project in RIKEN Computational Science Research Program. Currently, it is developed and maintained by Prof. Yoshinori Tamada at Hirosaki University, and is also supported by Department of Biomedical Data Intelligence, Graduate School of Medicine, Kyoto Univeristy and Division of Health Medical Intelligence, Laboratory of Sequene Analysis, Human Genome Center, Institute of Medical Science, The University of Tokyo. Computational resources required for the development of SiGN-BN was provided by the HGC Supercomputer System, Human Genome Center, Institute of Medical Science, The University of Tokyo; and RIKEN Supercomputer system RICC, R-CCS K computer, and supercomputer Fugaku.
[1] Imoto, S., Goto, T., and Miyano, S. (2002). Estimation of genetic networks and functional structures between genes by using Bayesian network and nonparametric regression. Pacific Symposium on Biocomputing, 7, 175-186. (PDF on online proceedings)
[2] Tamada, Y., Shimamura, T., Yamaguchi, R., Imoto, S., Nagasaki, M., and Miyano, S. (2011). SiGN: Large-scale gene network estimation environment for high performance computing, Genome Informatics, 25(1), 40-52. (Pubmed)
[3] Tamada, Y., Imoto, S., Araki, H., Nagasaki, M., Print, C., Charnock-Jones, D.S., and Miyano, S. (2011). Estimating genome-wide gene networks using nonparametric Bayesian network models on massively parallel computers, IEEE/ACM Transactions on Computational Biology and Bioinformatics, 8 (3), 683-697. (Pubmed)
[4] Tamada, Y., Imoto, S., and Miyano, S. (2011). Parallel algorithm for learning optimal Bayesian network structure, Journal of Machine Learning Research, 12, 2437-2459.
[5] Honda, H., Tamada, Y., and Suda, R., (2016). Efficient Parallel Algorithm for Optimal DAG Structure Search on Parallel Computer with Torus Network, In Proceedings of the 16th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP 2016), Lecture Notes in Compute Sicence, 10048, 483-502.
[6] Tamada, Y. (2018). Memory Efficient Parallel Algorithm for Optimal DAG Structure Search using Direct Communication, Journal of Parallel and Distributed Computing, 119, 27-35, 2018.
[7] Tanaka, Y., Tamada, Y., Ikeguchi, M., Yamashita, F., and Okuno, Y. (2020). System-Based Differential Gene Network Analysis for Characterizing a Sample-Specific Subnetwork, Biomolecules, 10(2), 306. (Pubmed)