SiGN-BN is gene network estimation software using Bayesian network model and nonparametric regression. It can estimate regulatory dependencies between genes as gene networks from gene expression data such as individual cell samples, gene knocked-down cell samples, drug-stimulated time series (time course) samples, and so on. For dynamic data such as time series data, SiGN-BN estimates a dynamic Bayesian network which takes dependencies between time points into account. For static data, it estimates a static (ordinal) Bayesian network that assumes each sample is independent from each other and does not consider temporal changes in expression data.
SiGN-BN implements several algorithms for estimating gene networks using Bayesian network model. Generally, because a Bayesian network requires huge computational time to learn its structure fitted to given gene expression data, it is not widely used for large-scale gene regulatory network analyses. Our research group develops several algorithms to overcome this problem using supercomputers. Currently, three algorithms are available depending on the size of gene networks: (a) the greedy hill-climbing (HC) algorithm + bootstrap method applicable to up to 1000 genes, (b) the neighbor node sampling & repeat (NNSR) algorithm applicable to genome-wide (whole genome) gene networks, and (c) parallel optimal structure search (Para-OS) algorithm for estimating the mathematically optimal network structures for small networks consisting of up to 32 genes.
Currently, SiGN-BN is available for HGC Supercomputer and AICS K computer and/or its compatible systems.
HC+Bootstrap: Release 1.5.7
NNSR: Release 0.14.2
Para-OS: Release 0.1.2
SiGN-BN is developed in the ISLiM (Next-generation integrated simulation of living matter) project in RIKEN Computational Science Research Program. Computational resources required for the development of SiGN-BN was provided by the HGC Supercomputer System, Human Genome Center, Institute of Medical Science, The University of Tokyo; and RIKEN Supercomputer system RICC.
 Imoto, S., Goto, T., and Miyano, S. (2002). Estimation of genetic networks and functional structures between genes by using Bayesian network and nonparametric regression. Pacific Symposium on Biocomputing, 7, 175-186.
 Tamada, Y., Shimamura, T., Yamaguchi, R., Imoto, S., Nagasaki, M., and Miyano, S. (2011). SiGN: Large-scale gene network estimation environment for high performance computing, Genome Informatics, 25 (1), 40-52.
 Tamada, Y., Imoto, S., Araki, H., Nagasaki, M., Print, C., Charnock-Jones, D.S., and Miyano, S. (2011). Estimating genome-wide gene networks using nonparametric Bayesian network models on massively parallel computers, IEEE/ACM Transactions on Computational Biology and Bioinformatics, 8 (3), 683-697.
 Tamada, Y., Imoto, S., and Miyano, S. (2011). Parallel algorithm for learning optimal Bayesian network structure, Journal of Machine Learning Research, 12, 2437-2459.
 Honda, H., Tamada, Y., and Suda, R., (2016). Efficient Parallel Algorithm for Optimal DAG Structure Search on Parallel Computer with Torus Network, In Proceedings of the 16th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP 2016), accepted.