SiGN-Proc Manual

Introduction

Sign-proc is a CUI-based (command line) tool to process gene network files, including converting file formats, extracting subnetworks, coloring the specified nodes, and so on.

In SiGN-Proc, users can specify filters. A filter processes a network. One filter reads a network from a file. One extracts a sub network. Many filters are available in SiGN-Proc. Users can specify multiple filters. In such a case, a network processed by a filter is passed to the next filter.

Note: This document is currently under development.

Synopsis

Execution on the gateway node of the HGC supercomputer system

~tamada/sign/signbn-hcbs.sh --bin signproc [ options ] [ filters ]

Note 1: Do not execute a program for a long time on the gateway.

Note 2: The execution via signbn-hc.sh is required to set the appropriate environment variables.

Note 3: To use the latest version, use the script and the binary on ~tamada/sign with specifying the --dir ~tamada/sign option.

Execution as an Grid Engine job on the HGC supercomputer system

qsub [ GE options ] ~tamada/sign/signbn-hcbs.sh --bin signproc [ options ] [ filters ]

SGE options

See the manual for SiGN-BN.

Options

--help

Show help message.

Example

Here are some examples. Specify the following as filters above when you execute SiGN-Proc.

File format conversion

--read type=sgn3,file=network.sgn3 --output type=csml,file=network.csml

This example converts the network file network.sgn3 written in the SGN3 format into the CSML format file network.csml. See File Formats for details of the available file formats.

Subnetwork extraction

--read type=csml,file=network.csml --subnet node=IL6,dist=1 --output type=csml,file=subnet.csml

This extracts IL6 and its parents and children from network.csml.

Filters

The filter can be specified by --filter_name followed by its key=value style arguments concatenated by commas. White spaces can be inserted after the camma.


Edge Property filter (--edgeprop)

This removes all edges except for ones whose properties specified by name satisfy the condition specified by op and value.

name=property_name

type=property_type

Property type such as int, double, string, etc...

op= { eq | ge | gt | le | lt }

Operator: the values correspond to equal, greater or equal, greater, less or equall, and less.

value= value

String expression of a value to be compared.

noprop= { stop | remove | ignore }

What to do if the property is not found in an edge.

Examples:

--edgeprop name=BS.Prob,type=double,op=gt,value=0.5

This removes edges whose bootstrap probabilities are less than or equal to 0.5.

--edgeprop name=up/down,type=string,op=eq,value=up

This leaves edges that are estimated as up-regulated ones.


Read filter (--read)

This filter reads a network, data frame, or dataset from a file, and then passes it to the next filter.

type= { edf | frame | network_format }

Type of a file to read. See File Formats for the network file format.

file= file_name


Subnet filter (--subnet)

The subnet filter extracts a sub network based on a node names given by a file or a list of node names.

file=file_name

col=n

node=node1:node2:...

dist=n


Output filter (--output)

Output a network into a file. This passes the received network to the next filter without modifying or processing it. Therefore, by using multiple output filters, users can output a network in files in various formats.

file=file_name

Output file name. The path can be included in file_name.

type=network_format

See File Formats for the available network file format. By default, csml is assumed.

args=\{key1=value1,...\}

Arguments of the file format.

BS filter (--bs)

The BS filter is to compile the bootstrapped networks into a single network.

prefix=file_name_prefix

dynamic

ed=n

bg=n

th=threahold

ver=n

Version. 1 and 2 are available. By default, n=2.
^ Go to Top

Node Color filter (--nodecolor)

file=file_name

The line-by-line file containing the list of node names to color. The column position of the node names in a line can be specified by the col argument below.

col=n

color=r:g:b [:a ]

Color of the node by RGB. The integer value ranging 0 to 255 is acceptable for r (red), g (green), and b (blue). The alpha blending (a) can also be specified optionally. If it is not specified a=255 is assumed by default.

Comp filter (--comp)

The Comp filter is to compare two network structures. This filter does not change the network. Instead, it compares the network with another one specified by the arguments. After the comparison, it prints the comparison result. This regards the network given by the arguments as a true network of the demanded one and counts the number of true positive (TP), false positive (FP), true negative (TN) and false negative (FN) edges of the network passed by SiGN-Proc.

The TP edges are ones that exist in the both network. The FP edges are ones that exist only in the original network passwd by SiGN-Proc. The FN edges are ones that exist only in the true network given by the arguments.

The TP, FP, FN edges can be saved as separate network files by specifying the tp, fp, and fn arguments.

file=file_name

The file name of the true network to read and compare with.

type=file_type

The type of the file format to read. See File Formats for the available file formats.

args=\{key1=value1,...\}

Arguments of the file format.

tp=file_name

If specified, TP edges, i.e., edges that exist in the both networks, are saved as a separate network in a file file_name.

tptype=file_type

File type of the TP network.

tpargs=\{key1=value1,...\}

Arguments of the file format for the TP network.

fp=file_name

If specified, FP edges, i.e., edges that exist only in the network passed by SiGN-Proc, are saved as a separate network in a file file_name.

fptype=file_type

File type of the FP network.

fpargs=\{key1=value1,...\}

Arguments of the file format for the FP network.

fn=file_name

If specified, FN edges, i.e., edges that exist only in the network given by the file argument above, are saved as a separate network in a file file_name.

fntype=file_type

File type of the FN network.

fnargs=\{key1=value1,...\}

Arguments of the file format for the FN network.

Change History

ver. 0.19.0 (2014-08-21 Thu)

ver. 0.16.0 (2014-01-21 Tue)


Copyright © 2012-2015
SiGN Project members and Laboratory of DNA Information Analysis & Laboratory of Sequence Analysis, Human Genome Center, Institute of Medical Science, The University of Tokyo

Data Analysis Fusion Team
RIKEN Computational Science Research Program

All Rights Reserved.

Contact: Yoshinori Tamada <tamada ATMARK is.s.u-tokyo.ac.jp>