DCE
Dense Cluster Enumeration in Tensors

News

Introduction

DCE (Dense Cluster Enumeration) is a C++ program that enumerates all subtensor patterns whose density exceeds a user-defined minimum threshold.

Given an n-dimensional data array (tensor), a subtensor pattern is defined by choosing a non-empty subset of each of the n dimension-specific index sets. The density of a subtensor is the average value of its elements. The input tensor can contain binary or real values; however, the current implementation assumes that all values are non-negative. If you would like to analyze mixed-sign tensors, please contact the email address given below.

The output consists of all locally maximal patterns, which can be ranked according to exact probability values. The method can optionally take inherent symmetry constraints into account. Detailed information about the algorithm can be found in the paper "Multi-Way Set Enumeration in Real-Valued Tensors", which appeared in the Proceedings of DMMT'09 (SIGKDD 2009 Workshop on Data Mining using Matrices and Tensors).
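To make the density definition concrete, here is a minimal sketch that averages the entries of one subtensor pattern. It is not taken from the DCE sources: the Tensor struct, the row-major storage layout, and the function name subtensorDensity are assumptions made for illustration only.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper type, not part of DCE: a dense n-way tensor stored
// in row-major order (the last dimension varies fastest).
struct Tensor {
    std::vector<std::size_t> shape;  // size of each of the n dimensions
    std::vector<double> values;      // one entry per index combination
};

// Density of the subtensor selected by one non-empty index subset per
// dimension: the average of all selected entries.
double subtensorDensity(const Tensor& t,
                        const std::vector<std::vector<std::size_t>>& subsets) {
    const std::size_t n = t.shape.size();
    assert(n > 0 && subsets.size() == n);
    for (std::size_t d = 0; d < n; ++d) {
        assert(!subsets[d].empty());
    }

    std::vector<std::size_t> pos(n, 0);  // current position in each subset
    double sum = 0.0;
    std::size_t count = 0;

    while (true) {
        // Convert the current multi-index into a row-major offset.
        std::size_t offset = 0;
        for (std::size_t d = 0; d < n; ++d) {
            const std::size_t index = subsets[d][pos[d]];
            assert(index < t.shape[d]);
            offset = offset * t.shape[d] + index;
        }
        sum += t.values[offset];
        ++count;

        // Advance to the next index combination, odometer style:
        // bump the last dimension first and carry to the left.
        bool advanced = false;
        std::size_t d = n;
        while (d > 0 && !advanced) {
            --d;
            if (++pos[d] < subsets[d].size()) {
                advanced = true;
            } else {
                pos[d] = 0;
            }
        }
        if (!advanced) {
            break;  // every combination has been visited
        }
    }
    return sum / static_cast<double>(count);
}
```

For example, in a 2 x 3 binary matrix with rows (1, 0, 1) and (1, 1, 0), the pattern made of both rows and the first two columns covers the entries 1, 0, 1, 1 and therefore has density 0.75. This sketch only evaluates a single pattern; it does not enumerate patterns and does not test local maximality.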

Download

License: DCE is free software under the GNU General Public License, version 2.

Manual

Compilation:

>> make

Usage:

>> ./dce <input_file> <output_file> <density_threshold> [options]

General usage comments:

Examples:

>> ./dce testdata/test2way.txt testres/test2way_75 0.75
>> ./dce testdata/test4way.txt testres/test4way_100 1.0 --minsize=2
>> ./dce testdata/test4way.txt testres/test4way_100_b 1.0 --minsize=2 --balance=1
>> ./dce testdata/test3waySymm.txt testres/test3waySymm_90_2 0.9 --symmetry=0-1 --maxbranches=2 --ranking=0

Publications

Contact