Causality challenge #2: Pot-Luck

The Causality Workbench Team - Isabelle Guyon, Constantin Aliferis, Greg Cooper, André Elisseeff, Jean-Philippe Pellet, Peter Spirtes

We organized for NIPS 2008 a challenge in causality. The challenge was organized around a number of proposed tasks, but the participants were encouraged to bring their own problems (hence the name "pot-luck"). The initial five tasks were:
  1. CYTO: Causal Protein-Signaling Networks in human T cells. Learn a protein signaling network from multicolor flow cytometry data. N=11 proteins, P~800 samples per experimental condition. E=9 conditions.
  2. LOCANET: LOcal CAusal NETwork. Find the local causal structure around a given target variable (depth 3 network) in REGED, CINA, SIDO, MARTI.
  3. PROMO: Simulated marketing task. Time series of 1000 promotion variables and 100 product sales. Predict a 1000x100 boolean influence matrix, indicating for each (i,j) element whether the ith promotion has a causal influence of the sales of the jth product. Data is provided as time series, with a daily value for each variable for three years.
  4. SIGNET: Abscisic Acid Signaling Network. Determine the set of 43 boolean rules that describe the interactions of the nodes within a plant signaling network. 300 separate Boolean pseudodynamic simulations of the true rules. Model inspired by a true biological system.
  5. TIED: Target Information Equivalent Dataset. Illustrates a case in which there are many equivalent Markov boundaries. Find them all.
Dozens of researchers tried some of the tasks and/or worked on preparing new tasks. Two new contributed datasets were added in the course of the challenge.
  1. CauseEffetPairs: Find the causal direction in eight pairs of variables. This task is built from real time series of weather data (e.g., temperature and altitude for a pair).
  2. STEMMATOLOGY: Reconstruct a family tree of documents derived from one another.
In spite of the fact that it was introduced only mid-way through the challenge, the  CauseEffetPairs task received a lot of attention and the winners, Kun Zhang and Aapo Hyvärinen (University of Helsinki, Finland) correctly discovered the cause-effect direction in all 8 pairs of variables. This result was found statistically significant in a sign rank test (with risk <1%). Five other datasets were contributed, but were not ready soon enough to be made part of the challenge. One of them donated by Guido Nolte (Fraunhofer FIRST, Berlin, Germany) the NOISE dataset of EEG signals, won the best dataset award. The new donated datasets are available from our repository and we intend to organize new events around them.
Significant progresses were made on several other tasks of the challenge. Noteworthy are five contributions, which received special mentions:

NIPS 2008 workshop on causality