Skip to main content

The Maximum Weighted Submatrix Coverage Problem: A CP Approach

  • Conference paper
  • First Online:
Integration of Constraint Programming, Artificial Intelligence, and Operations Research (CPAIOR 2019)

Abstract

The objective of the maximum weighted submatrix coverage problem (MWSCP) is to discover K submatrices that together cover the largest sum of entries of the input matrix. The special case of \(K=1\) called the maximal-sum submatrix problem was successfully solved with CP. Unfortunately, the case of \(K>1\) is more difficult to solve as the selection of the rows of the submatrices cannot be decided in polynomial time solely from the selection of K sets of columns. The search space is thus substantially augmented compared to the case \(K=1\). We introduce a complete CP approach for solving this problem efficiently composed of the major CP ingredients: (1) filtering rules, (2) a lower bound, (3) dominance rules, (4) variable-value heuristic, and (5) a large neighborhood search. As the related biclustering problem, MWSCP has many practical data-mining applications such as gene module discovery in bioinformatics. Through multiple experiments on synthetic and real datasets, we provide evidence of the practicality of the approach both in terms of computational time and quality of the solutions discovered.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Notice that the optimal solution may be slightly different than the implanted submatrices because of the noise addition.

References

  1. Aoga, J.O.R., Guns, T., Schaus, P.: An efficient algorithm for mining frequent sequence with constraint programming. In: Frasconi, P., Landwehr, N., Manco, G., Vreeken, J. (eds.) ECML PKDD 2016. LNCS (LNAI), vol. 9852, pp. 315–330. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46227-1_20

    Chapter  Google Scholar 

  2. Bentley, J.: Programming pearls: algorithm design techniques. Commun. ACM 27(9), 865–873 (1984)

    Article  MathSciNet  Google Scholar 

  3. Bessiere, C., De Raedt, L., Kotthoff, L., Nijssen, S., O’Sullivan, B., Pedreschi, D. (eds.): Data Mining and Constraint Programming. LNCS (LNAI), vol. 10101. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-50137-6

    Book  Google Scholar 

  4. Branders, V., Schaus, P., Dupont, P.: Mining a sub-matrix of maximal sum. In: Proceedings of the 6th International Workshop on New Frontiers in Mining Complex Patterns in Conjunction with ECML-PKDD 2017 (2017)

    Google Scholar 

  5. Chabert, M., Solnon, C.: Constraint programming for multi-criteria conceptual clustering. In: Beck, J.C. (ed.) CP 2017. LNCS, vol. 10416, pp. 460–476. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-66158-2_30

    Chapter  Google Scholar 

  6. Dao, T., Docquier, F., Maurel, M., Schaus, P.: Global migration in the 20th and 21st centuries: the unstoppable force of demography (2018)

    Google Scholar 

  7. Duong, K.C., Vrain, C., et al.: Constrained clustering by constraint programming. Artif. Intell. 244, 70–94 (2017)

    Article  MathSciNet  Google Scholar 

  8. Gervet, C.: Interval propagation to reason about sets: definition and implementation of a practical language. Constraints 1(3), 191–244 (1997)

    Article  MathSciNet  Google Scholar 

  9. Guillaume, D., Vincent, B., Pierre, D., Pierre, S.: Synthetic dataset used in "the maximum weighted submatrix coverage problem: A CP approach, November 2018. https://doi.org/10.5281/zenodo.1688740

  10. IOC Research and Reference Service, The Guardian: Olympic sports and medals 1896–2014. https://www.kaggle.com/the-guardian/olympic-games

  11. Karp, R.M.: Reducibility among combinatorial problems. In: Miller, R.E., Thatcher, J.W., Bohlinger, J.D. (eds.) Complexity of Computer Computations, pp. 85–103. Springer, Boston (1972). https://doi.org/10.1007/978-1-4684-2001-2_9

    Chapter  Google Scholar 

  12. Kuo, C.T., Ravi, S., Vrain, C., Davidson, I., et al.: Descriptive clustering: ILP and CP formulations with applications. In: IJCAI-ECAI 2018, the 27th International Joint Conference on Artificial Intelligence and the 23rd European Conference on Artificial Intelligence (2018)

    Google Scholar 

  13. Michel, L., Schaus, P., Van Hentenryck, P.: MiniCP: a lightweight solver for constraint programming (2018). https://minicp.bitbucket.io

  14. Le Van, T., van Leeuwen, M., Nijssen, S., Fierro, A.C., Marchal, K., De Raedt, L.: Ranked tiling. In: Calders, T., Esposito, F., Hüllermeier, E., Meo, R. (eds.) ECML PKDD 2014. LNCS (LNAI), vol. 8725, pp. 98–113. Springer, Heidelberg (2014). https://doi.org/10.1007/978-3-662-44851-9_7

    Chapter  Google Scholar 

  15. Madeira, S.C., Oliveira, A.L.: Biclustering algorithms for biological data analysis: a survey. IEEE/ACM Trans. Comput. Biol. Bioinform. (TCBB) 1(1), 24–45 (2004)

    Article  Google Scholar 

  16. OscaR Team: OscaR: Scala in OR (2012). https://bitbucket.org/oscarlib/oscar

  17. de Saint-Marcq, V.L.C., Schaus, P., Solnon, C., Lecoutre, C.: Sparse-sets for domain implementation. In: CP workshop on Techniques for Implementing Constraint Programming Systems (TRICS), pp. 1–10 (2013)

    Google Scholar 

  18. Schaus, P., Aoga, J.O.R., Guns, T.: CoverSize: a global constraint for frequency-based itemset mining. In: Beck, J.C. (ed.) CP 2017. LNCS, vol. 10416, pp. 529–546. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-66158-2_34

    Chapter  MATH  Google Scholar 

  19. Shaw, P.: Using constraint programming and local search methods to solve vehicle routing problems. In: Maher, M., Puget, J.-F. (eds.) CP 1998. LNCS, vol. 1520, pp. 417–431. Springer, Heidelberg (1998). https://doi.org/10.1007/3-540-49481-2_30

    Chapter  Google Scholar 

  20. de Souto, M.C., Costa, I.G., de Araujo, D.S., Ludermir, T.B., Schliep, A.: Clustering cancer gene expression data: a comparative study. BMC Bioinform. 9(1), 497 (2008)

    Article  Google Scholar 

Download references

Acknowledgments

Computational resources have been provided by the Consortium des Équipements de Calcul Intensif (CCI), funded by the Fonds de la Recherche Scientifique de Belgique (F.R.S.-FNRS) under Grant No. 2.5020.11.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Guillaume Derval .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Derval, G., Branders, V., Dupont, P., Schaus, P. (2019). The Maximum Weighted Submatrix Coverage Problem: A CP Approach. In: Rousseau, LM., Stergiou, K. (eds) Integration of Constraint Programming, Artificial Intelligence, and Operations Research. CPAIOR 2019. Lecture Notes in Computer Science(), vol 11494. Springer, Cham. https://doi.org/10.1007/978-3-030-19212-9_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-19212-9_17

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-19211-2

  • Online ISBN: 978-3-030-19212-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics