CHISSL: A Human-Machine Collaboration Space for Unsupervised Learning

  • Dustin ArendtEmail author
  • Caner Komurlu
  • Leslie M. Blaha
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10284)


We developed CHISSL, a human-machine interface that utilizes interactive supervision to help the user group unlabeled instances by her own mental model. The user primarily interacts via correction (moving a misplaced instance into its correct group) or confirmation (accepting that an instance is placed in its correct group). Concurrent with the user’s interactions, CHISSL trains a classification model guided by the user’s grouping of the data. It then predicts the group of unlabeled instances and arranges some of these alongside the instances manually organized by the user. We hypothesize that this mode of human and machine collaboration is more effective than Active Learning, wherein the machine decides for itself which instances should be labeled by the user. We found supporting evidence for this hypothesis in a pilot study where we applied CHISSL to organize a collection of handwritten digits.


Human-machine interface Interactive clustering Active learning Semi-supervised learning Direct manipulation 



The research described in this document was sponsored the U.S. Department of Energy (DOE) through the Analysis in Motion Initiative at Pacific Northwest National Laboratory. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the U.S. Government.


  1. 1.
    Amershi, S., Fogarty, J., Weld, D.: Regroup. In: Proceedings of the 2012 ACM Annual Conference on Human Factors in Computing Systems - CHI 2012, p. 21. ACM, New York (2012)Google Scholar
  2. 2.
    Basu, S., Drucker, S.M., Lu, H.: Assisting users with clustering tasks by combining metric learning and classification. In: AAAI, pp. 394–400 (2010)Google Scholar
  3. 3.
    Fuchs, J., et al.: The influence of contour on similarity perception of star glyphs. IEEE Trans. Vis. Comput. Graph. 20(12), 2251–2260 (2014)CrossRefGoogle Scholar
  4. 4.
    Bilenko, M., Basu, S., Mooney, R.J.: Integrating constraints and metric learning in semi-supervised clustering. In: Twenty-First International Conference on Machine Learning - ICML 2004, p. 11 (2004)Google Scholar
  5. 5.
    Borgo, R., Kehrer, J., Chung, D.H.S., Maguire, E., Laramee, R.S., Hauser, H., Ward, M., Chen, M.: Glyph-based visualization: foundations, design guidelines, techniques and applications. In: Eurographics State of the Art Reports, pp. 39–63 (2013)Google Scholar
  6. 6.
    Brown, E.T., Liu, J., Brodley, C.E., Chang, R.: Dis-function: learning distance functions interactively. In: 2012 IEEE Conference on Visual Analytics Science and Technology (VAST), pp. 83–92. IEEE (2012)Google Scholar
  7. 7.
    Chi, M.T., Glaser, R., Farr, M.J.: The Nature of Expertise. Psychology Press, Hove (2014)zbMATHGoogle Scholar
  8. 8.
    Choo, J., Lee, H., Kihm, J., Park, H.: iVisClassifier: an interactive visual analytics system for classification based on supervised dimension reduction. In: 2010 IEEE Symposium on Visual Analytics Science and Technology, pp. 27–34. IEEE (2010)Google Scholar
  9. 9.
    Chuang, J., Ramage, D., Manning, C.D., Heer, J.: Interpretation and trust: designing model-driven visualizations for text analysis. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 443–452 (2005)Google Scholar
  10. 10.
    Cohn, D., Caruana, R.: Semi-supervised clustering: incorporating user feedback to improve cluster utility. In: Proceedings of the Conference on Artificial Intelligence (2000)Google Scholar
  11. 11.
    Daszykowski, M., Walczak, B.: Density-based clustering methods. In: Comprehensive Chemometrics, vol. 96, pp. 635–654. Elsevier (2009)Google Scholar
  12. 12.
    Endert, A., Fiaux, P., North, C.: Semantic interaction for sensemaking: inferring analytical reasoning for model steering. IEEE Trans. Vis. Comput. Graph. 18(12), 2879–2888 (2012)CrossRefGoogle Scholar
  13. 13.
    Gauthier, I., Skudlarski, P., Gore, J.C., Anderson, A.W.: Expertise for cars and birds recruits brain areas involved in face recognition. Nat. Neurosci. 3(2), 191–197 (2000)CrossRefGoogle Scholar
  14. 14.
    Goldstone, R.: An efficient method for obtaining similarity data. Behav. Res. Methods, Instrum. Comput. 26(4), 381–386 (1994)CrossRefGoogle Scholar
  15. 15.
    Hubert, L., Arabie, P.: Comparing partitions. J. Classif. 2(1), 193–218 (1985)CrossRefzbMATHGoogle Scholar
  16. 16.
    Kaynak, C.: Methods of combining multiple classifiers and their applications to handwritten digit recognition. Master’s thesis, Institute of Graduate Studies in Science and Engineering, Bogazici University (1995)Google Scholar
  17. 17.
    Kulesza, T., Amershi, S., Caruana, R., Fisher, D., Charles, D.: Structured labeling for facilitating concept evolution in machine learning. In: Proceedings of the 32nd Annual ACM Conference on Human Factors in Computing Systems - CHI 2014, pp. 3075–3084. ACM, New York (2014)Google Scholar
  18. 18.
    Lewis, D.D., Gale, W.A.: A sequential algorithm for training text classifiers. In: Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 3–12. Springer, New York (1994)Google Scholar
  19. 19.
    Lichman, M.: UCI machine learning repository (2013).
  20. 20.
    Poggio, T., Cauwenberghs, G.: Incremental and decremental support vector machine learning. Adv. Neural Inf. Process. Syst. 13, 409 (2001)Google Scholar
  21. 21.
    Roy, N., McCallum, A.: Toward optimal active learning through sampling estimation of error reduction. In: ICML, pp. 441–448 (2001)Google Scholar
  22. 22.
    Tanahashi, Y., Hsueh, C.-H., Ma, K.-L.: An efficient framework for generating storyline visualizations from streaming data. IEEE Trans. Vis. Comput. Graph. 21(6), 730–742 (2015)CrossRefGoogle Scholar
  23. 23.
    Seung, H.S., Opper, M., Sompolinsky, H.: Query by committee. In: Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pp. 287–294. ACM (1992)Google Scholar
  24. 24.
    Smith, J.D., Berg, M.E., Cook, R.G., Murphy, M.S., Crossley, M.J., Boomer, J., Spiering, B., Beran, M.J., Church, B.A., Ashby, F.G., et al.: Implicit and explicit categorization: a tale of four species. Neurosci. Biobehav. Rev. 36(10), 2355–2369 (2012)CrossRefGoogle Scholar
  25. 25.
    Srivastava, A., Zou, J., Sutton, C.: Clustering with a Reject Option: Interactive Clustering as Bayesian Prior Elicitation, pp. 120–126, February 2016Google Scholar
  26. 26.
    Tanaka, J.W., Curran, T., Sheinberg, D.L.: The training and transfer of real-world perceptual expertise. Psychol. Sci. 16(2), 145–151 (2005)CrossRefGoogle Scholar
  27. 27.
    Vapnik, V.N., Vapnik, V.: Statistical Learning Theory, vol. 1. Wiley, New York (1998)zbMATHGoogle Scholar
  28. 28.
    Yu, H.F., Huang, F.L., Lin, C.J.: Dual coordinate descent methods for logistic regression and maximum entropy models. Mach. Learn. 85(1–2), 41–75 (2011)MathSciNetCrossRefzbMATHGoogle Scholar
  29. 29.
    Zhu, X.: Semi-supervised learning literature survey. Technical report 1530, Computer Sciences, University of Wisconsin-Madison (2005)Google Scholar
  30. 30.
    Liu, S., et al.: Storyflow: tracking the evolution of stories. IEEE Trans. Vis. Comput. Graph. 19(12), 2436–2445 (2013)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  • Dustin Arendt
    • 1
    Email author
  • Caner Komurlu
    • 2
  • Leslie M. Blaha
    • 1
  1. 1.Pacific Northwest National LaboratoryRichlandUSA
  2. 2.Illinois Institute of TechnologyChicagoUSA

Personalised recommendations