Online Semi-supervised Learning for Multi-target Regression in Data Streams Using AMRules

  • Ricardo SousaEmail author
  • João Gama
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9897)


Most data streams systems that use online Multi-target regression yield vast amounts of data which is not targeted. Targeting this data is usually impossible, time consuming and expensive. Semi-supervised algorithms have been proposed to use this untargeted data (input information only) for model improvement. However, most algorithms are adapted to work on batch mode for classification and require huge computational and memory resources.

Therefore, this paper proposes an semi-supervised algorithm for online processing systems based on AMRules algorithm that handle both targeted and untargeted data and improves the regression model. The proposed method was evaluated through a comparison between a scenario where the untargeted examples are not used on the training and a scenario where some untargeted examples are used. Evaluation results indicate that the use of the untargeted examples improved the target predictions by improving the model.


Multi-target regression Semi-supervised learning AMRules Data streams 



This work was partly supported by the European Commission through MAESTRA (ICT-2013-612944) and the Project TEC4Growth - Pervasive Intelligence, Enhancers and Proofs of Concept with Industrial Impact/NORTE-01-0145-FEDER-000020 is financed by the North Portugal Regional Operational Programme (NORTE 2020), under the PORTUGAL 2020 Partnership Agreement, and through the European Regional Development Fund (ERDF).


  1. 1.
    Borchani, H., Varando, G., Bielza, C., Larrañaga, P.: A survey on multi-output regression. Wiley Int. Rev. Data Min. Knowl. Disc. 5(5), 216–233 (2015)CrossRefGoogle Scholar
  2. 2.
    Levatic, J., Ceci, M., Kocev, D., Dzeroski, S.: Semi-supervised learning for multi-target regression. In: Third International Workshop, NFMCP, Held in Conjunction with ECML-PKDD, pp. 3–18 (2014)Google Scholar
  3. 3.
    Zhou, Z.H., Li, M.: Semi-supervised regression with co-training style algorithms. IEEE Trans. Knowl. Data Eng. 19(11), 1479–1493 (2007)CrossRefGoogle Scholar
  4. 4.
    Duarte J., Gama, J.: Multi-target regression from high-speed data streams with adaptive model rules. In: IEEE Conference on Data Science and Advanced Analytics (2015)Google Scholar
  5. 5.
    Goldberg, A.B., Zhu, X., Furger, A., Jun-Ming, X.: OASIS: online active semi-supervised learning. In: Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, AAAI, San Francisco, California, USA, 7–11 August 2011Google Scholar
  6. 6.
    Kang, P., Kim, D., Cho, S.: Semi-supervised support vector regression based on self-training with label uncertainty: an application to virtual metrology in semiconductor manufacturing. Expert Syst. Appl. 51, 85–106 (2016)CrossRefGoogle Scholar
  7. 7.
    Ozoh, P., Abd-rahman, S., Labadin, J., Apperley, M.: Article: a comparative analysis of techniques for forecasting electricity consumption. Int. J. Comput. Appl. 88(15), 8–12 (2014)Google Scholar
  8. 8.
    Chalabi, Z., Mangtani, P., Hashizume, M., Imai, C., Armstrong, B.: Article: time series regression model for infectious disease and weather. Int. J. Environ. Res. 142, 319–327 (2015)Google Scholar
  9. 9.
    Uslana, H.S.V.: Article: quantitative prediction of peptide binding afnity by using hybrid fuzzy support vector regression. Appl. Soft Comput. 43, 210–221 (2016)CrossRefGoogle Scholar
  10. 10.
    Ariyo, A.A., Adewumi, A.O., Ayo, C.K.: Stock price prediction using the arima model. In: Proceedings of the UKSim-AMSS 16th International Conference on Computer Modelling and Simulation, UKSIM 2014, pp. 106–112, Washington, DC, USA. IEEE Computer Society (2014)Google Scholar
  11. 11.
    Chapelle, O., Schlkopf, B., Zien, A.: Semi-Supervised Learning, 1st edn. The MIT Press, Cambridge (2010)Google Scholar
  12. 12.
    Albalate, A., Minker, W.: Semi-supervised and Unsupervised Machine Learning. ISTE/Wiley, London (2011)Google Scholar
  13. 13.
    Verbeek, J.J., Vlassis, N.: Gaussian fields for semi-supervised regression and correspondence learning. Pattern Recogn. 39(10), 1864–1875 (2006)CrossRefzbMATHGoogle Scholar
  14. 14.
    Radosavljevic, V., Vucetic, S., Obradovic, Z.: Continuous conditionalrandom fields for regression in remote sensing. In: 19th European Conference on Artificial Intelligence, Proceedings of the 2010 Conference on ECAI 2010, pp. 809–814, Amsterdam, The Netherlands. IOS Press (2010)Google Scholar
  15. 15.
    Stojanovic, J., Jovanovic, M., Gligorijevic, D., Obradovic, Z.: Semi-supervised learning for structured regression on partially observed attributed graphs. In: SIAM International Conference on Data Mining (SDM) (2015)Google Scholar
  16. 16.
    Bhattacharyya, B.B.: One sided Chebyshev inequality when the first four moments are known. Commun. Stat. Theor. Methods 16(9), 2789–2791 (1987)MathSciNetCrossRefzbMATHGoogle Scholar
  17. 17.
    Gama, J., Sebastião, R., Rodrigues, P.P.: On evaluating stream learning algorithms. Mach. Learn. 90(3), 317–346 (2013)MathSciNetCrossRefzbMATHGoogle Scholar
  18. 18.
    Chen, W.: Passive, Active, and Digital Filters, 3rd edn. CRC Press, Baco Raton (2009)Google Scholar
  19. 19.
    Friedman, J.H.: Multivariate adaptive regression splines. Ann. Stat. 19(1), 1–67 (1991)MathSciNetCrossRefzbMATHGoogle Scholar

Copyright information

© Springer International Publishing AG 2016

Authors and Affiliations

  1. 1.LIAAD/INESC TECUniversidade do PortoPortoPortugal
  2. 2.Faculdade de EconomiaUniversidade do PortoPortoPortugal

Personalised recommendations