Abstract
In the last years, Universities have created an office of placement to facilitate the employability of graduates. University placement offices select for companies, which offer a job and/or training position, a large number of graduates only based on degree and grades.
We adapt c-means algorithm to discover professional profiles from job announcements. We analyse 1,650 job announcements collected in DB SOUL since January 1st, 2010 to April 5th, 2011.
Keywords
- Fuzzy Cluster
- Professional Profile
- Term Frequency Inverse Document Frequency
- Term Frequency Inverse Document Frequency
- Placement Office
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
Currently the DB SOUL collects 52,000 graduate CVs, of which about 27,000 come from “Sapienza,” 7,500 from “Roma Tre,” 2,500 from “Tor Vergata,” and 15,000 from other universities not only of the Lazio region (LUMSA, LUISS, Tuscia, and Cassino), but also from other regions (e.g., Napoli Federico II, Salerno, Bari, Bologna, Chieti-Pescara, Lecce).
References
Aureli, E., & Iezzi, D. F. (2006). Recruitment via web and information technology: A model for ranking the competences in job market. JADTm 1, 79–88.
Bezdek, J. C. (1981). Pattern recognition with fuzzy objective function algorithms. New York: Plenum.
Bolasco, S. (2010). TALTAC 2.10. Sviluppi, esperienze ed elementi essenziali di analisi automatica di testi. ROMA: LED.
Bolasco, S., & Pavone, P. (2008). Multi-class categorization based on cluster analysis and TFIDF. In S. Heiden & B. Pincemin (Eds.), JADT 2008 (Vol. 1, pp. 209–218). Lyon: Presses Universitaires de Lyon.
Borg, I., & Groenen, P. (2006). Modern multidimensional scaling: Theory and applications (Springer series in statistics). New York: Springer.
CEDEFOP. (2010). Skills supply and demand in Europe Medium-term forecast up to 2020. Luxembourg: Publication Office of the European Union.
Duda, R. O., Hart, P. E., & Stork, D. G. (2000). Pattern classification (2nd ed.). New York: Wiley.
Dunn, J. C. (1973). A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. Journal of Cybernetics, 3, 32–57.
Fabbris, L. (2009). I servizi a supporto degli studenti universitari. Padova: CLEUP.
Giuliano, L., & la Rocca, G. (2008). L’Analisi automatic e semi-automatica dei dati testuali. Software e istruzioni per l’uso. Milano: LED.
Houle, M. E., Kriegel, H. P., Kröger, P., Schubert, E., Zimek, A. (2010). Can shared-neighbor distances defeat the curse of dimensionality? Scientific and statistical database management. Lecture Notes in Computer Science, 6187, 482. doi:10.1007/978-3-642-13818-8_34. ISBN 978-3-642-13817-1.
Iezzi, D. F. (2005). A new method to measure the quality on teaching evaluation of the university system: the Italian case. Social Indicators Research, 73(3), 459–477. ISSN: 0303-8300.
Iezzi, D. F. (2008). I lavoratori come informatori delle qualità delle professioni. In F. Luigi (Ed.), Definire figure professionali tramite testimoni privilegiati, vol. 1 (pp. 135–151). Padova: Cleup.
Iezzi, D. F. (2009). La valutazione dell'efficacia esterna dei corsi universitari: problemi e prospettive. In: M. Di Monte, M. Rotili (Eds), Vincoli/constraints, vol. Sensibilia 2 (pp. 229–246). Milano: Mimesis. ISBN: 978-88-5750-065-2.
Iezzi D. F. (2010). Topic connections and clustering in text mining: An analysis of the JADT network. Statistical Analysis of Textual Data, Rome, Italy, 2(29), 719–730.
Iezzi, D. F. (Ed.). (2011). Indicatori e metodologie per la valutazione dell’efficacia del sistema universitario. Padova: CLEUP.
Iezzi, D. F. (2012a). Centrality measures for text clustering. Communications in Statistics – Theory and Methods, 41(1), 3179–3197.
Iezzi, D. F. (2012b). A new method for adapting the k-means algorithm to text mining. Italian Journal of Applied Statistics, 22(1), 69–80.
ISTAT. (2010a). I laureati e il mondo del lavoro. Roma: ISTAT.
ISTAT. (2010b). Forze di lavoro. Media 2009. Roma: ISTAT.
OCDE. (2010). Education at a glance 2010: OECD indicators. Paris: OCDE.
Pal, N. R., Bezdek, J. C., & Hathaway, R. J. (1996). Sequential competitive learning and the fuzzy c-means clustering algorithms. Neural Networks, 9(5), 787–796.
Rousseeuw, P. J. (1987). Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Computational and Applied Mathematics, 20, 53–65.
Theodoridis, S., & Koutroumbas, K. (2006). Pattern recognition (3rd ed.). Amsterdam: Elsevier.
Acknowledgment
The authors would like to particularly thank the SOUL organization for data collection, and signately Prof. Carlo Magni for his contribution to the discussion of this paper.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer International Publishing Switzerland
About this paper
Cite this paper
Iezzi, D.F., Mastrangelo, M., Sarlo, S. (2013). A New Fuzzy Method to Classify Professional Profiles from Job Announcements. In: Giudici, P., Ingrassia, S., Vichi, M. (eds) Statistical Models for Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Heidelberg. https://doi.org/10.1007/978-3-319-00032-9_18
Download citation
DOI: https://doi.org/10.1007/978-3-319-00032-9_18
Published:
Publisher Name: Springer, Heidelberg
Print ISBN: 978-3-319-00031-2
Online ISBN: 978-3-319-00032-9
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)