Beyond Bichitra

Chaudhuri, Sukanta

doi:10.1007/978-3-319-23678-0_10

Sukanta Chaudhuri⁷

Part of the book series: Quantitative Methods in the Humanities and Social Sciences ((QMHSS))

347 Accesses

Abstract

This chapter is divided into three parts. The first suggests some improvements to Bichitra as it stands, like direct access to particular images, synchronization of image and transcript, and provision of more links. The second suggests value additions in the form of new functions like topic modelling and more extensive multimedia components. It also points out the innovation involved in working these functions in non-Latin fonts. The last and longest section, ‘New Directions’, comprises the project director’s suggestions and speculations about using the Bichitra corpus for further development of textual computing generally. In particular, he argues for the special promise held out by large textual corpora in developing more precise ways of analyzing ‘big data’, paying more attention to their detailed contents. To achieve this, he suggests a series of progressively refined investigations of textual data in the light of its context: semantic and syntactical, literal and metaphoric.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Hardcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Programs like ‘Knowledge Vault’ (Dong et al. 2014) parse big data semantically and/or syntactically to extract knowledge, but they chiefly extract the data automatically by measuring multiple occurrence. Google’s Knowledge Graph goes far in this direction. But all such programs extract knowledge out of the text, rather than probe more deeply into the verbal dimension per se. Their thrust is informational, not textual. Yet such in-depth textual analysis would be invaluable in creating a new generation of knowledge-extraction programs.
2.
In fact, infrequent items may be excluded on the grounds of small sample size (Mimno 2012, 5).
3.
See Mimno 2012 for an illustration of some basic possibilities in practice, especially the ‘broad areas’ indicated on page 17.
4.
‘The linguistic sign unites, not a thing and a name, but a concept and a sound-image.’ (Saussure, 1975, 66). This notion underlies Ogden and Richards’s ‘semantic triangle’ linking an object, the thought of that object, and the word or sign representing it (Ogden and Richards 1923, 5).
5.
See, e.g., Kwiatkowski et al. 2013, Das et al. 2014.
6.
Even the elaborate ontology of the ambitious ‘Read the Web’, designed to comb the resources of the entire Web, currently comprises factual or firmly denotative categories alone (Read the Web, http://rtw.ml.cmu.edu/rtw/kbbrowser/)—almost necessarily in view of the vast corpus involved.
7.
Hayles 2002: see especially Chap. 2.

References

Das, Dipanjan et al., 2014. Frame-Semantic Parsing. Computational Linguistics 40:1, 10–56. http://www.mitpressjournals.org/doi/pdf/10.1162/COLI_a_00163. Accessed 12 January 2015.
Docuscope. http://www.cmu.edu/hss/english/research/docuscope.html. Accessed 24 December 2014.
Dong, Xin Luna et al. 2014. Knowledge Vault: A Web-Scale Approach to Probabilistic Knowledge Fusion. Proceedings on the Conference for Knowledge Discovery and Data-Mining. https://www.cs.cmu.edu/~nlao/publication/2014.kdd.pdf. Accessed 12 January 2015.
Hayles, N. Katherine. 2002. Writing Machines. Cambridge, Mass: MIT.
Google Scholar
Hope, Jonathan and Michael Witmore. 2004. The Very Large Textual Object: A Prosthetic Reading of Shakespeare. Early Modern Literary Studies 9.3/Special Issue 12, January. http://extra.shu.ac.uk/emls/09-3/hopewhit.htm. Accessed 24 December 2014.
Kwiatkowski Tom et al. 2013. Scaling Semantic Parsers with On-the-Fly Ontology Matching. Proceedings of the Conference on Empirical Methods in Natural Language Processing, October: 1545–56. http://homes.cs.washington.edu/~lsz/papers/kcaz-emnlp13.pdf. Accessed 12 January 2015.
Mayer-Schönberger, Viktor and Kenneth Cukier. 2013. Big Data: A Revolution That Will Transform How We Live, Work and Think. London: John Murray.
Google Scholar
Mimno, David. 2012. Computational Historiography: Data Mining in a Century of Classics Journals. Journal on Computing and Cultural Heritage, 5:1, 1–19.
Article Google Scholar
Montaigne, Michel de. 1958. Complete Essays. Trans. Donald M. Frame, 1957. Stanford: Stanford University Press.
Google Scholar
Moretti, Franco. 2005. Graphs, Maps, Trees: Abstract Models for Literary History. London: Verso.
Google Scholar
Ogden, C.K. and I.A. Richards. 1923. The Meaning of Meaning. London: Kegan Paul.
Google Scholar
Read the Web. http://rtw.ml.cmu.edu/rtw/. Accessed 12 January 2015.
Saussure, Ferdinand de. 1975. Course in General Linguistics. Trans. Wade Baskin, 1959. London: Fontana/Collins.
Google Scholar
Stray, Jonathan. 2010. A Full-Text Visualization of the Iraq War Logs. http://jonathanstray.com/a-full-text-visualization-of-the-iraq-war-logs. Accessed 12 January 2015.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of English, Jadavpur University, Kolkata, India
Sukanta Chaudhuri

Authors

Sukanta Chaudhuri
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sukanta Chaudhuri .

Editor information

Editors and Affiliations

Department of English, Jadavpur University, Kolkata, India
Sukanta Chaudhuri

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Chaudhuri, S. (2015). Beyond Bichitra. In: Chaudhuri, S. (eds) Bichitra: The Making of an Online Tagore Variorum. Quantitative Methods in the Humanities and Social Sciences. Springer, Cham. https://doi.org/10.1007/978-3-319-23678-0_10

Download citation

DOI: https://doi.org/10.1007/978-3-319-23678-0_10
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23677-3
Online ISBN: 978-3-319-23678-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics