A Novel System for Generating Simple Sentences from Complex and Compound Sentences

Full Text (PDF, 440KB), PP.57-64

Views: 0 Downloads: 0

Author(s)

Bidyut Das 1,* Mukta Majumder 2 Santanu Phadikar 3

1. Department of Information Technology, Haldia Institute of Technology, Haldia-721657, India

2. Department of Computer Science and Application, University of North Bengal, Siliguri-734013, India

3. Department of Computer Science and Engineering, Maulana Abul Kalam Azad University of Technology, West Bengal-700064, India

* Corresponding author.

DOI: https://doi.org/10.5815/ijmecs.2018.01.06

Received: 22 Oct. 2017 / Revised: 9 Nov. 2017 / Accepted: 29 Nov. 2017 / Published: 8 Jan. 2018

Index Terms

Sentence simplification, Dependency parsing, Co-reference resolution, Information extraction, Natural language processing

Abstract

In the field of natural language processing, simple sentence has a great importance; especially for multiple choice question generation, automatic text summarization, opinion mining, machine translation and information retrieval etc. Most of these tasks use simple sentences and include a sentence simplification module as pre-processing or post-processing task. But dedicated tasks for sentence simplification are hardly found. Here we have proposed a novel system for generating simple sentences from complex and compound sentences. Our proposed system is an initiative for simplifying sentence by converting complex and compound sentences into simple ones. Along with this the system classifies the simple sentences of an input corpus from other types of sentences. To generate simple sentences from complex and compound sentences we have proposed a novel algorithm which takes the dependency parsing of the input text and produce simple sentences as output. The experimental result demonstrates that the proposed technique is a promising one.

Cite This Paper

Bidyut Das, Mukta Majumder, Santanu Phadikar, "A Novel System for Generating Simple Sentences from Complex and Compound Sentences", International Journal of Modern Education and Computer Science(IJMECS), Vol.10, No.1, pp. 57-64, 2018.DOI: 10.5815/ijmecs.2018.01.06

Reference

[1] S. Wubben, A.V.D. Bosch, E. Krahmer, “Sentence simpli_cation by monolingual machine translation,” Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Long Papers-Volume 1, pp. 1015-1024, 2012.

[2] D. Vickrey, D. Koller, “Sentence simpli_cation for semantic role labeling,” Proceedings of the Association for Computational Linguistics, pp. 344-352, 2008.

[3] C. Poornima, V. Dhanalakshmi, K.M. Anand, K.P. Soman, “Rule based sentence simplification for english to tamil machine translation system”, International Journal of Computer Applications vol. 25(8), pp. 38-42, 2011.

[4] A. Bawakid, M. Oussalah, “Sentences simplification for automatic summarization,” Proceedings of IEEE 10th International Conference of Cybernetic Intelligent Systems (CIS), IEEE, pp. 59-64, 2011.

[5] F.A. Tarouti, J, Kalita, C. McGrory, “Sentence simplification for question generation,” Proceedings of the International Conference on Computing and Communication Systems, 2015, http://www.cs.uccs.edu/ jkalita/papers/2015/AltaroutiFerasI3CS2015.pdf

[6] M. Majumder, S.K. Saha, “A system for generating multiple choice questions: With a novel approach for sentence selection,” ACL (Association for Computational Linguistics), IJCNLP, p. 64, 2015.

[7] T.B. McArthur, F. McArthur, “The Oxford Companion to the English Language, Oxford Companions Series,” Oxford University Press, 1992, https://books.google.co.in/ books?id=yIoYAAAAIAAJ

[8] B. Backman, “Building Sentence Skills: Tools for Writing the Amazing English Sentence,” Teacher Created Resources, Incorporated: Middle School Series, 2003, https://books.google.co.in/books?id=n-0wXZf4In4C

[9] G. Lutz, D. Stevenson, “The Writer's Digest Grammar Desk Reference,” F+W Media, 2005, https://books.google.co.in/books?id=SsQ9ugnMcpUC

[10] F. Obrecht, “Minimum Essentials of English. Barron's Educational Series, 1999, https://books.google.co.in/ books?id= j4yE7y5c NoC

[11] T.P. Klammer, R.M. Shultz, A.D. Volpe, “Analyzing English Grammar,” Pearson Education, India, 2007

[12] R. Chandrasekar, C. Doran, B. Srinivas, “Motivations and methods for text simplification,” Proceedings of the 16th Conference on Computational linguistics (Association for Computational Linguistics), vol. 2, pp. 1041-1044, 1996

[13] Z. Zhu, D. Bernhard, I. Gurevych, “A monolingual tree-based translation model for sentence simplification,” Proceedings of the 23rd International Conference on Computational Linguistics (Association for Computational Linguistics), pp. 1353-1361, 2010.

[14] M. Heilman, N.A. Smith, “Extracting simplified statements for factual question generation,” Proceedings of QG2010: the Third Workshop on Question Generation, pp. 11-20, 2010.

[15] M. Miwa, R. Satre, Y. Miyao, J.I. Tsujii, “Entity-focused sentence simplification for relation extraction,” Proceedings of the 23rd International Conference on Computational Linguistics (Association for Computational Linguistics), pp. 788-796, 2010.

[16] O. Biran, S. Brody, N. Elhadad, “Putting it simply: a context-aware approach to lexical simplification,” Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Short papers-vol. 2, pp. 496-501, 2011.

[17] G. Tur, D. Hakkani-Tur, L. Heck, S. Parthasarathy, “Sentence simplification for spoken language understanding,” Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, pp. 5628-5631, 2011.

[18] L. Brouwers, D. Bernhard, A.L. Ligozat, T. Francois, “Syntactic sentence simplification for French,” Proceedings of the 3rd Workshop on Predicting and Improving Text Readability for Target Reader Populations (PITR), EACL, pp. 47-56, 2014.

[19] S.K.D. Nikita, S.K. Sharma, “Detection of complex sentences in Punjabi language using CRF,” Journal of Innovation in Electronics and Communication Engineering vol. 5(2), pp. 42-46, 2015.

[20] Chandni, R. Narula, S.K. Sharma, “Identification and separation of simple, compound and complex sentences in Punjabi language,” International Journal of Computer Applications & Information Technology, vol. 6, pp.123-128, 2014.

[21] M.C. De Marneffe, C.D. Manning, “Stanford typed dependencies manual,” Technical report, Stanford University, 2008, https://nlp.stanford.edu/software/ dependencies manual.pdf

[22] R. Kokare, K. Wanjale, “A Natural Language Query Builder Interface for Structured Databases Using Dependency Parsing,” International Journal of Mathematical Sciences and Computing, vol. 1(4), pp. 11-20, 2015.

[23] P. Atteberry, “Sentence Types,” http://www.pitt.edu/ atteberr/comp/0150/grammar/sentencetypes.html

[24] B. Santorini, “Part-of-speech tagging guidelines for the penn treebank project (3rd revision)”, 1990.

[25] R. Khoury, “Sentence Clustering Using Parts-of-Speech,” International Journal of Information Engineering and Electronic Business, vol. 4(1), pp.1-9, 2012.