Here's the problem:
Word POS
AB'C' NNP
DEF' CC
GH'I' NNP
JKL ' CD
MN'O' CG
->In this input the first column consists of words and the second column depicts the Parts Of Speech(POS) of each word. ' used as the syllabic boundary. So a word might be just one syllable or more than one syllables. [Our language does not have word boundaries just like in English language. We have syllabic and sentence boundaries.]
Input2:
[AB'C'DEF'GH'I'] [JKL'MN'O']
->This is an input consisting of phrases without the word boundaries. While the first input is same as input2 in terms of the words they consist but input2 does not have word boundaries marked between words. So the task is to compare the two input files and get a result where the phrases in input2 has spaces in between words as word boundaries. And then for each word in the resulting file, the no. of syllables in it and it's corresponding POS must be output to a file.
Output:
[AB'C' DEF' GH'I'] [JKL' MN'O']
Here the words are separated by spaces in each phrase. No. of words in the first phrase is 3 while the second phrase has only 2 words.
Final output:
Using the above result, no. of syllables in each word and the POS of each word must be found. As depicted below:
[2/NNP 1/CC 2/NNP] [1/CD 2/CG]
Help me!
Thanks.
Word POS
AB'C' NNP
DEF' CC
GH'I' NNP
JKL ' CD
MN'O' CG
->In this input the first column consists of words and the second column depicts the Parts Of Speech(POS) of each word. ' used as the syllabic boundary. So a word might be just one syllable or more than one syllables. [Our language does not have word boundaries just like in English language. We have syllabic and sentence boundaries.]
Input2:
[AB'C'DEF'GH'I'] [JKL'MN'O']
->This is an input consisting of phrases without the word boundaries. While the first input is same as input2 in terms of the words they consist but input2 does not have word boundaries marked between words. So the task is to compare the two input files and get a result where the phrases in input2 has spaces in between words as word boundaries. And then for each word in the resulting file, the no. of syllables in it and it's corresponding POS must be output to a file.
Output:
[AB'C' DEF' GH'I'] [JKL' MN'O']
Here the words are separated by spaces in each phrase. No. of words in the first phrase is 3 while the second phrase has only 2 words.
Final output:
Using the above result, no. of syllables in each word and the POS of each word must be found. As depicted below:
[2/NNP 1/CC 2/NNP] [1/CD 2/CG]
Help me!
Thanks.
Comment