Term extractor: I don't get the same result as in your web page

Asked by Esteban Feldman

Hi, using your code in your web page and in my machine I get 2 different results:

From your web page:

[
    "travel",
    "aires",
    "buenos"
]

from my code (that is your code):

[u'travel', u'bueno', u'aire'] you can see that the 's' has been removed.

Are you using the same code in your website than in the launchpad repository or there is a different setting for the DefaultFilter?

Thanks

Question information

Language:
English Edit question
Status:
Answered
For:
Five Filters Edit question
Assignee:
No assignee Edit question
Last query:
Last reply:
Revision history for this message
Esteban Feldman (ekagaurangadas) said :
#1

I found that in the file topia/termextract/tag.py line 90

you have this

    rules = (
        #correctDefaultNounTag,
        verifyProperNounAtSentenceStart,
        determineVerbAfterModal,
        #normalizePluralForms,
        )

in mine isn't commented

    rules = (
        correctDefaultNounTag,
        verifyProperNounAtSentenceStart,
        determineVerbAfterModal,
        normalizePluralForms,
        )

is there any reason for this?

Thanks

Revision history for this message
Keyvan (keyvan) said :
#2

Esteban: yes, I commented those two rules out because I didn't want the terms to be changed in the results. Feel free to uncomment if you're happy keeping those rules in place.

Can you help with this problem?

Provide an answer of your own, or ask Esteban Feldman for more information if necessary.

To post a message you must log in.