Mining Launchpad for process analysis

Asked by Aram Morera Mesa

Hi, I am doing research in workflows in localisation and I would love to do data mining using data from Launchpad. I am mainly interested in the logs that have a connection with Rosetta, since I am looking at localisation processes.

Is there someone I can contact regarding this?

Cheers,

Aram

Question information

Language:
English Edit question
Status:
Answered
For:
Launchpad itself Edit question
Assignee:
No assignee Edit question
Last query:
Last reply:
Revision history for this message
Benji York (benji) said :
#1

What information do you want?

Revision history for this message
Aram Morera Mesa (aram-mm-gmail) said :
#2

I do not know what components there are in Launchpad, so it is not easy to ask. I am interested in the kind of interactions that happen in Rosetta. I do not need IPs, names or anything that could compromise a person's privacy: I just need some manner to identify unique users, what they do and when they do it. I am interested in seeing things like the number of translations for one string that are suggested before a reviewer accepts it, how many people do suggestions for one project, how often people click on "someone should review this translation", how long does it take for a reviewers to accept translations, if people use this https://blueprints.launchpad.net/launchpad/+spec/suggestion-approve-rejection-explanation... I guess anonymized logs for a few projects and maybe two languages per project (let's say Spanish and German) would be the way to go for me, but I do not know if that is possible. I can see in the interface that the suggestions, the authors and the times are stored, I guess in a database, so the anonymized versions of the relevant tables would probably work well for me too. I am also keen on seeing the suggestions Rosetta makes using the translations used in other packages. Sorry about the lack of specificity, I hope this gives you an idea, though. It would really be amazingly helpful for my research if I could get that kind of data.

Revision history for this message
Benji York (benji) said :
#3

Unfortunately we don't collect the data you're interested in.

Revision history for this message
Martin Pool (mbp) said :
#4

Aram, can you give us some background (maybe a web page) about your affiliation and your research project?

Revision history for this message
Aram_mm (aram-mm) said :
#5

Thanks for your answer Martin. I am part of the CNGL, you can find me on the list of researchers here:
http://www.cngl.ie/academicpart.html
You can also find me here, although the topic has to be updated.
http://www.localisation.ie/resources/Research/phdresearch.htm
If you look at the programme of this conference you can see that I gave a talk in it.
http://tradumatica.uab.cat/conference/en/introduction.htm
What I am looking at now is the integration of the community in highly automated workflows. I have been looking at tools like Pootle, Crowdin and Launchpad's Rosetta.
I hope that is enough, if you have any specific information request, let me know.

Can you help with this problem?

Provide an answer of your own, or ask Aram Morera Mesa for more information if necessary.

To post a message you must log in.