add project group "datahub"

Asked by lszyba1

Hello,
datahub is a template for creating a "python based apps" that can do crawl,parse, load. From there user creates his projects and would be nice if he can become part of launchpad datahub group of projects that parse government or other databases.

From the basic template that datahub program has you can create subprojects.

1. One of the parser is https://code.launchpad.net/~szybalski/datahub.gov.fueleconomy/trunk this parser fueleconomy.gov epa data.
2. I am working on manufacture parsers, datahub.gov.dot.nhtsa.manufacturer that parser list of manufacturers that sell cars in USA.
3. https://launchpad.net/datahub.gov.dot.nhtsa.recall parses the recall database from nhtsa.dot.gov see results here: http://lucasmanual.com/recall (check out the project, untar and just run process.sh and it will load everything into a database nicely parsed)

"Cogbin" is a tool that displays all python packages that have a keyword "datahub" that you can set in settings.https://code.launchpad.net/~szybalski/cogbin/trunk

I would like to link all these 4 projects as part of one group "datahub". From there people on launchpad can search the group and find useful parsers for data they might be looking at, add more and keep the group growing.

Thank you,
Lucas

Question information

Language:
English Edit question
Status:
Answered
For:
Launchpad itself Edit question
Assignee:
Curtis Hovey Edit question
Last query:
Last reply:
Whiteboard:
Once the project is renamed, assign the question to sinzui who can finish the project group setup.
Revision history for this message
Leonard Richardson (leonardr) said :
#1

Because there is already a (fairly active) 'datahub' project, I can't handle this myself because an existing project cannot be promoted to project group. The 'datahub' project needs to be renamed (which only a LOSA can do) and disabled, and a new 'datahub' project group created.

For LOSA convenience (or whoever's doing CHR when the LOSAs send this back), here's some information about the 'datahub' project group, taken from the existing 'datahub' project and from szybalski's original question.

Display name:
DataHub

Summary:
DataHub is a tool that allows you to quickly find and create data mining programs that are able to crawl, parse, and load the data source into database or other types of useful forms.

Description:
Datahub is a tool that allows faster download/crawl, parse, load, and visualize of data. It achieves this by allowing you to divide each step into its own work folders. In each work folder you get a sample files that you can start coding.
Datahub is for people who found some interesting data source for them, they want to download it, parse it, load it into database, provide some documentation, and visualize it.
Datahub starts with a base default template and you fill in the blanks. Fill in a download_list.txt in crawl folder, fill in the parse code in parse folder, fill in the database structure and load code in load folder...

Initial member projects:
datahub.gov.fueleconomy
datahub.gov.dot.nhtsa.recall
cogbin

Revision history for this message
Curtis Hovey (sinzui) said :
#2

The last message was a comment, not an answer.

Revision history for this message
Michael Barnett (mbarnett) said :
#3

I have renamed this project to datahub-old and disabled it.

Reassiging ticket for project group setup.

-Michael

Revision history for this message
Curtis Hovey (sinzui) said :
#4

I created https://launchpad.net/datahub. You can add projects to this project group using the Change details link on eahc project page. Set the "Part of" field to "darahub"

Can you help with this problem?

Provide an answer of your own, or ask lszyba1 for more information if necessary.

To post a message you must log in.