GSoC 2024: Integrating imports into BookBrainz

It really bugged me that it proved impossible to finish the huge BookBrainz importer project last year. Fortunately MetaBrainz (and Google) gave me the chance to continue working on my 2023 project during this Summer of Code, thank you! Our goal is still to import huge external datasets into the BookBrainz database schema.

Last year I worked on the backend services to transform and insert simple entities into the database. This year’s goal was to support importing multiple related entities and exposing the imported data on the website. We can now import entities (on the backend), which can be reviewed and approved by our users with ease.

If you want to know the full story, I recommend you to start with my previous blog post to learn more details about the existing importer infrastructure and last year’s problems. Or just read on if you are only interested in the advanced stuff which I did this year.

Continue reading “GSoC 2024: Integrating imports into BookBrainz”

GSoC 2023: Reviving the BookBrainz importer project

Hi, I am David Kellner (aka kellnerd), an electrical engineering student from Germany, who has finally found the time to participate in Summer of Code after four years of contributing data, bug reports and a bit of code to MetaBrainz projects (mostly to MusicBrainz and related tools such as userscripts).

Although I had mostly worked with MusicBrainz so far, I decided to apply for the BookBrainz importer project as I was already familiar with the underlying JavaScript technology and saw the huge potential of the idea to transform and import external datasets into the cleverly designed BookBrainz database schema. My proposed project was accepted by the MetaBrainz team and I have been working on it for the last six months under the mentorship of monkey.

This post gives an overview about my GSoC project and the challenges which I encountered during this summer.

Continue reading “GSoC 2023: Reviving the BookBrainz importer project”