Guess case for classical music

Keschte (g0llum) says: This concerns mostly the classical editors. I’ve finally taken my time to develop the requested guess case mode for the classical style guidelines. These are mostly regular expressions which cover most of the cases that require tedious manual editing. You’ll find some of the examples I’ve worked with in the header of, … Continue reading “Guess case for classical music”

Keschte (g0llum) says:

This concerns mostly the classical editors. I’ve finally taken my time to develop the requested guess case mode for the classical style guidelines. These are mostly regular expressions which cover most of the cases that require tedious manual editing. You’ll find some of the examples I’ve worked with in the header of, please go to the sandbox and try out your titles. Feel free to enter any issues you find into the bug tracker.

Cheers, and have fun testing!

–keschte

Technorati Tags: ,

Wanted: Documentation writer for MusicBrainz' MMD XML Schema

Matthias Friedrich just declared the MMD XML Schema to be stable. Hooray, and thanks for your hard work on creating this schema! The MusicBrainzXMLMetaData wiki page describes this new schema, but it does not provide complete documentation for the new schema. We’re looking for a volunteer to take this page and: Describe the entire schema … Continue reading “Wanted: Documentation writer for MusicBrainz' MMD XML Schema”

Matthias Friedrich just declared the MMD XML Schema to be stable. Hooray, and thanks for your hard work on creating this schema!

The MusicBrainzXMLMetaData wiki page describes this new schema, but it does not provide complete documentation for the new schema. We’re looking for a volunteer to take this page and:

  1. Describe the entire schema in english, with as little geeky talk as possible.
  2. Flesh out the existing examples and add more examples to describe the various aspects of the schema.
  3. Receive community feedback and revise the documentation

The person who decides to take this on needs to understand XML and preferably the Relax NG XML Schema language. The latter is not a must — we can help the documentation writer understand the schema, but knowledge of XML is crucial for this task.

If you are interested in helping out, please post a comment to this entry.

Thanks!

Technorati Tags:

Future directions for MusicBrainz

I’ve returned from my much needed vacation in Iceland and now I am ready to get back to working on MusicBrainz. While I was gone, a few shouting matches and arguments over what MusicBrainz should be in the future erupted, so its clear that its high time to give a general update on how MusicBrainz … Continue reading “Future directions for MusicBrainz”

I’ve returned from my much needed vacation in Iceland and now I am ready to get back to working on MusicBrainz. While I was gone, a few shouting matches and arguments over what MusicBrainz should be in the future erupted, so its clear that its high time to give a general update on how MusicBrainz is doing and where we’re headed in the future.

First, lets review what we’ve accomplished in the past 4 months: 1) we have more server capacity in a new home with better bandwidth 2) A new fingerprinting system with a new partner 3) A new search engine 4) A new web service. If you would’ve asked me how long all these would take to turn into a reality 4 months ago, I would’ve told you 6 – 8 months time. So, we’ve made great strides this year alone!

Now that we’ve knocked off a number of serious problems and improved the overall service, the community looks towards the next set of problems that we need to tackle. At first glance, it may seem that all we have are problems and that there are tons of people who are constantly complaining that MusicBrainz is not this or that. Personally, I think this is interesting and not alarming — these “problems” show that people care about the project. None of these problems put MusicBrainz in danger of vanishing tomorrow. I think the biggest problem right now is that the future for the project is not clear now that we’ve implemented many large improvements over the last few months. This blog post and more to follow next week should hopefully address these questions from a high level perspective:

Q: Is MusicBrainz a service aimed at people who wish to clean up (tag) their music collection or is the goal to create a music encyclopedia?

A: Yes! The long term goal of MusicBrainz is to capture all relevant knowledge about music and create a comprehensive music encyclopedia. The goal is also to create killer tagging applications that take this wealth of knowledge and let users apply it to their own music collections.

Thus, when people edit the database, the focus should be to capture the information as accurately as possible, respecting artist intent and trying to work with our guidelines when artist intent is not clear. The focus should not be to capture information such that music collections can get tagged cleanly with the data!

That is not to say that we don’t care about tagger users — on the contrary! Tagger users who make an occasional $10 donation are the people who pay our bills — they keep the servers on and allow the foundation to have an official place of business!

To make both the encyclopedia minded users and tagger users happy in one giant sandbox, I’d like to present a rough road-map where MusicBrainz will be headed in the near future:

Next server update

Server update with UI improvements, nomenclature (album -> release, moderate -> edit) fixes, album editor, XHTML support and more. This is likely to happen mid to end of May and driven by the hard efforts of Keschte.

Picard user interface improvements

Picard users currently fall into two categories — those who hate it and those who love it. If you don’t like drag and drop and you focus mainly on tracks, you are not likely to enjoy Picard. The user interface improvements presented here will be implemented so that the UI can be used without drag and drop and either in a track or album oriented mode. Which exact model we’ll pursue is unclear at this point, but it will likely be one of the variants proposed there. The overall goal is to make the old MusicBrainz Tagger irrelevant as we prepare to put TRM out to pasture — all tagger users should be happy with Picard.

TaggerScript in Picard

TaggerScript is the nick name we’ve given a much discussed, but not yet specified feature that will allow tagger users more control over how their music collection gets organized. The idea is that with TaggerScript, users will be able to extract information from AdvancedRelationships as well as the usual pieces of release data and then shove that data into the tags/filenames of their collection with a lot more flexibility and control that we currently allow. TaggerScript will allow tagger oriented users to extract the data they care about from the encyclopedia oriented database.

Next generation schema

This is the much discussed and much anticipated major upgrade to the MusicBrainz database. The idea behind this is to allow us to handle releases, classical music and many other facets of music metadata much better than we can today. At Summit #7 we worked for 14 hours to create this new schema and its a great start for defining the goal for a more powerful version of MusicBrainz. However, simply because this new schema exists, it does not mean that we will implement this as it currently stands on the wiki page. We need to spend a lot more time thinking about this — but this first version serves as a great stepping stone for eventually getting to our goal.

The most serious problem with this schema is that it will take a huge amount of effort for us implement it. Essentially, it amounts to rewriting most of MusicBrainz. Think one person working on it full time for 12 months — maybe even 24 months. There are a number of problems with this:

  1. If the dev team went away for 12 – 24 months to work on the next version of MusicBrainz, the current users would lose interest in MB due to the lack of progress. If the end-users cannot witness progress being made, they lose interest. So, devs cannot just work on the next gen schema, they also have to go back and fix other issues that arise. That pushes things out even further. 36 months? Ugh.
  2. MusicBrainz is still being coded by volunteers, and volunteers work on personal motivation. If a person is not motivated to work on a huge project for months on end without pay, they will lose interest. Moving from our current schema to the next schema is going to be rough work and a lot of it. I’m sure we don’t have enough volunteers to make this happen!
  3. For large projects like these, when you finally get done with the project it may no longer be what you need when its finished. It will be what you needed 12 – 24 months ago, not what you need today.

So, then how to we proceed with this mess? There are a number of options on how to proceed — we should attempt to work on all of these approaches at the same time:

  1. Work to sell more data licenses. This non-trivial income will then allow us to hire developers to work on the MB server. Paid people can be properly motivated to work on longer projects.
  2. Work to figure out how to break the schema upgrade into 3-4 smaller upgrades, each taking a 2-3 months to complete, thus making visible progress on a continual basis. [ insert wild hand waving here — I have no clue how to accomplish this ]
  3. Possibly create more tools, abstraction layers or a new moderation system that will overall reduce the total amount work needed to move to a new schema. Here too, we’re brainstorming about how to proceed — nothing concrete has emerged yet.

As you may have guessed, we’re not certain on how to proceed with this new schema — we have a lot to think about and a lot of discussions to hold. Certain is that we will not see this next generation schema come to fruition this year. If you’re holding your breath on the new schema and you cannot deal with MusicBrainz’ shortcomings for at least another year, you may want to find another approach to satisfy your music metadata cravings.

One thing I do know for sure is that I am excited to continue working on MusicBrainz. We’ve accomplished a lot in the last few months and we’re not about to stop working hard on this project.

Onward ho!

Technorati Tags: , ,

New advanced relationship: MySpace artist URL

(In an effort to spread the news about changes to the Advanced Relationship link types, I’m posting this and hopefully future changes here). Lukas has just added an Advanced Relationship that lets users link artists to their MySpace pages: forward: has a MySpace page at reverse: is a MySpace page for description: This relationship type … Continue reading “New advanced relationship: MySpace artist URL”

(In an effort to spread the news about changes to the Advanced Relationship link types, I’m posting this and hopefully future changes here).

Lukas has just added an Advanced Relationship that lets users link artists to their MySpace pages:

  • forward: has a MySpace page at
  • reverse: is a MySpace page for
  • description: This relationship type can be used to link a MusicBrainz artist to the equivalent entry in MySpace. Please, use URLs in format http://www.myspace.com/[artist]

Technorati Tags: ,

Fitting quote

Yesterday at the Santa Barbara Forum on Digital Transitions, I had a chance to pick Clay Shriky’s brain about online consensus and what it means to bring about consensus in an online community. We didn’t come up with much revolutionary other than what we’ve already been doing: Watch a lively discussion (perhaps flame war) arise … Continue reading “Fitting quote”

Yesterday at the Santa Barbara Forum on Digital Transitions, I had a chance to pick Clay Shriky’s brain about online consensus and what it means to bring about consensus in an online community. We didn’t come up with much revolutionary other than what we’ve already been doing: Watch a lively discussion (perhaps flame war) arise and subside. Then pick through the rubble to see if something akin to a consensus was found. If so, great. If not, wait a little while and ask the same question over while framing it better in light of the last discussion. Repeat as necessary and involve evil overlord as last resort.

Clay did share a quote from David Clark that was very fitting to our discussion:

“We reject kings, presidents and voting. We believe in rough consensus and running code.”

While its not perfect, it does show what we do about 98% of the time.

P.S. I was video blogged too — take a look: I won’t stand still while enthusiastically spouting off about MusicBrainz.

Technorati Tags: ,

Picard 0.7.0-beta2 released!

Lukas says: Picard version 0.7.0 Beta 2 has been just released. This version contains mostly bug fixes from the previous beta version. See the full list for details. Downloads: picard-0.7.0-beta2.tar.gz Linux tarball (287 kB) picard-setup-0.7.0-beta2.exe Windows installer (5.62 MB) I’ve also compiled a packages for Ubuntu 5.10 “Breezy Badger”, so Ubuntu users can install Picard … Continue reading “Picard 0.7.0-beta2 released!”

Lukas says:

Picard version 0.7.0 Beta 2 has been just released. This version contains mostly bug fixes from the previous beta version. See the full list for details.

Downloads:

I’ve also compiled a packages for Ubuntu 5.10 “Breezy Badger”, so Ubuntu users can install Picard by “apt-get install picard” or Synaptic. See PicardLinuxInstall for details.

Lukas

Technorati Tags: ,

PUIDs: WTF?

Given that my elaborations are not making things more clear, I will try one last time to make things more clear. But before I go on, remember that there are bugs in the system right now. What I am going to explain is NOT how its currently working. If you observe behavior that is different … Continue reading “PUIDs: WTF?”

Given that my elaborations are not making things more clear, I will try one last time to make things more clear. But before I go on, remember that there are bugs in the system right now. What I am going to explain is NOT how its currently working. If you observe behavior that is different from what I am saying, its BECAUSE THE SYSTEM ISN’T WORKING PROPERLY YET! If you’re confused and don’t understand, stop using Picard 0.7.0.

Music analysis vs fingerprinting:

There are two processes that MusicIP makes available. Fingerprinting and music analysis. Let’s touch on music analysis first — this is the process that takes a while (Yes, I know its slow. Yes, I know its going to take days to analyze your collection. Please stop telling me that!) The process of doing music analysis examines up to 10 minutes of a track and examines all sorts of things I know nothing about. All I know is that in order to generate a new PUID, you must analyze a track fully. This full data collection portion is what allows the MusicIP mixer to generate playlists of similar music. This is the secret sauce that makes MusicIP tick and thus this is not going to be open sourced, no matter how much we ask.

Fingerprinting is much smaller process — it only analyzes about 2 minutes of the track. You cannot generate a new PUID to insert into the database from the fingerprinting process. There is not enough information in this process. There is enough information to create a PUID that is suitable for doing an identification, but not for submission. This works a lot different than TRM — this system doesn’t create rampant amounts of useless fingerprints that will never be used.

Submitting PUID’s from Picard:

When the process works, you do a full music analysis on a track and the system generates a PUID for you within 24 hours. But this is not working right now, so Picard will not likely prompt you to submit PUIDs. This means the submit button stays greyed out.

Why can’t we have Picard generate PUIDs?

Because we don’t have that code. Please stop asking for ponies — we’re fresh out and you can’t have one. Telling us that this sucks won’t make it any better. Please keep your comments to constructive criticism.

I don’t understand — its not working as you say it should:

Did I mention that there are some bugs? If it doesn’t work for you, stop using it. We’ll fix it.

Rude comments:

“Reading up on good interface design would also be a suggestion Just google good user interface design” — Please tell me where your FREE software is so I can download it and insult your hard work. I’m sorry that this FREE program isn’t working for you.

Technorati Tags: ,

Picard: Common questions answered

There seem to be many questions about Picard at the moment and a lot of people who love the old tagger interface and are frustrated with Picard’s UI. So, let me address the feedback we’re getting from our users: “MBTagger was perfect. Picard sucks.” — Ok, this feedback doesn’t help much, but we understand that … Continue reading “Picard: Common questions answered”

There seem to be many questions about Picard at the moment and a lot of people who love the old tagger interface and are frustrated with Picard’s UI. So, let me address the feedback we’re getting from our users:

  • MBTagger was perfect. Picard sucks.” — Ok, this feedback doesn’t help much, but we understand that you are frustrated. We’re working to improve Picard as we speak and the latest release is a beta release, so please bear with us.
  • I don’t understand the Picard interface.” — We understand that there are some people who prefer to tag tracks, not albums. We also understand that the UI is not immediately intuitive. We have plans to fix this, and to allow Picard to be usable by people who prefer to tag tracks and those who prefer to tag albums. For our thoughts on this improved interface, check out this wiki page. For more documentation on how to actually use Picard, see HowToTagFilesWithPicard. And to calm down all the people who are threatening to leave the project if the old MBTagger goes away, we plan to improve the interface of Picard way before the old MBTagger stops working.
  • Picard keeps crashing on me.” — It works for us, which is sucky since we can’t fix your crashes if it doesn’t crash for us. We’re actively soliciting feedback from people who can reproduce crashes. If you have a case where Picard crashes for you consistently and repeatably, please file a bug report.
  • Picard screws up my files!” — First, check to make sure you’re writing the right version of id3v2 tags. Depending on what other programs you use, you may need to switch to version 2.3/2.4 or vice versa. We know about some incompatibilities between the MusicIP Mixer and Picard. We’re going to start working on these issues soon.
  • How do I get PUID’s into MB?” — This answer has two parts. First, to create a new PUID for a file that MB doesn’t have a PUID for currently, you need to download and run the file through the MusicIP Mixer. (Yes, we know not everyone wants to use the Mixer application — for you we’re going to create stand-alone analyzer applications soon). Once a file goes through the MusicIP Mixer, in theory it should become available to the MusicDNS.org service (that MusicBrainz uses) within 24 hours. In practice, this is not happening yet. The MusicIP folks are working on this! Once the MusicDNS.org service has a PUID, you can re run the file through Picard and it should pick up the PUID and prompt you to submit the PUID to MB. Again, there may be bugs in the process — we’re working to iron out the bugs as we speak.

What can you do if you’re affected by these issues? Here is a quick check list for you:

  1. Do you have an issue not addressed herein?
  2. Can you reproduce the issue?
  3. Yes: Please review the open bugs for picard and if this bug doesn’t already exist, please add a new one.
  4. No: Please stop using Picard 0.7.0 for the time being and go back to Picard 0.6.0 or the old MB Tagger. Then wait for a new release of Picard and see if your problems have improved.

Bottom line: If you’re frustrated by how things are working right now, go back to a non-beta release and give us more time to iron out these issues. We’re aware that you are frustrated and we’re working on it. But if people keep pestering us with the same issues over and over again, it only keeps us from fixing actual bugs.

Technorati Tags: ,

New fingerprinting questions answered

As promised, here and some answers to most common questions: “Is there a contact at MusicDNS?” — Yes, use this link. A contact us page will be added soon. “libofa doesn’t build on OS X” — the first rev, 0.9.1 with the first patch included will be posted later today. Stay tuned — this should … Continue reading “New fingerprinting questions answered”

As promised, here and some answers to most common questions:

  • “Is there a contact at MusicDNS?” — Yes, use this link. A contact us page will be added soon.
  • “libofa doesn’t build on OS X” — the first rev, 0.9.1 with the first patch included will be posted later today. Stay tuned — this should also build on OS X.
  • “Are there plans to implement the Creative Commons sampling license options?” — Yes. I’ll add that and the Public Domain ‘license’ later today.
  • “What does PUID stand for?” — They are Portable Unique Identifiers.
  • “How do I get a PUID into the fingerprinting system?” — Right now you need to use the MusicIP Mixer (free) to analyze the track to get a PUID generated for any tracks that do not have PUIDs yet. PUIDs should become visible to MusicBrainz within 24 hours. This is far from perfect, but all we could get done for now. We’ll improve this before too long.
  • “It’s nice that we have an alternative to TRM now, but I’m disappointed that this is a one-way relationship.” — Thats not quite accurate. Our partnership is focused on creating a balanced relationship, but there is a limit to what we were able to accomplish for this initial release. We’ll be working on improving this soon.
  • “When will a version for OS X be available?” — Hard to say. One of the underlying toolkits still has a number of bugs that prevent us from releasing it on the Mac. Hopefully I’ll have some time to look into this soon.
  • “Maybe an obvious question (or answer), but does this excellent news mean I’ll have to re-tag all my files?” — No. The fingerprints are only used to resolve the proper metadata. Once the tagger identifies the right track, it writes MusicBrainz ids to the files, which have not changed.
  • “Since the accoustic fingerprint is opensource is there a role that MusicIP NEEDS to play?” — There is a server component to this as well but that is not open source. Without the server, the client portion is less than useful.
  • “How well does it compared to TRM?” — It should have a lot fewer duplicates and collisions than TRM. But really, time will tell. Let’s start using it and we’ll see how well it works. I do know that the PUIDs won’t have to be trimmed to keep the service alive, so this is already a drastic improvement.

Let me know if there are more questions!

(Update: The support link is now fixed.)

Technorati Tags: ,