Bug in Title Recognition Algorithm

Roon incorrectly differentiates accented characters when it attempts to recognize compositions and link them to metadata. For example, look at this mess:

Those five “Preludes” PARTs (and seven others off the screen) should have been associated with one WORK:
“Preludes (12) for piano, Book I, L. 125 (117)”

Why didn’t it? After much trial and error, two PARTS were tagged as follows:
“Préludes (12) for piano, Book I, L. 125 (117)”…which caused the screen trainwreck.

I submit that these two should be treated as a match by Roon and grouped accordingly.

Roon has already recognized the need for such adjustments. It has successfully matched “Frédéric Chopin” and “Frederic Chopin” for a least a year now. That same simple idea needs to be extended to compositions.

One other thing: if WORK and PART tags are present, they should take priority over the TITLE contents. Right now, TITLE seems to win out. The Roon algorithm was probably written when few people even heard of the WORK and PART tags, so TITLE was the default source. A simple code change is needed, to wit, IF WORK and PART are non-blank, use them.

Sympathies John.

I have a great deal of trouble with Debussy. Almost every import is wrong in some way without a lot of manual editing.

The basic problem is that Debussy did not use Opus numbers. But there is a story behind the “L” catalog numbers I’m not sure everyone is aware of. I wasn’t until I looked into it out of frustration with roon’s very poor identification rates with Debussy.

The “L” is after François Lesure and the first system of “L” numbers was only introduced in 1977. What this means is that not only is it guaranteed that the Debussy back catalog has no Opus numbers it is also extremely common that there are no L numbers either. In effect this means that many works will be rather unpredictably titled in French (with accents) or in Franglais (without) depending on the release market and will have no other Opus or L number identifying marks.

But it gets worse. The Lesure system was completely revised in 2003 and it is that one that allmusic/rovi is using. That is why you often see two L numbers. The one without the brackets is the 2003 one and the one with the brackets is the 1977 one. But any comments I have seen imply that not many are following rovi’s lead on this. You are going to see a lot of French/Franglais with no L numbers and when you do see them it is more commonly the 1977 catalog.

You can see the complete catalog here with the 2003 L numbers with no brackets and the 1977 L numbers with brackets:

http://www.debussy.fr/cdfr/catalog/chronopus.php

If you are using public data bases to auto tag or you are working off the back of a legacy CD you may well be using no L number at all and when you do use one you are probably using the 1977 one. This will obviously cause a lot of confusion and is why with a Debussy import it is so often necessary to do a cut and paste of canonical titles from the allmusic website with both L numbers. This seems highly unsatisfactory to me. It must be possible to build more intelligence into roon’s Debussy recognition logic. Unfortunately there are similar tales with other Composers from the core repertoire requiring special logic handling to improve auto-recognition rates.

PS. I had a quick look at Qobuz and the L numbers being used depend entirely on the recording date. So for example a modern recording of Suite Bergamasque is L 82 whereas a mid-80’s one will be L. 75.

This means that sometimes your composition will be identified as Suite Bergamasque and sometimes as a Mazurka. This type of thing happens to me a lot. I also get a very common confusion between Golliwog’s Cakewalk and Deux Danses because L 113 is different depending on whether you assume the 1977 or 2003 catalog.

Yeah, I ran into that issue earlier and wound up amending most of my Debussy tracks.

But the L numbers were not the issue in the PO. It was the fact that there was an accented “e” in one or two “Preludes” WORK tags, and that alone disqualified the composition from being properly grouped.

Roon was successful ignoring accents in composer names. But composition names need a similar treatment.

Je parle le Franglais comme un native , in my case a native of Cheshire !

Guess I’ll stick to our Germanic composers

:nerd_face:

Ps there are a couple of books called something like “parley vous franglais” a total hoot

Mike

1 Like

Point taken.

Just to bring it back on topic what I think I was trying to say was that making equivalents of the French (with accents) and Franglais (without) will no doubt help but there needs to be several other common equivalents made as well. Otherwise most of the time we will just end up with a routine cut and paste edit from allmusic every time we import a Debussy disk because the underlying problem is unsolved. That is pretty much what my experience is at the moment.

Although not all possible combinations happen (it would be too illogical even for the labels), various combinations of the following are routinely happening and there may be a few others. My maths is not good enough to work out how many possibilities that is:

  1. French (with accents)
  2. Franglais (without)
  3. both French & (Franglais in brackets)
  4. both Franglais & (French in brackets)
  5. 1977 “L” numbers
  6. 2003 “L” numbers"
  7. both 2003 and (1977 “L” numbers in brackets)
  8. both 1977 and (2003 “L” numbers in brackets)

I imagine the scope and extent of title recognition could be expanded to the point that even IBM’s Watson computer would struggle.

But lots of search algorithms have managed to work around accented characters and can match the basic characters (Franglais, I guess).

I was POed to learn of the L debacle, but I have to give Roon a pass on that one. The accented character problem seems like an oversight (or dare I say it, bug) to me.

Have you had any luck with this one:

Sonate pour flûte, alto et harpe en Fa Majeur, L. 137

It is just an example. But I have many like these. A lot of the best of the core French repertoire, unsurprisingly, is recorded on French labels so this is what it looks like.

By the time that that rovi/allmusic gets to it, it looks like this in a strange mixture of French and English:

Sonate en trio, for flute, viola & harp, L. 145 (137)

But I still cannot get a composition id in roon. This is now quite a mash-up. Accents have been stripped out, both English and French words are being used as well as both L numbers. Still no match

Do you have a TIDAL “reference” album imported with this work?

Sorry my sense of humour will be my downfall

Well, the OP thought it amusing. :slight_smile:

Hey @Mike_O_Neill , no apologies. It was funny. Sometimes (well, always really my OCD takes over). :stuck_out_tongue_winking_eye:

I would really like to see some metadata traction on the French core. It is driving me absolutely bonkers. There must be some low hanging fruit. Accents has to be one. I even try translating the titles from Erato and the rest of the French labels in google translate. Who exactly is allmusic using to come up with such bonkers Franglais and have the cheek to call it canonical? The worse thing is that we start beleiving it. It is not canonical. there are no standards, No one has agreed to it. Where on earth does it come from?

Yes. After a lot of googling effort I worked out in this particular case the Franglais version that roon was expecting so that it could make a match. The problem is I have a long, long queue of others that I have not got round to.

I don’t have an answer. I don’t think anyone has. And I don’t want to rain on your parade. Ignoring accents in certain cases will help I am sure. In a handful of cases because of the shared history of the UK and France since the Norman invasion of 1066 the French and English spelling of words is the same. For example, prélude and prelude.

It’s just that in many cases the word itself and more importantly the grammar and the entire construction and word order of a phrase (like a work title) is different. I don’t know how far the industry is in commoditising this sort of localization logic.

You might have already run across this. MusicBrainz uses the original Cyrillic alphabet for all Russian artists. Makes it difficult for this mono-linguistic guy to search for Rachmaninov’s stuff.

R. Maninov, Rachamaninov, Rachamninov, Rachmaininoff, Rachmaniniv, Rachmaninkov, Rachmaninof, Rachmaninoff, Rachmaninoff’s, Rachmaninov, Rachmaninov, Sergei, Rachmaninov, Sergej, Rachmaninow, Rachmanioff, Rachmaniov, Rachmaniov, Sergej, Rachmanninoff, Rachmanyinov, Rachmininoff, Rachmnaninoff, Rachmoninoff, Rachmoninov, Racmaninov, Rahmaninof, Rahmaninov, Rahmanjinov, Rahmanyinov, Rakhmaninov, Rakhmaninov, S.V., Rchmaninov, S. Rachmaninov, S. Rachamaninoff, S. Rachamaninov, S. Rachamaņinovs, S. Rachmaniniov, S. Rachmaninof, S. Rachmaninoff, S. Rachmaninov, S. Rachmaninovas, S. Rachmaninovs, S. Rachmaninow, S. Rachmanninoff, S. Rackhmaninov, S. Rahmaninov, S. Rahmaņinovs, S. Rahmanjinov, S. Rakhmaninov, S. V. Rachmaninoff, S. V. Rachmaninov, S. W. Rachmaninoff, S.Rachmaninoff, S.Rachmaninov, S.V. Rachmaninov, S.V.Rachmaninov, Segey Rachmaninov, Serg ej Rachmaninow, Serg. Rachmaninow, Serge Rachmaninof, Serge Rachmaninoff, Serge Rachmaninov, Serge Rachmaninow, Serge Vassilievich Rachmaninov, Serge W. Rachmaninow, Sergei Rachmaninoff, Sergei Rachmaninov, Sergei Rachamaninov, Sergei Rachaninoff, Sergei Rachmainoff, Sergei Rachmaninof, Sergei Rachmaninoff, Sergei Rachmaninov, Sergei Rachmaninow, Sergei Rachmanioff, Sergei Rachminoff, Sergei Rachmoninov, Sergei Racmaninoff, Sergei Rahmaninov, Sergei Rakhmaninov, Sergei V. Rachmaninoff, Sergei V. Rachmaninov, Sergei Vasil’yevich Rachmaninov, Sergei Vasilievich Rachmaninoff, Sergei Vasilievich Rachmaninov, Sergei Vasilievich Rachmanioff, Sergei Vasilievitch Rachmaninov, Sergei Vasilyevich Rachmaninoff, Sergei Vasilyevich Rachmaninov, Sergei Vassilievich Rachmaninoff, Sergei Vassilievich Rachmaninov, Sergei Vassilievitch Rachmaninoff, Sergei Vassillevitch Rachmaninov, Sergeij Rachmaninow, Sergej Rachmaninoff, Sergej Rachmaninnoff, Sergej Rachmaninof, Sergej Rachmaninoff, Sergej Rachmaninov, Sergej Rachmaninow, Sergej Rachmanioff, Sergej Rahmaninov, Sergej Rahmanjinov, Sergej Rchmaninov, Sergej W. Rachmaninow, Sergej Wassiljewitsch Rachmaninow, Sergel Rachmaninoff, Sergey Rachmaninoff, Sergey Rachmaninoff, Sergey Rachmaninov, Sergey Rahmaninov, Sergey Rakhmaninov, Sergey Rakmaninov, Sergey Vasil’yevich Rachmaninov, Serghei Rachmaninoff, Serghei Rachmaninov, Serghei Rahmaninov, Serghei Rakhmaninov, Serghei V. Rahmaninov, Serghei Vassilevic Rachmaninov, Sergi Rachmaninoff, Sergie Rachmaninoff, Sergie Rachmaninov, Sergiej Rachmaninow, Sergiusz Rachmaninow, Sergiusz Rahmaninow, Sergueï Rachmaninoff, Serguei Rachmaninov, Sergueï Rachmnaninov, Sergueï Vassilievitch Rachmaninov, Serhej Racjmaninov, Servei Rachmaninov, Siergej Rachmaninow, Siergiej Rachmaninow, Sz. V. Rahmanyinov, V. Rachmaninov, W. Rachmaninoff, Рахманинов, Рахманинов Сергей, С Рахманинов, С. В. Рахманинов, С. В. Рахмањинов, С. Разманинов, С. Рахмаљинов, С. Рахманинов, С. Рахманінов, С. Рахманнинов, С. Рахмањинов, С. Рахмнинов, С.В. Рахманинов, С.Рахманинов, Сергей Васильевич Рахманинов, Сергей Рахманинов, Сергеј Рахманинов, Сергеј Рахмањинов, Сергій Рахманінов, סרגיי רחמנינוב, ラフマニノフ