Difference between revisions of "Program data quality"

From TV-Browser Wiki
Jump to: navigation, search
(Added link to german version)
m (One more, to reflect an additional segment found in the German version. Also, wie man in Pennsylvanien sagt:: "Noo ses groadt gnink!")
 
(8 intermediate revisions by 5 users not shown)
Line 1: Line 1:
== Program data quality ==
+
== Program Data Quality ==
  
The program data is pass to us by the press office of each broadcast station.
+
The program data is received from the press office of each broadcast station, and is automatically processed; hence the quality depends on the data obtained.
  
As a matter of fact the received data is automatically processed and so the quality depends on the data which is presented to us.
+
Some broadcast stations provide the data as structured XML.  From others, we receive the information in an informal and proprietary format, like RTF.
  
Some broadcast stations give us XML-Data where the Informatoin is quite structured. Others just give us the information in a propritary and unstructured format like RTF.
+
As a result, for example, in the data from some broadcast stations, the actors are listed individually, but not in that of others.
  
The result is that you can see some broadcast stations where ie. the actors are seperated on some others they are not.
+
Some stations include information about the audio format (mono, stereo, dual or Dolby-Surround), while others don't.
Or some stations pass us the information about the audioformat (mono, stereo, dual or Dolby-Surround) others don't.
 
  
Some Stations add some additional information, like information about the cast, this can't be seperated automatically and so these informations are within the description of the broadcast.
+
Some provide even more information, such as cast blurbs, which information is nevertheless unstructured, and therefore can't be parsed out automatically — hence, it appears within the general broadcast description.
  
Also clasification of the data is not available (like: movie, quiz-show,...) and if it is available this data is also presented in a propritary format and also the content itself is not standard (ie one stations says "Documentation" the other will call it "Info-Show",...).
+
Sometimes the classification of the data is unavailable (movie, quiz-show, etc.). If it is available then, once again, it is often presented in a proprietary format.  Beyond that, the terminology itself is not standardized (e.g., one station speaks of a "Documentary," while another will call it an "Info-Show").
  
As a matter of fact we are free of charge and so we cannot reformat all the data by hand.
+
Since TV-Browser is free of charge, we're unable to reformat all that data.  In particular, we can not make manual corrections to errors in the data, or modify programs to account for current events.
  
For everyone interested to present us data or who wants to know how the data is processed can find additional informations in our [[Providing_TV_listings|tutorial]].
+
If you'd like to send us data, or if you'd like to know how the data is processed, you can find additional information in our [[Providing_TV_listings|tutorial]].
  
 
[[de:Qualität der Daten]]
 
[[de:Qualität der Daten]]
 +
 +
[[category:Usage]]

Latest revision as of 14:55, 31 January 2008

Program Data Quality

The program data is received from the press office of each broadcast station, and is automatically processed; hence the quality depends on the data obtained.

Some broadcast stations provide the data as structured XML. From others, we receive the information in an informal and proprietary format, like RTF.

As a result, for example, in the data from some broadcast stations, the actors are listed individually, but not in that of others.

Some stations include information about the audio format (mono, stereo, dual or Dolby-Surround), while others don't.

Some provide even more information, such as cast blurbs, which information is nevertheless unstructured, and therefore can't be parsed out automatically — hence, it appears within the general broadcast description.

Sometimes the classification of the data is unavailable (movie, quiz-show, etc.). If it is available then, once again, it is often presented in a proprietary format. Beyond that, the terminology itself is not standardized (e.g., one station speaks of a "Documentary," while another will call it an "Info-Show").

Since TV-Browser is free of charge, we're unable to reformat all that data. In particular, we can not make manual corrections to errors in the data, or modify programs to account for current events.

If you'd like to send us data, or if you'd like to know how the data is processed, you can find additional information in our tutorial.