Difference between revisions of "Program data quality"

From TV-Browser Wiki
Jump to: navigation, search
m (One more, to reflect an additional segment found in the German version. Also, wie man in Pennsylvanien sagt:: "Noo ses groadt gnink!")
 
(2 intermediate revisions by the same user not shown)
Line 1: Line 1:
== Program data quality ==
+
== Program Data Quality ==
  
The program data is received from the press office of each broadcast station.
+
The program data is received from the press office of each broadcast station, and is automatically processed; hence the quality depends on the data obtained.
  
As a matter of fact the data is automatically processed, hence the quality depends on the data obtained.
+
Some broadcast stations provide the data as structured XML.  From others, we receive the information in an informal and  proprietary format, like RTF.
  
Some broadcast stations provide XML-Data with structured information. Others just provide the information in a proprietary and unstructured format like RTF.
+
As a result, for example, in the data from some broadcast stations, the actors are listed individually, but not in that of others.
  
As a result there are some broadcast stations where ie. the actors are separated, on others they are not.
+
Some stations include information about the audio format (mono, stereo, dual or Dolby-Surround), while others don't.
Some stations deliver information about the audioformat (mono, stereo, dual or Dolby-Surround), others don't.
 
  
Some stations add some additional information, like information about the cast, which can't be separated automatically and so this information is within the description of the broadcast.
+
Some provide even more information, such as cast blurbs, which information is nevertheless unstructured, and therefore can't be parsed out automatically — hence, it appears within the general broadcast description.
  
Also classification of the data is not available (like: movie, quiz-show,...) and if it is available this data is also presented in a proprietary format and also the content itself is not standard (i.e. one station says "Documentation", while the other will call it "Info-Show",...).
+
Sometimes the classification of the data is unavailable (movie, quiz-show, etc.). If it is available then, once again, it is often presented in a proprietary format.  Beyond that, the terminology itself is not standardized (e.g., one station speaks of a "Documentary," while another will call it an "Info-Show").
  
As a matter of fact we are free of charge and so we cannot reformat all the data by hand.
+
Since TV-Browser is free of charge, we're unable to reformat all that data.  In particular, we can not make manual corrections to errors in the data, or modify programs to account for current events.
  
For everyone interested to present us data or who wants to know how the data is processed can find additional information in our [[Providing_TV_listings|tutorial]].
+
If you'd like to send us data, or if you'd like to know how the data is processed, you can find additional information in our [[Providing_TV_listings|tutorial]].
  
 
[[de:Qualität der Daten]]
 
[[de:Qualität der Daten]]
  
 
[[category:Usage]]
 
[[category:Usage]]

Latest revision as of 14:55, 31 January 2008

Program Data Quality

The program data is received from the press office of each broadcast station, and is automatically processed; hence the quality depends on the data obtained.

Some broadcast stations provide the data as structured XML. From others, we receive the information in an informal and proprietary format, like RTF.

As a result, for example, in the data from some broadcast stations, the actors are listed individually, but not in that of others.

Some stations include information about the audio format (mono, stereo, dual or Dolby-Surround), while others don't.

Some provide even more information, such as cast blurbs, which information is nevertheless unstructured, and therefore can't be parsed out automatically — hence, it appears within the general broadcast description.

Sometimes the classification of the data is unavailable (movie, quiz-show, etc.). If it is available then, once again, it is often presented in a proprietary format. Beyond that, the terminology itself is not standardized (e.g., one station speaks of a "Documentary," while another will call it an "Info-Show").

Since TV-Browser is free of charge, we're unable to reformat all that data. In particular, we can not make manual corrections to errors in the data, or modify programs to account for current events.

If you'd like to send us data, or if you'd like to know how the data is processed, you can find additional information in our tutorial.