Agree with i960 & truetype. If you have bad data, and have proof, then hiding the source makes no sense, and tends to restrict others being able to help.
IMHO: It is not uncommon to have to "clean" your data, or categorize according to confidence level. I use CBOE Livevol for SPX option data...