identifying primary sources in the news
Jan. 3rd, 2010 01:25 am![[personal profile]](https://www.dreamwidth.org/img/silk/identity/user.png)
I think it is well-known that the news media copies stories from each other (presumably with enough changes to not seem too blatant*, but nevertheless the typical lack of citations would suffice to be hung for plagiarism in any academic field). This process can be described as a DAG in which the nodes are stories and the arrows represent copying, and only some nodes are observed (printed). If you abstract over the rewordings, a diagram displaying merges and splits might not look that different from a version control graph.
Query: What aspects of these networks are public knowledge, and to what extent can we trace the history of an individual news item?
Have people developed tools to help this science of "news forensics"? My hope is that by figuring out who the original sources are and what they actually said, we can cancel out the effect of the replicator dynamics (what you see is what sells), and thus get more objective information.
* - If this were a single (rather than a developing) piece of news, and if everyone were honest, one would expect that the non-blatant copying means that the news gets phrased more and more awkwardly. In reality, I suspect they rephrase things without regard for the facts (and often towards sensationalism).
Query: What aspects of these networks are public knowledge, and to what extent can we trace the history of an individual news item?
Have people developed tools to help this science of "news forensics"? My hope is that by figuring out who the original sources are and what they actually said, we can cancel out the effect of the replicator dynamics (what you see is what sells), and thus get more objective information.
* - If this were a single (rather than a developing) piece of news, and if everyone were honest, one would expect that the non-blatant copying means that the news gets phrased more and more awkwardly. In reality, I suspect they rephrase things without regard for the facts (and often towards sensationalism).
(no subject)
Date: 2010-01-03 12:12 pm (UTC)(no subject)
Date: 2010-01-03 05:20 pm (UTC)If it hasn't been done already, it would be a fascinating summer-project length piece of work. You'd need a journalism prof to get the embarrassing results into that community, and perhaps a stats prof or just a mentor at google with the knowledge you already have to get the math and programming right.
(no subject)
Date: 2010-01-03 05:21 pm (UTC)(no subject)
Date: 2010-01-04 06:30 am (UTC)(no subject)
Date: 2010-01-04 01:34 pm (UTC)<< but otherwise stories are "original". That is, the different outlets each have their own reporter who writes her own story. >>
but do they do the research independently? when you see the same thing written in two independent outlets, do you have independent confirmation that it really happened that way?
(no subject)
Date: 2010-01-04 05:34 pm (UTC)