Thank the bots! They are replacing humans on the Internet.

Comparing the audience quality of websites.

The editors of PC Magazine ran an interesting experiment to separate the wheat from the chaff, that is, robots from humans. Let us leave aside the question of who is the wheat and go straight to the point. The goal was to test the hypothesis that it is possible, in principle, to compute formal parameters that characterize the quality of social networks and media on the Internet.

The value of a site is largely determined by its type. For all their diversity, there are only four types of sites for humans: "content site", "navigation site", "social system" and "web service" (robots have more types of sites, but that is understandable :)

Content site: the site where you are now. You are not a robot? Then you should find this interesting: this kind of site can be assessed by three criteria: "completeness of information", "coverage of the thematic field" and "editorial contribution".
To evaluate these criteria, the sources of information have to be assessed first (completeness of information is valued especially). Clearly, we do not give away our own sources, but all the same: nothing is unique, there is only skilled rewriting. As Solomon said:

"Sometimes people say: 'See, this is new', but it was already here in the ages before us."

Well, since everything has already happened many times, let us turn to the originals. There are mainly two kinds: press releases and publications by colleagues. Accordingly, the match between a source and our acceptor site can be traced. Clearly, this process lends itself to automation if desired. A database of all primary sources on the Internet was collected, making up the "information portrait of the day".

"Editorial contribution" is the degree to which the original source content has been reworked. The editors assumed that reworked content carries more weight than content that was simply copy-pasted: as material is reworked in light of new facts and new expert comments appear, the value of the publication to the reader (and to the robot :) increases.
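The article does not say how the editors actually measured the degree of reworking. As a rough illustration only, one could compare the word sequences of a source text and the published text and treat dissimilarity as "editorial contribution". The function name `editorial_contribution` and the word-level comparison are assumptions made for this sketch, not the editors' method.

```python
from difflib import SequenceMatcher


def editorial_contribution(source_text: str, published_text: str) -> float:
    """Return a score from 0.0 (verbatim copy) to 1.0 (nothing in common).

    Hypothetical metric: 1 minus the word-level similarity ratio.
    """
    source_words = source_text.lower().split()
    published_words = published_text.lower().split()
    # autojunk=False keeps the comparison exact for long texts
    matcher = SequenceMatcher(None, source_words, published_words, autojunk=False)
    return 1.0 - matcher.ratio()
```

A copy-pasted press release would score 0.0 under this sketch, while a text rewritten with added commentary would score closer to 1.0.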

Completeness of information was determined in the same way, i.e., as the ratio of the number of keywords in the publications (more is better).
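The text does not spell out what the keyword count is divided by. A minimal sketch, assuming the ratio is the share of a source's keywords that also appear in the publication, might look like this. The helper names and the crude "words longer than three characters" keyword rule are illustrative assumptions.

```python
import re


def keywords(text: str) -> set[str]:
    """Crude keyword extraction (assumption): distinct words longer than 3 characters."""
    return {word for word in re.findall(r"\w+", text.lower()) if len(word) > 3}


def completeness(source_text: str, published_text: str) -> float:
    """Share of the source's keywords that also occur in the publication."""
    source_keywords = keywords(source_text)
    if not source_keywords:
        return 0.0
    return len(source_keywords & keywords(published_text)) / len(source_keywords)
```

With this definition a publication that keeps every keyword of its source scores 1.0, in line with the "more is better" reading.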

The "coverage of the thematic field" index is defined as the number of significant items on the "agenda of the day" that were picked up by the online publication. It is simple: topics are singled out from the above-mentioned database holding the information portrait of the day, and a count is made of how many of them are mentioned in the site's news flow.
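The counting step described above can be sketched as follows. The function name, the plain substring match and the sample data are assumptions for illustration; the article does not describe how topics were matched against the news flow.

```python
def thematic_coverage(day_topics: list[str], site_headlines: list[str]) -> int:
    """Count the day's topics that are mentioned in at least one headline."""
    lowered_headlines = [headline.lower() for headline in site_headlines]
    return sum(
        1
        for topic in day_topics
        if any(topic.lower() in headline for headline in lowered_headlines)
    )


# Hypothetical "portrait of the day" and news flow of one site
topics = ["budget", "elections", "olympics"]
headlines = ["Budget vote delayed", "Olympics opening ceremony"]
covered = thematic_coverage(topics, headlines)
```

A real system would need something sturdier than substring matching (stemming, synonyms), but the index itself is still a simple count.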

Social systems are built around their communities. These are social networks, blogs and forums, photo sites, video sharing systems and many entertainment resources whose content is created by the collective intelligence of the participants. Of particular interest is the very existence of that "intelligence".

As a result of the experiment, the editors concluded that a significant share of visits to web pages is generated not by real people but by specialized robots (or bots): news-gathering agents, various "spiders", etc.
Web robots usually come from outside. They can add an entry to your blog, a bookmark in a social system, or a reply in a forum. Such a robot can be fairly intelligent: it is sometimes able to vote, open links, etc. There are systems that can mimic a "discussion" in the comments or requests such as "hi there, high five". Launching a bot that simulates the "Pepsi generation" is now almost trivial (in systems where registration requires, for example, a passport number, the trick is harder to pull off). It is no accident that some dating services, even in TV commercials, present "real profiles only" as one of their advantages.

To estimate the ratio of robots to humans in these services, the editors registered accounts on several social services. In the blogs the editors posted announcements of original pcmag.ru articles; this feed played the role of a stable source of incoming entries. In addition, we created several virtual users who posted entries and links on known popular topics (the list of those was compiled from the "Yandeks.Blogi" rating service). During the study, statistics and the reaction of the "community" were recorded.

In assessing the results, it became evident that there are specific behavior scenarios that distinguish a human from a robot. In summary, a human is diverse and inconsistent, while a robot is consistent and methodical. Digressing from the topic, one can continue the thought: purposeful, strong-willed people who follow their life plans are actually not so far from robots, and ideally they are robots :)

The experiment concluded that 15 to 30 percent of blogs are abandoned by their creators within six months. But no worries: that share is more than made up for by the hordes of robots that have replaced people. LiveJournal users are actively discussing the influx of bots, which they link to the recent scandal over the elimination of basic accounts, when a fairly large number of users closed their journals (the system shows about 15%, but that figure includes journals mass-created by bots, so the real share is higher).

In addition, the experiment made it possible to estimate the audience's level of education, financial situation, etc. For the first, a pool of keywords was formed that defines a cluster of interests for an audience unlikely to have a high level of education. The names of TV series and comedies popular with a mass audience were taken as the basis (such as "Happy Together", "Not Born Beautiful", etc.). The data were extracted using the "Yandeks.Blogi" service. The prosperity parameter was assessed by counting mentions in diaries of purchases of expensive goods, tourist trips, trips abroad, etc. The "Diversity of interests" column contains estimates reflecting the breadth of interests of the system's users (determined from an analysis of "tag clouds" or blog categories).

A more interesting indicator is what the editors conventionally called the "herd instinct". It reflects the audience's readiness to discuss a topic proposed to it, and is defined as the ratio of the average number of "SpyLOG Rambler 's on the same topic (with the same tag or in a single category) to the average length of discussion. The idea was to identify communities interested in various topics that form in a natural way.

Another typical site archetype of 2007 is the web service. In this case, analyzing the content component is usually meaningless. Sites were chosen more for the relevance of the service and its technical implementation (in some cases on the basis of earlier assessments; this applies in particular to file sharing, photo sites, etc.).

In conclusion, we emphasize again: the figures are not ratings but summary indicators reflecting some trends that we identified in the course of the experiment on a limited data set. They should be treated as reference points, artificial metrics that help reveal the specifics of particular resources.

Blogs and Community

System               Audience reality   Education and intelligence   Prosperity   Diversity of interests
habrahabr.ru         ****0              ****0                        ***00        ***00
Privet.ru            ****0              **000                        **000        ***00
"Blogi@Mail.ru"      **000              **000                        **000        ***00
"LJ"                 ****0              ****0                        ****0        *****
"Rambler · Planet"   ***00              ***00                        ***00        ***00

Online Media

Site             Information completeness   Coverage of the news field   Editorial contribution
lenta.ru         ***00                      *****                        ***(0
astera.ru        ***00                      ****0                        *0000
utro.ru          ***00                      ****0                        ***(0
rbc.ru           *****                      *****                        ****0
securitylab.ru   ***00                      ****0                        ***(0
klerk.ru         ***00                      ***00                        ***(0
regnum.ru        ****0                      ****0                        ***(0
3dnews.ru        ****0                      ****0                        ***00
membrana.ru      ****0                      ****0                        ***(0
sostav.ru        ****0                      ****0                        *****
