Skip to content

Maaspr/100queries

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

72 Commits
 
 
 
 
 
 

Repository files navigation

The 100 Queries Project

Background

We, Maarten Sprenger and Carsten Schnober, investigated the quality and usability of the search results provided by Google for a hundred different search queries by children aged eight to twelve for ‘Slim Zoeken’.

With this project we want to demonstrate that, on the one hand, it is a waste of time and misleading for children to just let them google something (without further guidance). On the other hand, we want to make a start on systematically answering the question of what web search is worth at the moment, both from the perspective of the supply on the web and from what is retrieved by Google.

With this we would like to connect to a revival that we see in the critical approach to web search for users.

The underlying idea is that in terms of observation (expert opinion) an estimated 90% of the result pages have a commercial component, varying from content as a vehicle for advertisements to product descriptions and blogs that should lead to more traffic to the websites in question, under the SEO motto ‘Content is king’. This combined with the experience that at the same time about 80% of the results pages for primary school children seem to be (too) difficult to access. Up until now, this was mainly anecdotal evidence from Maarten's more than fifteen years of experience in working with online information for children (see also M. Sprenger, Children's Informatie Who cares?, 2014).

Who are we?

Maarten Sprenger

Maarten Sprenger is an information professional, educational editor and author of two Slim Zoeken books for 8-14 year olds. He is currently working on Slim3, understandable information about online searching for school, work and home. Free public access for everyone, in two versions, for 8-14 and for 15+.

From 2019 to the present, Maarten advised at various times in the context of the Actualisatie van de nieuwe Kerndoelen ("Actualization of the new Core Education Objectives"), for the subjects Dutch and for Digital Literacy.

slimzoeken.nu, linkedin.com/in/msprenger

Carsten Schnober

Carsten os a software engineer with a background in Natural Language Processing (NLP) and Machine Learning. Apart from being an experienced engineer, he is interested in the mutual impact of technology and society. Carsten works for the NL eScience Center, where he develops advanced technological research software for researchers in the Humanties and Social Sciences. Software engineer and Natural Language Processing (NLP) researcher with a critical view on society and technology.

linkedin.com/in/carsten-s-a1aba0, esciencecenter.nl/team/carsten-schnober/

The Study

We provide a comprehensive description of the methodology on our Notion page in Dutch. Find an overview in English below.

Design

For this study, we selected one hundred queries from a list of two hundred random, authentic searches by primary school pupils. The queries originate from Wizenoze bv, with thanks to Thijs Westerveld.

Queries, result pages and associated sources are stored in three databases, in which all records were provided with labels on metadata, relevance, quality and parent companies. A fourth database for parent companies was added later to gain a better picture of the clustering of content providers.

Methodology

Query selection

The selection of queries was made as follows:

  • Dutch language
  • Primary school age (8-12 years)
  • Period 1 year (2022)
  • Random samples, based on frequency (because not only the top queries are interesting)
  • Delivered in a set of 200; manually reduced to a working set of 100 with a balanced distribution of topics, including dirty words and brand names. Evidently incomplete text and vague typos (haaaaaaaaao) and repetitions ('Ronaldo', 'Cristiano Ronaldo') were omitted.

Output

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%