Query Library

Which English Wikipedia articles have at least one link to bbc.com?

Viewing the first 20 of total 1,303 rows

Download Full Results
titleurl
Loganairhttps://en.wikipedia.org/wiki/Loganair
List of association footballers who died while playinghttps://en.wikipedia.org/wiki/List_of_association_footballers_who_died_while_playing
NULLhttps://en.wikipedia.org/wiki/Land_of_Punt
Smartmatichttps://en.wikipedia.org/wiki/Smartmatic
Tasmanian Wilderness World Heritage Areahttps://en.wikipedia.org/wiki/Tasmanian_Wilderness_World_Heritage_Area
Rugby leaguehttps://en.wikipedia.org/wiki/Rugby_league?rdfrom=http%3A%2F%2Fwww.wiki.jeremymacpherson.net%2Findex...
Ted Kaczynskihttps://en.wikipedia.org/wiki/Theodore_Kaczynski
Marriage (Same Sex Couples) Act 2013https://en.wikipedia.org/wiki/Marriage_(Same_Sex_Couples)_Act_2013
War on Terrorhttps://en.wikipedia.org/wiki/The_war_on_terror
Jim Murphyhttps://en.wikipedia.org/wiki/Jim_Murphy
Sami Khedirahttps://en.wikipedia.org/wiki/Sami_Khedira
Warhammer 40,000https://en.wikipedia.org/wiki/Emperor_of_Mankind
Domestic violencehttps://en.wikipedia.org/wiki/Domestic_Violence
Apehttps://en.wikipedia.org/wiki/Hominoid
Taiwanhttps://en.wikipedia.org/wiki/Republic_of_China
Circuit diagramhttps://en.wikipedia.org/wiki/Electrical_schematic
Taiwanhttps://en.wikipedia.org/wiki/R.O.C
Haredi Judaismhttps://en.wikipedia.org/wiki/Charedi
McDonnell Douglas MD-80https://en.wikipedia.org/wiki/McDonnell_Douglas_MD-80_/_MD-90
History of HIV/AIDShttps://en.wikipedia.org/wiki/History_of_HIV/AIDS

Viewing the first 20 of total 1,303 rows

Download Full Results
Query
select
    css_text_first(content, 'h1#firstHeading') as title,
    url
from 
    pages
where 
    url_domain = 'wikipedia.org' 
        and
    url like 'https://en.wikipedia.org/wiki/%'
        and
    url not like 'https://en.wikipedia.org/wiki/Talk:%'
        and        
    contains_any(content, array['href="https://www.bbc.com', 'href="http://www.bbc.com', 'href="https://bbc.com', 'href="http://bbc.com', 'href="www.bbc.com', 'href="bbc.com'])
Load in editor
Data scanned30.64 GB
Results99.53 KB (1,303 rows)

Turn the web into a database!

Mixnode is a fast, flexible and massively scalable platform to extract and analyze data from the web.

or contact us at hi@mixnode.com