Scraping the Chinese Internet: Challenges, opportunities and approaches 🇬🇧
This talk discusses web scraping techniques for the Chinese internet, covering sources like social media and official government websites. It highlights that, contrary to popular belief, China’s political processes do not need to be a black box. Millions of government documents can be found online that can illuminate developments in the country, sometimes even on sensitive affairs. Web scraping techniques can help pry open this black box and help see the forest for the trees. Yet, the talk also focuses on how extensive challenges remain, ranging from geo-blocking to “real-name registration”.
Combining a deep understanding of China’s political and legal system with innovative data-driven approaches, Vincent Brussee helps stakeholders understand crucial developments in China and their impact on the world. His analyses have been featured in popular media like BBC World News and Foreign Policy as well as leading academic journals like The China Quarterly. Most recently, his acclaimed book Social Credit (Palgrave Macmillan, 2023) provides a fresh and engaging analysis of China’s oft-misunderstood Social Credit System.