Mending Wall: On the Implementation of Censorship in India

Gosain, Devashish; Agarwal, Anshika; Shekhawat, Sahil; Acharya, H. B.; Chakravarty, Sambuddho

doi:10.1007/978-3-319-78813-5_21

Devashish Gosain²⁰,
Anshika Agarwal²⁰,
Sahil Shekhawat²⁰,
H. B. Acharya²¹ &
…
Sambuddho Chakravarty²⁰

Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering ((LNICST,volume 238))

Included in the following conference series:

International Conference on Security and Privacy in Communication Systems

1742 Accesses
3 Citations

Abstract

This paper presents a study of the Internet infrastructure in India from the point of view of censorship.

First, we show that the current state of affairs – where each ISP implements its own content filters (nominally as per a governmental blacklist) – results in dramatic differences in the censorship experienced by customers. In practice, a well-informed Indian citizen can escape censorship through a judicious choice of service provider.

We then consider the question of whether India might potentially follow the Chinese model and institute a single, government-controlled filter. This would not be difficult, as the Indian Internet is quite centralized already. A few “key” ASes (\({\approx }1\%\) of Indian ASes) collectively intercept \({\approx }95\%\) of paths to the censored sites we sample in our study, and also to all publicly-visible DNS servers. 5, 000 routers spanning these key ASes would suffice to carry out IP or DNS filtering for the entire country; \({\approx }70\%\) of these routers belong to only two private ISPs. If the government is willing to employ more powerful measures, such as an IP Prefix Hijacking attack, any one of several key ASes can censor traffic for nearly all Indian users.

Finally, we demonstrate that such federated censorship by India would cause substantial collateral damage to non-Indian ASes whose traffic passes through Indian cyberspace (which do not legally come under Indian jurisdiction at all).

Access provided by CONRICYT-eBooks. Download conference paper PDF

Internet Censorship in Italy: A First Look at 3G/4G Networks

Internet Censorship Capabilities in Cyprus: An Investigation of Online Gambling Blocklisting

Leveraging Internet Services to Evade Censorship

Keywords

1 Introduction

The current study of Internet censorship is mostly focused on openly censorious countries – China [37, 43, 52], Iran [34], Pakistan [55], etc. Even world-wide studies of censorship [32] essentially focus on countries well known for their censorship. However, in practice, many other countries still implement some form of censorship, which may even be more insidious because citizens are barely aware of it (for example, Sweden [6] and France [4]). In this paper, we consider the case of India, a major emerging power with over 450 million Internet users [19] (up from 180 million in 2013, and on track to overtake Europe, which has 520 million users in all). India has been ambivalent about its censorship policy for years [13] (for example, in August 2015, the government ordered 857 target sites blocked, then backtracked in the face of public outcry [24]), but in context of the fact that legally^{Footnote 1} the executive branch in India holds unqualified power to block information, it is natural to be concerned about free speech in India. We begin by asking what policy, and what mechanism the Indian government currently employs; how this might change in future; and what unintended effects such censorship might have on foreign traffic transiting Indian ASes.

Our first step was to formally approach the authorities, by filing a Right to Information [25] request (RTI), inquiring about the policies and mechanism the government uses to block content. While the policy itself was confidential, the government was willing to share that the responsibility for filtering lies with individual ISPs, and that they could implement any mechanism they choose^{Footnote 2}, as long as they uniformly comply with the given censorship policy.

In practice, an ad hoc approach to filtering generally leads to inconsistencies and errors [54], especially during updates [48]. Our initial experiments suggest that this is indeed the case; filtering policies are highly inconsistent across ISPs (see Table 1), contrary to the government’s expectations as stated in the official response. The current “feudal” approach to policing the Internet in India, viz. allowing ISPs to implement their own censorship mechanisms (which, as we show, do not “strictly adhere” to government diktats), results in inconsistent censorship policy enforcement: for e.g., our findings show that users may be able to evade censorship more easily when accessing pornographic sites via Airtel, a large private ISP that screens fewer sites, compared to others such as MTNL.

We next consider the question of how, in future, the government might enforce a unified censorship policy for the whole country. The usual mechanism to enforce a single policy, is to redirect all Internet traffic through a single point of control, where all the traffic can be monitored(this approach has been employed by Iran [34], Venezuela [7], and Saudi Arabia [60]). Even in the case of China, a whole layer of state-controlled ASes must be used to act as a filtering layer that provides Internet connectivity to other ASes [60]. Nearly all the filtering is carried out by two Autonomous Systems - AS 4134 and AS 4812 [62].

Can the government, in future, force all networks to re-route their traffic via a chosen ISP so as to monitor the network? We note that India’s Internet infrastructure was grown through a laissez-faire approach (closely correlated with the cellular networking boom), and now consists of \({\approx }900\) ASes (over 170 of which are ISPs) [28]; it would require a massive effort to redirect all traffic through this new provider. Quite likely, the amount of disruption caused by such a redirection would make it difficult for a democratic nation to implement by fiat.

Might the government implement filtering with the existing infrastructure, without necessarily enforcing traffic redirection? For the existing network, is it possible to find a small set of “heavy-hitter” ASes (and network elements in these ASes) that can potentially monitor or censor traffic without too much collateral damage? More formally:

Is it feasible to filter/monitor India’s Internet traffic? If so, how, and where? Given that India has over 900 ASes,
1. 1.
  Are there a small number of key ASes and routers where the government can intercept most Indian traffic to censored sites?
2. 2.
  How does the number of censorious ASes required, vary with the censorship technique – e.g. IP blacklisting, DNS Injection, IP Prefix Hijacking?
How much collateral damage will traffic filtering cause? Internet censorship by an “upstream” AS can lead to inadvertent traffic filtering for its customers. How much impact can Indian censorship have on traffic that simply transits Indian cyberspace?

To answer the above questions, in this paper, we map the AS-level paths from each Indian AS to the potentially censored websites (our test corpus includes not only the sites publicly announced as being blocked, but also others from public resources such as Herdict [12]). We then construct router-level maps within these ASes, using Rocketfuel [58]. Finally, we identify the “key” ASes and routers, i.e. those which appear in an overwhelming majority of paths (and which are, therefore, the logical locations for network filtering).

Our experimental findings reveal that ten ASes cumulatively intercept over \(95\%\) of the paths connecting Indian ASes to the sites in our study (i.e. potentially censored sites). Eight of these key ASes, acting together, can poison \({\approx }99\%\) of the network paths leading to DNS resolvers in India (as well as other publicly available services such as GoogleDNS and OpenDNS), thus censoring URL requests. Even more alarming, when we consider another mechanism of censorship - IP Prefix Hijacking - we find five ASes, each of which can individually poison the BGP routes for almost all ASes in the country. Even though the actual number of routers needed for such efforts varies dramatically (from 7 in some ASes, to as high as 1782), overall, a total of less than 5000 routers across all the eight ASes are required for IP or DNS filtering – about \(70\%\) of which routers belong to two large private ISPs and any one of five key ASes is enough, if the government resorts to more aggressive measures like IP Prefix Hijack.

Finally, we note that paths that transit Indian ASes but originate outside India form a substantial fraction of the Internet: if India were in fact to adopt a comprehensive censorship scheme in its key ASes, she would censor about \(1.15\%\) of all Internet paths to the censored sites, worldwide.

Thus, the above findings would indicate that, in fact, ordinary Indian citizens should be concerned about censorship, and perhaps start to equip themselves with anti-censorship tools [39].

We begin by discussing the background and related work, in the next section.

2 Background and Related Work

The interaction of the Internet with government policy (especially censorship and privacy issues) is a controversial subject [14, 15, 30]. Our case study in this paper, India, is a democratic nation, but there is sufficient evidence of Indian censorship [8, 21] that anti-censorship research organizations declare India “partly-free” [20]. For example, the Indian government officially demands that organizations (e.g. Google Inc., Microsoft etc.) censor pages deemed objectionable [9].

At present, the government delegates the censorship of traffic to ISPs, as per ambiguous blacklists^{Footnote 3}. This loose approach to censoring traffic leads to inconsistent filtering across ISPs – some users may be able to evade censorship by virtue of their provider ISP.

The question arises whether the Indian government can impose a centralized filter (as seen in e.g. Iran). Creating a new AS and redirecting through it would have high costs in network disruption, latency, service quality, and so on. But such a process will not be necessary if the current structure of Indian Internet is already well suited for monitoring and censorship.

To determine the set of ASes and routers where adversary may install infrastructure for censoring large fraction of network paths, as they exist today, we generated AS and router-level maps of India. We used such maps to identify such key ASes and routers, and the impact they have.

2.1 Background

Our paper relies heavily on mapping the structure of the Internet, an area of research called network cartography [44]. The Internet consists of routers and hosts, but also has some further structure: the routers and hosts belong to Autonomous Systems, which are independent networks (independent in the sense, they themselves choose who to exchange traffic with). Consequently, Internet mapping proceeds at two levels:

1.
AS-level mapping. For our research, we required Internet maps representing paths connecting IP address of censored site to various ASes. We thus chose Qiu and Gao [56] AS path mapping approach. Their technique uses publicly-available BGP routes (obtained from various Internet Exchange Points across the globe [31])) and the relationships between the ASes [41], and outputs a directed graph of the Internet connecting IP prefixes to all ASes of the world.

Other AS-level mapping approaches, such as the CAIDA Ark Project [3] and iPlane [53], involve traceroute probes from various vantage points to IPs in different ASes. Such approaches rely on traceroute and are generally limited by the network locations and availability of the volunteered probing nodes; they may not provide the AS-level path between any two randomly chosen ASes.
2.
Router-level mapping. An AS is not a black box, but contains hosts and routers. Mahajan et al. [58] show how the internal structure of an AS can be mapped, by a combination of traceroute probes, IP alias resolution^{Footnote 4}, and reverse DNS lookups.

Powers of the Adversary: Our adversary is a censorious government. The adversary aims to filter Internet traffic, and for this purpose may perform IP filtering, DNS injection/URL Filtering, and IP prefix hijacking attacks. We note that even a government has limitations; for example, it would prefer to implement filtering at a small number of locations, rather than at every ISP network in the nation, because of both various political and technical factors (e.g. if changing the blacklist implies wide scale router level re-configuration, there will almost certainly be inconsistencies and failures in enforcement).

2.2 Related Research

Much of the study of modern Internet censorship was developed in the context of China [49, 61,62,63], particularly the different censorship techniques employed and the network destinations filtered. For e.g., Winter and Lindskog [61] examine how the Chinese authorities use DPI-capable routers to detect Tor Bridges. Others, such as [33], explored the mechanics of DNS filtering and how China is contributing to collateral damage. A major step forward was made by Verkamp and Gupta [32], who deployed clients in 11 countries (including India) to identify their network censorship activities – IP and URL filtering, keyword filtering and DNS censorship etc. Later authors – Nabi [55] in Pakistan, and Halderman et al. [34] in Iran – demonstrate different methods of censorship employed by their respective regimes, as well as different forms of content blocked. Such studies of censorship in repressive regimes are often limiting, as they require Internet access from almost all network locations inside the country (Nabi et al. were able to get access from only five locations, and Halderman from only one).

We take a different direction with this paper. While we begin by examining instances of network censorship in our target country (India), our main aim is to determine the potential for censorship, in case the regime decides to become more censorious. Specifically, how bottlenecked is the Indian Internet? Is it possible for the adversary to place censors in a relatively small set of ASes and routers, and still filter a large fraction of network paths (and thus potentially users)? - if so, this presents a much lower barrier to entry than monitoring in every AS.

The most relevant related work we are aware of, is Singh et al.’s study of how Internet censorship correlates to network cartography [59]. The authors show a strong correspondence between the Freedom House Index [5] of a nation and its Internet topology, and indeed, claim that a nation’s network topology is the best indicator of a countrys level of freedom. Our work makes use of network topology as well: we use it to determine the “key” network locations (ASes and routers) where the adversary (censorious government) would rationally deploy censorship infrastructure, if its aim was to censor all or almost all Internet traffic in the country, and the impact of such measures on network paths originating both within and outside the nation (but transiting Indian ASes). We perform this study for various traffic filtering techniques in the following section.

3 Motivation, Problem Description and Methodology

3.1 Preliminary Findings and Motivation

Well-studied censorious countries, such as China, Iran, and Saudi Arabia, tend to have a very clear censorship policy. In contrast, India has a rather ad hoc approach: the government expects all ISPs to (independently) enforce its policies. We find that in practice, traffic filtering is highly inconsistent across popular Indian ISPs – the set of blacklisted sites varies by orders of magnitude.

Table 1. Censorship trends in India: some initial results.

Full size table

To study such inconsistencies, we selected a list of 540 potentially censored websites, divided into 8 different categories (ranging from escort services, to anti-censorship tools like Tor [40]). We then systematically observed the censorship policy in different ISPs, by trying to access our potentially-censored websites through them.

Table 1 summarizes our findings. The rows represent the ISPs, columns correspond to the category of site which being filtered, and each entry is a 3-tuple \((c_n, o_n, x_n)\) representing the number of each type of response – censored, open, and inaccessible.^{Footnote 5} For example, we probed 150 escort websites through the Airtel network, and observed 50 to be censored, 80 open, and 20 inaccessible.

We note that the variation of censorship by ISP is quite dramatic: Airtel blocks only 1 out of the 50 pornographic sites probed, whereas MTNL blocks 45.

It is clearly difficult to get hundreds of independent ISPs to correctly comply with censorship orders. The question arises whether, if the government decides to enforce a single policy, it is able to do so. So the question arises, are there a few key bottlenecks in the existing network, where filtering may be carried out?

3.2 Problem Description

In our research we are particularly interested in finding a small set of key locations (ASes and routers) that intercept a large fraction of network paths. More specifically, our questions are as follows.

Is it possible for the government to monitor/censor a large fraction of Internet traffic by controlling only a small number of network locations (viz. ASes and routers)?
What fraction of traffic could be filtered, and who would be most affected?
Would such censorship affect users outside the country as well?

3.3 Evaluation Methodology

Identifying Potential Network locations for IP Filtering: In order to estimate the locations for installing IP filtering infrastructure, we built an AS-level map using paths in the Internet, then focused on Indian ASes and their connections. Our map was built using Gao’s algorithm [56], which finds AS-level paths to the home AS of chosen IP prefixes (in our case, censored sites) from every other AS in the Internet. The algorithm uses links from known AS paths in BGP routing tables; we obtained tables from a number of vantage points [31].

Unlike other nations, which have an unambiguous list of blocked sites [55], India has no clear censorship policy. We created a corpus of sites blacklisted by various government decrees (as reported by popular media), and also added the sites reported as blocked in India by the crowd-sourced censorship-reporting sites like Herdict [12]. These included social media sites, political sites, sites related to unfriendly nations, and p2p file-sharing sites. Finally, we added to the list the adult sites popular in India (as per Alexa [2]).

We randomly sampled about 100 sites from this corpus. We then computed the paths between all Indian ASes and these prefixes. The ASes appearing in these paths were sorted by frequency of occurrence; we thus selected the few most frequent ones.

Do these ASes appear in paths to other potentially blocked sites as well? To answer such questions, we re-estimated our paths with another set of about 220 sites, chosen from the corpus. The heavy-hitter ASes for this new set of paths were the same as the ones found before.

Intra-AS Topology Generation: In the second round of experiments, we employed the Rocketfuel algorithm [58] to compute the router-level paths through 10 heavy-hitter ASes (i.e. major Indian ISPs), then identified the routers which occur in a large fraction of paths (i.e. the heavy-hitter routers in heavy-hitter ASes), as follows.

1.
Using planetlab nodes, we ran traceroute probes to three representative IPs in each prefix advertised by the ASes and by their immediate (1-hop) customer ASes.

Traceroute returned router level paths leading to and out of the said ASes.
2.
From the traceroute trace, we chose the sub-paths consisting of router IPs advertized by the AS under study (i.e. router within the ASes, identified from [16]).
3.
We resolved the aliases (corresponding to the discovered router IPs) with Midar [18] alias resolution tool.
4.
Finally, from the discovered traceroute paths we selected the minimum number of routers which cumulatively intercept a large fraction of the paths. To do this we chose the following heuristic:
- If total number of edge routers are less than total number of edge and core routers that intercept a large fraction of the paths (over \(90\%\)), then we selected the edge routers alone (as the set of edge routers cover \(100\%\) of paths through the AS).
- Else, we selected the “heavy-hitter” (core plus edge routers), appearing in a very large fraction of the paths (over \(90\%\)); not all edge routers may appear as often as others (edge and core routers appearing in the discovered paths).

Identifying Potential Sites for DNS Based Filtering: Another common approach to censorship is to prevent the DNS service from resolving requests. The censor either instructs DNS servers (within its jurisdiction) to filter requests for blacklisted URLs, or installs infrastructure to intercept DNS queries on routers (en-route to DNS servers) and respond with bogus IPs or NXDOMAIN responses – also referred to as DNS Injection attack.

Filtering DNS requests, either by simply dropping them, or by responding with bogus responses, could be carried out at the DNS server. However, in a country like India, hosting more than 55000 DNS servers, distributed across different networks, reconfiguring all such servers to filter DNS queries for blacklisted sites would not be easy (besides simple disobedience, there would also be misconfiguration bugs, delays, and network downtime). It would be much more practical to identify a few ASes (and routers therein), that intercept all or almost all the network paths connecting DNS servers to all ASes in the country.

To identify key ASes for DNS injection, we began by identifying the DNS resolvers across all Indian prefixes. We probed IP prefixes of every Indian AS for available DNS servers (UDP port 53) using nmap [51], and noted whether the response was open, filtered, or closed. (Closed corresponds to ICMP ‘destination port unreachable’ message responses from the destination. Open means the client received a meaningful response. Filtered indicates that the client received no response^{Footnote 6}.)

Each IP, for which we obtained a filtered or open response, was sent a request to resolve the IP address of some popular WWW destinations (e.g. https://www.google.com). Addresses that allowed resolution were added to our list of publicly available DNS resolvers.

Finally, using Gao’s algorithm, we constructed a graph of prefix-to-AS paths connecting the IP prefixes corresponding to DNS resolvers, and all the Indian ASes. To find the ASes which would be most effective at DNS injection, we identified ASes at the intersection of a large number of these paths.

Impact of IP Prefix Hijack Based Censorship: In an IP Prefix Hijacking attack, malicious BGP routers advertise fake AS-level paths^{Footnote 7} in an attempt to poison routes to an IP prefix (see Fig. 1), thus attracting a large volume of traffic [35, 36, 42, 45, 57].

Prefix hijacking is an extremely aggressive attack, and unlikely to be used in practice; but it has been used in the wild (e.g. blocking of YouTube by Pakistani ISPs [23], and also those involving ConEd (US), TTNet (Turkey), Link Telekom (Russia) among others [46]) and remains viable as an orthogonal way of censoring traffic. So for completeness, we have also considered prefix hijacking as a potential tool for censoring the Internet in India.

In general, for a successful prefix hijack attack, the malicious AS either broadcasts a shorter path to the prefix, or claims to own it outright. The attacking AS advertises fake routes for the targeted prefix to all its neighbors. Ballani et al. [35] report that receiving ASes accept these advertisements based on the following heuristics:

1.
If there exists a customer path towards the target IP and iff the advertisement presents a shorter customer path, then choose it, else reject it.
2.
If there exist a provider path towards the target IP and iff the advertisement presents a shorter provider path, then accept it. For all other cases, the paths are accepted without considering the length.
3.
If there exist a peer path towards the target IP and iff the advertisement bears a shorter peer path, accept it. Customer paths are accepted without length considerations while provider paths are ignored.

Estimating the Impact of Prefix Hijack Attack: To study the potential impact IP prefix hijacking, we used the previously constructed AS-level topology and chose an attacker AS with a high node degree(i.e. the number of ASes adjacent to the said AS). Inspecting the prefix-to-AS paths, we identified ASes with which the attacker AS had a business relationship, and applied Ballani’s heuristics to determine the number of ASes potentially affected by fake advertisements.

Collateral Damage Due to Traffic Censorship: Several non-Indian ASes rely on Indian ASes for Internet connectivity. Censorship activities in Indian ASes may potentially filter the traffic of these non-Indian customers as well [33]. For example, such unintended filtering was reported by Omantel, that peers with the Indian ISP Bharti Airtel [17]. As one of our research objectives, we try to identify ASes outside India that may be affected by Indian censorship. We identify paths which do not originate in India, but pass through or terminate in India. The non-Indian customers on such paths may face unwanted access restrictions.

4 Experimental Results

Continuing from the description of our experiment in the previous section, in this section we present our results. First, we consider router-level filtering, and how many ASes and routers must be selected for effective censorship (in terms of coverage of paths to filtered destinations). Along similar lines, we identify the locations where the adversary could launch a DNS injection attack. We go on to present the results of simulating IP prefix hijack attacks on Indian ASes. Finally, we report the collateral damage to foreign ASes due to IP filtering in India.

4.1 Network Locations for IP (Router-Level) Filtering

As mentioned earlier, we first obtained paths connecting Indian ASes to about 100 potential target sites (chosen from our corpus). Figure 3 represents the number of paths an individual AS intercepts; the horizontal axis of the graph indicates the ASes, ranked according to the number of paths each one intercepts. A small number of Indian ASes appear in the overwhelming majority of these paths; these ASNs and their owner organizations are presented in the Table 2.

Table 2. AS Ranks, their ASNs and their owners.

Full size table

The question remains whether the ASes we observe are simply an artifact of the 100 target sites we chose. To check whether this is so, we repeated the experiment with another (non-overlapping) sample of 220 target sites from our corpus. The same 10 ASes covered the vast majority of paths to both sets of target sites, indicating that they are very likely major Indian providers of Internet infrastructure, and cover a majority of paths to any target sites.

The cumulative results of paths intercepted vs total number of ASes, corresponding to both experiments, is presented in Fig. 2. As evident, we only need 4 ASes to censor over \(90\%\) of the paths to the censored destinations, and 10 ASes for \(95\%\) of the paths. Figure 3 represents the number of paths intercepted by each of these ASes individually.

Intra-AS Topology: We now consider the question of which routers (in our key ASes) are responsible for carrying the vast majority of Indian Internet traffic. Following Mahajan et al.’s approach [58] (as described previously in Subsect. 3.3), we create router-level maps of the key ASes, and identify routers that appear on a large fraction of the paths.

Figure 4 shows the fraction of paths these routers cumulatively intercept. (For privacy concerns, we refrain from revealing the IP addresses of these routers.)

Table 3. The total number of edge and core routers in 9 ASes that appear in over \(90\%\) of the discovered paths. For eg.,. AS4755 has a total of 8404 routers (1779 edge + 6229 core). However, the total number of edge routers (1779) is less than the number of heavy hitters (6434).

Full size table

Table 3 represents the number of edge and core routers that cumulatively appear in over \(90\%\) of the traceroute paths. The adversary could choose to place filters either at these points - heavy hitter routers of the heavy hitter ASes - or at the edge routers of the ASes, which together see all the traffic that passes through the AS. We find that the total number of edge routers is less than the number of “heavy-hitting” edge and core routers, and conclude that the lowest-cost solution for the adversary is to install censorship infrastructure on the (total of 4996) edge routers.

We note that, at present, the number of key routers varies significantly across ASes, from 7 to 1782. In case of the larger ASes, the AS network administrator could likely improve on our figures, by combining our findings with better information about the router-level topology and setting routing policy to pass all traffic through a smaller number of routers. Hence our count of 4996 routers is essentially an upper bound, limited by the policies of the present day.

Collateral Damage: Our graph of paths from censored prefixes to ASes has 186, 679 paths of Indian origin (\(1.76\%\) of paths). A comparable number - 121, 931 paths of foreign origin (1.15% of paths) - transit through or terminate in an Indian AS. Censorship by Indian ASes may inadvertently impact a very large number of unintended customers, across Finland, Hong Kong, Singapore, Malaysia, the US, and so on.

4.2 Censorship Through DNS Filtering

Using our approach for identifying open DNS resolvers, we identified a total of 55, 234 publicly accessible DNS servers from probing all 12.10 million Indian IPs.

After identifying the prefixes corresponding to these each resolver IP, we selected one corresponding to each AS^{Footnote 8} In all, we selected 355 prefixes, representative of 355 unique Indian ASes. Finally, using Gao’s algorithm, we estimated the paths from each Indian AS to the (prefixes corresponding to) DNS resolvers in India. Cumulatively, 8 ASes (according to path frequency) can intercept \(99.14\%\) of these paths, and potentially launch DNS based filtering or Injection attacks (see Fig. 5).

We note that these 8 ASes also appear among the 10 top ASes we identified for IP filtering and IP prefix hijacking. Hence, the same key routers for each of these ASes (as per Table 3) may be selected for installing infrastructure to launch DNS injection (or other DNS level filtering schemes). In all, 4906 routers across the 8 ASes can cumulatively filter DNS traffic for all Indian ASes^{Footnote 9}.

4.3 Censorship Through IP Prefix Hijacking

For IP prefix hijacking, we chose to simulate attacks from the ASes with high node degree. Based on our censored-prefix-to-AS topology graph, we identified the top 10 ASes by node degree, and determined the number of ASes potentially vulnerable to attacks from each of these ASes. The results of these simulations are presented in Table 4.

Table 4. IP prefix hijack: a single AS (e.g. AS9498), is well capable of censoring the traffic of all 896 Indian ASes and few (59) non-Indian ASes through prefix hijack attack.

Full size table

The table shows that a small number of ASes in India can potentially affect traffic from all Indian ASes, as well as a considerable number of foreign ones. For example, fake advertisements by AS4755 can impact a total of 955 ASes (896 Indian and 41 others). To effectively launch an IP prefix hijacking attack, the government needs control over the BGP speakers (which form a small fraction of all the routers of an AS); for ASes such as AS9730, with 7 edge and 63 core routers, this number is probably very small.

4.4 Analysis of Results

We observe that a very small number of ASes (less than 10) intercept a large fraction of AS-level paths connecting Indian ASes to our list of potentially censored sites (obtained from public announcements of censored sites in India), and that this affects a substantial number of foreign users as well. While this result is interesting, there remains the question of whether it applies to censored sites in general, or only the ones in our sample.

Our request to the Indian government, under its own Right to Information Act [25], for the complete list of censored sites^{Footnote 10}, was refused by the Indian Government Department of Telecommunications and IT, citing confidentiality concerns. Therefore, to cross-validate our results we randomly sampled two sets of target sites from our corpus, and ran our algorithm on each in isolation. The same set of key ASes appeared in both sets.^{Footnote 11}

We believe that DNS filtering is a viable threat. Should the aforementioned ASes filter DNS requests, they would also impact over \(99\%\) of the AS-level paths connecting Indian ASes to DNS resolvers both within and outside India (particularly services such as GoogleDNS and OpenDNS). We note in passing that DNS filtering is more powerful than simple IP filtering: even if a censored site were hosted in a Content Distribution Network (CDN), a user would be unable to reach its content on the CDN, as the request would still have the URL of the origin site, and would thus be filtered.

Finally, while IP prefix hijacking is rarely used (owing to its potential to cause major network outages - e.g., the Pakistan Government’s blocking of Youtube [23]), there exist five Indian ASes, each of which could censor traffic for all (or nearly all) Indian users by launching an IP prefix hijack attack. Moreover, only a handful of routers in each of these ASes – viz. the BGP speaking routers may be sufficient for such attacks.

5 Limitations and Future Work

5.1 Limitations

Our approach in this paper is to generate AS and router-level maps of India, and identify the key ASes and routers that intercept a large fraction of network paths. This approach is clearly limited to a snapshot of routing at a moment in time, and in fact we intend to see how our results vary over several years in future work. In addition, our AS-level and router-level mapping algorithms have the following limitations.

AS Path Estimation (Gao’s Algorithm): Our path estimation strategy is limited by the quality of publicly-available BGP routes.

Route-collector bias: It has been argued by Gregori et al. that the existing route collectors (like routeviews [27], BGPmon [29], RIPE [26], PCH [22] etc.) miss many of the peering relationships between smaller ASes; our map, as it uses Routeviews data, inherits this weakness.
Incorrect route advertisements: In general, BGP routes are known contain artifacts of misconfiguration and bogus advertisements [23, 50]. Our estimated paths may also be contaminated with such artifacts.

Router Level Topology Estimation: The discovered topology may not reveal the actual router-level paths for packets traveling between the IPs of the probed AS and the censored websites.

Router-level path variability: Router-level maps of an AS are far more variable than AS-level maps: the latter rely on AS peering information (which is based on business relationships, that do not change frequently), while the former change with network conditions. Routing tables themselves are prone to inconsistencies and bogus routes [48, 54].
Imperfect coverage by Traceroute: We used a large number of planetlab nodes to launch traceroute probes^{Footnote 12}, but there remains a chance that some routes are simply not covered; further increasing the number of vantage points, i.e. probing hosts, may improve our topology estimation by discovering new paths.
Routers filtering traceroute probes: In many cases, routers are configured to not reply to traceroute probes with the usual ICMP TTL Expired messages, and remain anonymous, thereby reducing the accuracy of our estimated router-level topology.

5.2 Future Work

Our study of Internet censorship in India can be directly extended to other nations; while our case study was done with Indian data, we make use of no features peculiar to India. We are currently extending our analysis to other countries, and developing metrics for how “centralized” a country is (i.e. how many key ASes it takes to censor traffic in a country), as well as how “central” it is in the global Internet (measured by the extent of collateral damage it can cause). There are several other directions to extend this research, which we will explore next.

First - objectionable content is frequently hosted on social media sites, or other sites with apparently benign URLs. Might the government target search engines and social networking sites as well(as seen in China)? Would this be a full blacklist, or partial?^{Footnote 13} And if so, would our key ASes be different for these target websites?

There is also the question of whether popular anti-censorship and anonymity preserving tools like Tor may be attacked by controlling a few network points. Finally, we also intend to consider the question of policing the cellular data network^{Footnote 14}, in our future work.

6 Concluding Remarks

Though the Indian state declares that it has a unified Internet censorship policy, the current state of censorship (where the responsibility of network filtering is left to individual ASes) is highly inconsistent. However, our results also show that if the Indian government wishes to impose a single policy, the structure of the Indian Internet shows that it would only need to control a small set of locations. (Furthermore, a significant fraction of network paths from foreign customers, which transit India, will be collateral damage for Indian censorship.)

1.
Though India has \({\approx }900\) ASes, 10 ASes cover \({\approx }95\%\) of AS-level paths; a nationwide censor using IP-filtering functionality would need to control \({\approx }5000\) routers – a challenging, but tractable, number. In particular, two private ISP networks control over \(70\%\) of those routers (and may optimize the router selection further).
2.
DNS based filtering requires only eight of these ASes and impacts \({>}99\%\) of the AS-level paths connecting Indian ASes to the DNS resolvers both within and outside India (for services like GoogleDNS and OpenDNS).
3.
Any one of five ASes is capable of disrupting network connectivity for all Indian ASes, through IP prefix hijacking attacks.

India, unlike China, is still ambivalent w.r.t. censorship, but the findings in this paper indicate that ordinary citizens should indeed be concerned (and possibly start to equip themselves with censorship circumvention techniques), as large scale censorship would not be very difficult for the government to implement.

Notes

1.
Information Technology Act of India 2008 (Section 69A).
2.
IP and URL blacklists [38] are common, but ISPs may choose to employ more invasive techniques, such as DNS Injection Attacks [47] or even IP Prefix Hijacking [35, 46].
3.
Several authors have mentioned how these blacklists vary over time [1, 11].
4.
Different interfaces of the same router, with different IPs, are called IP aliases.
5.
We explain these terms below.
- Censored: the ISP intercepted the requests, and responded with an HTML iframe displaying a filtering message (indicating that requested URL had been blocked as per the directions from the Department of Telecommunication).
- Open: Websites were accessible without filtering.
- Inaccessible: Websites were “down”. There was not enough information to determine if the sites were inaccessible due to network or system outages, or requests were deliberately filtered or throttled by the ISP.
.
6.
This may be due to unavailability or filtering by firewall(s).
7.
Alternatively, router misconfiguration can also lead to similar situations [54].
8.
For multiple prefixes belonging to same AS, we selected one with most resolvers.
9.
As mentioned in the previous sub-section, this number may be further reduced by routing optimization on the part of the AS network administrator.
10.
RTI number: DOTEL/R/2017/50126.
11.
We also note that these ASes are, in fact, partners to foreign network providers, and provide connectivity for almost every smaller AS in the country. This is perhaps unsurprising, given the hierarchical nature of the Internet as a whole [41].
12.
The looking-glass servers used by the original authors [58] were unavailable at the time of our experiments.
13.
Semantics-based filtering is very hard; e.g. attempts to block jihadi mouthpiece sites also block sites that monitor jihad as a threat, such as jihadwatch.org.
14.
As per reports published in recent years, India has 860 million cellular users [10].

References

830 more websites blocked in India, many torrent links in list. http://indiatoday.intoday.in/technology/story/830-more-websites-blocked-in-india-many-torrent-links-in-list/1/748565.html
Alexa - Actionable Analytics for the Web. http://www.alexa.com/
Archipelago (ARK) Measurement Infrastructure. http://www.caida.org/projects/ark/
Censorship in France. http://www.laquadrature.net/en/french-parliament-approves-net-censorship
Censorship in India by Freedom House. https://freedomhouse.org/report-types/freedom-press
Censorship in Sweden. https://www.dangerandplay.com/2016/01/29/sweden-caught-censoring-the-internet-1984-style/
Censorship in Venezuela: Over 370 internet addresses blocked. https://panampost.com/pedro-garcia/2016/07/20/censorship-in-venezuela-over-370-internet-addresses-blocked/
Censorship is India by India Times. http://telecom.economictimes.indiatimes.com/tele-talk/internet-censorship-regulating-india-s-internet/1369
Court Cases Regarding Internet Censorship. https://opennet.net/news/india-court-summons-google-facebook-microsoft-executives
Government of India Department of Telecom. Telecom Annual report - India, 2012–2013 (2013). goo.gl/H7O13n
Govt of India wants 32 URLs, including Dailymotion, Vimeo and Github, banned. http://indianexpress.com/article/technology/social/government-wants-32-urls-including-dailymotion-vimeo-banned-in-india/
Herdict: Help Spot Web Blockages. http://herdict.org/
India is partly free by freedom house. https://freedomhouse.org/report/freedom-net/2011/india
India’s supreme court strikes down controversial internet censorship law. https://techcrunch.com/2015/03/23/indias-supreme-court-strikes-down-controversial-internet-censorship-law/
The internet censorship saga in India. https://internetdemocracy.in/2012/03/the-internet-censorship-saga-in-india/
IP to as mapping, team Cymru. http://www.team-cymru.org/IP-ASN-mapping.html
ISP of Oman suffers web filtering by Indian censorship. https://citizenlab.org/2012/07/routing-gone-wild/
Midar. http://www.caida.org/tools/measurement/midar/
Number of Indian internet users. http://www.internetlivestats.com/internet-users-by-country/
ONI report for India. https://opennet.net/research/profiles/india
OpenNet Initiative. https://opennet.net/
Packet Clearing House, San Francisco, CA, USA. http://www.pch.net
Pakistan Hijacks YouTube. https://www.ripe.net/publications/news/industry-developments/youtube-hijacking-a-ripe-ncc-ris-case-study
Porn websites blocked in India: Government plans ombudsman for online content. http://gadgets.ndtv.com/internet/news/porn-websites-blocked-in-india-government-plans-ombudsman-for-online-content-723485
Right to information, a citizen gateway. http://rti.gov.in/
Ripe NCC, Amsterdam, the Netherlands, ripe NCC routing information service. http://www.ripe.net/data-tools/stats/ris/routing-information-service
Route views project. http://archive.routeviews.org/
Service providers list - telecom regulatory authority of India. http://www.trai.gov.in/Content/ProviderListDisp/3_ProviderListDisp.aspx
University of Colorado, Fort Collins, CO, USA, BGPmon. http://bgpmon.netsec.colostate.edu
Websites blocked by Indian government. http://sflc.in/wp-content/uploads/2015/12/censorship.-2012-2015.pdf
University of Oregon Route Views Project (2000). http://www.routeviews.org/
Verkamp, J.P., Gupta, M.: Inferring mechanics of web censorship around the world. Presented as part of the 2nd USENIX Workshop on Free and Open Communications on the Internet. USENIX, Berkeley (2012)
Google Scholar
Anonymous. The collateral damage of internet censorship by DNS injection. SIGCOMM Comput. Commun. Rev. 42(3), 21–27 (2012)
Article Google Scholar
Aryan, S., Aryan, H., Halderman, J.A.: Internet censorship in Iran: a first look. Presented as part of the 3rd USENIX Workshop on Free and Open Communications on the Internet. USENIX, Berkeley (2013)
Google Scholar
Ballani, H., Francis, P., Zhang, X.: A study of prefix hijacking and interception in the internet. SIGCOMM Comput. Commun. Rev. 37(4), 265–276 (2007)
Article Google Scholar
Butler, K., Farley, T.R., McDaniel, P., Rexford, J.: A survey of BGP security issues and solutions. Proc. IEEE 98(1), 100–122 (2010)
Article Google Scholar
Crandall, J.R., Zinn, D., Byrd, M., Barr, E.T., East, R.: ConceptDoppler: a weather tracker for internet censorship. In: ACM Conference on Computer and Communications Security, pp. 352–365 (2007)
Google Scholar
Dalek, J., Haselton, B., Noman, H., Senft, A., Crete-Nishihata, M., Gill, P., Deibert, R.J.: A method for identifying and confirming the use of URL filtering products for censorship. In: Proceedings of the 2013 Conference on Internet Measurement Conference, pp. 23–30. ACM (2013)
Google Scholar
Dingledine, R., Mathewson, N., Syverson, P.: Tor: the second-generation onion router. In: Proceedings of the 13th USENIX Security Symposium, pp. 303–319, August 2004
Google Scholar
Dingledine, R., Mathewson, N., Syverson, P.: Tor: the second-generation onion router. Technical report, DTIC Document (2004)
Google Scholar
Gao, L.: On inferring autonomous system relationships in the internet. IEEE/ACM Trans. Netw. 9(6), 733–745 (2001)
Article Google Scholar
Goldberg, S., Schapira, M., Hummon, P., Rexford, J.: How secure are secure interdomain routing protocols. In: ACM SIGCOMM Computer Communication Review, vol. 40, pp. 87–98. ACM (2010)
Article Google Scholar
Guo, S., Feng, G.: Understanding support for internet censorship in China: an elaboration of the theory of reasoned action. J. Chin. Polit. Sci. 17(1), 33–52 (2012)
Article MathSciNet Google Scholar
Haddadi, H., Rio, M., Iannaccone, G., Moore, A., Mortier, R.: Network topologies: inference, modeling, and generation. IEEE Commun. Surv. Tutor. 10(2), 48–69 (2008)
Article Google Scholar
Hu, X., Mao, Z.M.: Accurate real-time identification of IP prefix hijacking. In: IEEE Symposium on Security and Privacy, SP 2007, pp. 3–17. IEEE (2007)
Google Scholar
Jacquemart, Q.: Towards uncovering BGP hijacking attacks. Ph.D. thesis, Télécom ParisTech, 2015
Google Scholar
Jones, B., Feamster, N., Paxson, V., Weaver, N., Allman, M.: Detecting DNS root manipulation. In: Karagiannis, T., Dimitropoulos, X. (eds.) PAM 2016. LNCS, vol. 9631, pp. 276–288. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-30505-9_21
Chapter Google Scholar
Le, F., Lee, S., Wong, T., Kim, H.S., Newcomb, D.: Detecting network-wide and router-specific misconfigurations through data mining. IEEE/ACM Trans. Netw. 17(1), 66–79 (2009)
Article Google Scholar
Leberknight, C.S., Chiang, M., Poor, H.V., Wong, F.: A taxonomy of internet censorship and anti-censorship. In: Fifth International Conference on Fun with Algorithms (2010)
Google Scholar
Luckie, M.: Spurious routes in public BGP data. ACM SIGCOMM Comput. Commun. Rev. 44(3), 14–21 (2014)
Article Google Scholar
Lyon, G.: Nmap: The Network Mapper - Free Security Scanner. http://insecure.org/fyodor/
MacKinnon, R.: Flatter world and thicker walls? Blogs, censorship and civic discourse in China. Public Choice 134(1–2), 31–46 (2008)
Google Scholar
Madhyastha, H.V., Isdal, T., Piatek, M., Dixon, C., Anderson, T.E., Krishnamurthy, A., Venkataramani, A.: iPlane: an information plane for distributed services. In: Proceedings of 7th USENIX Symposium on Operating Systems Design and Implementation (OSDI), pp. 367–380, November 2006
Google Scholar
Mahajan, R., Wetherall, D., Anderson, T.: Understanding BGP misconfiguration. In: ACM SIGCOMM Computer Communication Review, vol. 32, pp. 3–16. ACM (2002)
Article Google Scholar
Nabi, Z.: The anatomy of web censorship in Pakistan. In: Presented as part of the 3rd USENIX Workshop on Free and Open Communications on the Internet. USENIX, Berkeley (2013)
Google Scholar
Qiu, J., Gao, L.: As path inference by exploiting known as paths. In: Global Telecommunications Conference, GLOBECOM 2006, pp. 1–5. IEEE (2006)
Google Scholar
Qiu, J., Gao, L., Ranjan, S., Nucci, A.: Detecting bogus BGP route information: going beyond prefix hijacking. In: Third International Conference on Security and Privacy in Communications Networks and the Workshops, SecureComm 2007, pp. 381–390. IEEE (2007)
Google Scholar
Rocketfuel: An ISP Topology Mapping Engine. http://www.cs.washington.edu/research/networking/rocketfuel/
Singh, R., Koo, H., Miramirkhani, N., Mirhaj, F., Gill, P., Akoglu, L.: The politics of routing: investigating the relationship between as connectivity and internet freedom. In: 6th USENIX Workshop on Free and Open Communications on the Internet (FOCI 2016). USENIX Association (2016)
Google Scholar
Stevenson, C.: Breaching the great firewall: China’s internet censorship and the quest for freedom of expression in a connected world. BC Int. Comp. L. Rev. 30, 531 (2007)
Google Scholar
Winter, P., Lindskog, S.: How the great firewall of China is blocking tor. In: Proceedings of the USENIX Workshop on Free and Open Communications on the Internet (FOCI 2012), August 2012
Google Scholar
Xu, X., Mao, Z.M., Halderman, J.A.: Internet censorship in China: where does the filtering occur? In: Spring, N., Riley, G.F. (eds.) PAM 2011. LNCS, vol. 6579, pp. 133–142. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-19260-9_14
Chapter Google Scholar
Zittrain, J., Edelman, B.: Internet filtering in China. IEEE Internet Comput. 7(2), 70–77 (2003)
Article Google Scholar

Download references

Author information

Authors and Affiliations

IIIT Delhi, New Delhi, India
Devashish Gosain, Anshika Agarwal, Sahil Shekhawat & Sambuddho Chakravarty
Rochester Institute of Information Technology, Rochester, NY, USA
H. B. Acharya

Authors

Devashish Gosain
View author publications
You can also search for this author in PubMed Google Scholar
Anshika Agarwal
View author publications
You can also search for this author in PubMed Google Scholar
Sahil Shekhawat
View author publications
You can also search for this author in PubMed Google Scholar
H. B. Acharya
View author publications
You can also search for this author in PubMed Google Scholar
Sambuddho Chakravarty
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Devashish Gosain .

Editor information

Editors and Affiliations

Wilfrid Laurier University, Waterloo, Ontario, Canada
Xiaodong Lin
University of New Brunswick, Fredericton, New Brunswick, Canada
Ali Ghorbani
University at Buffalo, Buffalo, New York, USA
Kui Ren
Pennsylvania State University, Philadelphia, Pennsylvania, USA
Sencun Zhu
Anhui Normal University, Wuhu, China
Aiqing Zhang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gosain, D., Agarwal, A., Shekhawat, S., Acharya, H.B., Chakravarty, S. (2018). Mending Wall: On the Implementation of Censorship in India. In: Lin, X., Ghorbani, A., Ren, K., Zhu, S., Zhang, A. (eds) Security and Privacy in Communication Networks. SecureComm 2017. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 238. Springer, Cham. https://doi.org/10.1007/978-3-319-78813-5_21

Download citation

DOI: https://doi.org/10.1007/978-3-319-78813-5_21
Published: 11 April 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-78812-8
Online ISBN: 978-3-319-78813-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Mending Wall: On the Implementation of Censorship in India

Abstract

Similar content being viewed by others

Internet Censorship in Italy: A First Look at 3G/4G Networks

Internet Censorship Capabilities in Cyprus: An Investigation of Online Gambling Blocklisting

Leveraging Internet Services to Evade Censorship

Keywords

1 Introduction