Measuring Popularity of Cryptographic Libraries in Internet-Wide Scans

Nemec,  Matúš; Klinec,  Dušan; Švenda,  Petr; Sekan,  Peter; Matyáš,  Václav

Measuring Popularity of Cryptographic Libraries in Internet-Wide Scans

Warning

This publication doesn't include Faculty of Sports Studies. It includes Faculty of Informatics. Official publication website can be found on muni.cz.

Authors	NEMEC Matúš KLINEC Dušan ŠVENDA Petr SEKAN Peter MATYÁŠ Václav
Year of publication	2017
Type	Article in Proceedings
Conference	Proceedings of the 33rd Annual Computer Security Applications Conference
MU Faculty or unit	Faculty of Informatics
Citation
web	Supplementary materials Proceedings of the 33rd Annual Computer Security Applications Conference Measuring Popularity of Cryptographic Libraries in Internet-Wide Scans
Doi	http://dx.doi.org/10.1145/3134600.3134612
Field	Informatics
Keywords	RSA algorithm; cryptographic library; prime generation
Description	We measure the popularity of cryptographic libraries in large datasets of RSA public keys. We do so by improving a recently proposed method based on biases introduced by alternative implementations of prime selection in different cryptographic libraries. We extend the previous work by applying statistical inference to approximate a share of libraries matching an observed distribution of RSA keys in an inspected dataset (e.g., Internet-wide scan of TLS handshakes). The sensitivity of our method is sufficient to detect transient events such as a periodic insertion of keys from a specific library into Certificate Transparency logs and inconsistencies in archived datasets. We apply the method on keys from multiple Internet-wide scans collected in years 2010 through 2017, on Certificate Transparency logs and on separate datasets for PGP keys and SSH keys. The results quantify a strong dominance of OpenSSL with more than 84% TLS keys for Alexa 1M domains, steadily increasing since the first measurement. OpenSSL is even more popular for GitHub client-side SSH keys, with a share larger than 96%. Surprisingly, new certificates inserted in Certificate Transparency logs on certain days contain more than 20% keys most likely originating from Java libraries, while TLS scans contain less than 5% of such keys. Since the ground truth is not known, we compared our measurements with other estimates and simulated different scenarios to evaluate the accuracy of our method. To our best knowledge, this is the first accurate measurement of the popularity of cryptographic libraries not based on proxy information like web server fingerprinting, but directly on the number of observed unique keys.
Related projects:	Advancing cryptanalytic methods through evolutionary computing Zapojení studentů Fakulty informatiky do mezinárodní vědecké komunity