Dernière minute
FRJournaliste d'Al Jazeera tué dans un bombardement israélien à GazaFRFrappes israéliennes au Liban : le détroit d'Ormuz fermé par l'Iran, un soldat israélien tuéFRCanicule : 35 départements en vigilance rouge, des milliers de personnes manifestent pour les droits LGBT+FRUn bébé de 1 an tué par balle par la police lors d'une interventionFRDeux adolescents meurent noyés dans le Doubs à Besançon, la canicule suscite des inquiétudesFRConsommation d'alcool interdite sur la voie publique à Paris et dans d'autres départementsFRTop 14 : Dupont titulaire, Kinghorn à l’arrière, la compo du Stade Toulousain face au Racing 92FRWilliam Saliba : « Serrer les dents pour une Coupe du monde, c’est tous les quatre ans »FRCôte d'Ivoire ouvre le score face à l'AllemagneFRCoupe du Monde 2026 : les infos du jourFRJournaliste d'Al Jazeera tué dans un bombardement israélien à GazaFRFrappes israéliennes au Liban : le détroit d'Ormuz fermé par l'Iran, un soldat israélien tuéFRCanicule : 35 départements en vigilance rouge, des milliers de personnes manifestent pour les droits LGBT+FRUn bébé de 1 an tué par balle par la police lors d'une interventionFRDeux adolescents meurent noyés dans le Doubs à Besançon, la canicule suscite des inquiétudesFRConsommation d'alcool interdite sur la voie publique à Paris et dans d'autres départementsFRTop 14 : Dupont titulaire, Kinghorn à l’arrière, la compo du Stade Toulousain face au Racing 92FRWilliam Saliba : « Serrer les dents pour une Coupe du monde, c’est tous les quatre ans »FRCôte d'Ivoire ouvre le score face à l'AllemagneFRCoupe du Monde 2026 : les infos du jour
Newsgather
BackVisualizing Enormous Malware Datasets: A Stacking Thought Experiment
Visualizing Enormous Malware Datasets: A Stacking Thought Experiment
Tech
TechCrunch13.05.2026Tech2 dk okumaUnited States

Visualizing Enormous Malware Datasets: A Stacking Thought Experiment

L'essentiel

Malware repositories vx-underground (30TB) and VirusTotal (31PB) are visualized in terms of stacked hard drives, comparing their heights to landmarks like the Eiffel Tower and Burj Khalifa.

Résumé généré par IA

Pourquoi c'est important

Malware research and storage are critical for cybersecurity and AI training.

Taille de police

Malware research group vx-underground, which says it has the largest collection of malware source code, said in a post on X that its archive of data amounts to about 30 terabytes. A reply by Bernardo Quintero, founder of VirusTotal, an online service that scans files for malware across multiple antivirus engines at once, said his service has about 31 petabytes of malware samples that users have contributed to date. (A petabyte is ~1,000x larger than a terabyte.) In both cases, that’s a lot of data. For context, cybersecurity companies, AI researchers, and threat intelligence firms treat repositories like these as critical for training detection models and understanding how attacks evolve. But this had us wondering: What would these enormous datasets actually look like stacked as hard drives one on top of the other and side by side? And how would they compare to, say, the Eiffel Tower? Someone in our newsroom asked an AI chatbot this question, and it got it incredibly wrong. Instead, we did some rough back-of-a-napkin math to figure out how tall these data banks would be. Since vx-underground and VirusTotal both have “about” that much data each, “about” is good enough for us in this case. Let’s say we’re using 1 terabyte capacity internal hard drives, since these are generally designed to be the same physical size to fit inside any computer. These standardized 3.5-inch internal hard drives are 1 inch in height, which for the sake of stacking one on top of the other is really what we want to know here. We’re also assuming that the hard drives we’re using in this example are exactly 1 terabyte, because in reality the total usable file capacity of a hard drive is generally somewhat less. Using this online conversion tool, it looks like vx-underground’s 30 terabytes of malware data could fill 30 hard drives stacked on top of one another, reaching 30 inches, or about 2.5 feet tall. For reference, this reporter is 6 feet tall. (See visual below, and yes, terrible opsec, I know.) With that same logic, VirusTotal’s 31 petabytes of submitted data would fill 31,744 hard drives, which stacked on top of one another would reach about 2,645 feet. The world’s tallest building, the Burj Khalifa in Dubai, is slightly taller at 2,722 feet. The Eiffel Tower is 1,083 feet tall. By that logic, VirusTotal has about two and a half Eiffel Towers’ worth of data.

Questions ouvertes

  • How do these datasets impact AI model effectiveness?

Sujets liés

This article was originally published by TechCrunch.

Articles liés

Apple Unveils Numerous App and Service Upgrades at WWDC Beyond Siri
En développement·6 sa önce

Apple Unveils Numerous App and Service Upgrades at WWDC Beyond Siri

Apple announced significant updates to its core apps and services at WWDC, including enhanced Apple Maps with 'Local Lists' and improved 'Flyover,' more flexible location sharing in Find My, and advanced bill splitting in Apple Wallet powered by Apple Intelligence. Other updates include redesigned Apple Pay checkout, expanded Apple Music features like lyrics translation, new search capabilities in Apple Podcasts, improved iCloud Shared Albums, and a new Fitness+ program for menopause.

TechCrunch
Plus sur ce sujetmalware