Katti: An Extensive and Scalable Tool for Website Analyses

Katti is a comprehensive tool designed to address technical challenges in web crawling and analysis, particularly for security and privacy research. The tool enables large-scale website analyses by providing modular components for crawling, data processing, and visualization, while ensuring scalability and adaptability to diverse research needs. Its architecture supports extensibility through custom plugins and parallelized execution for efficient processing of complex website structures. The paper presents Katti’s technical implementation details, including its distributed architecture and API design, along with case studies demonstrating its effectiveness in detecting security vulnerabilities and privacy risks across modern web applications. The open-source implementation allows researchers to conduct reproducible analyses while handling the technical complexities of modern web technologies.