StealthScrape is a powerful and efficient file-scraping tool designed to extract specific file types from websites with ease. It automates the process of collecting PDFs, XLS, XML, HTML, PHP, JS, CSS, and more from a given domain. The tool operates through a simple command-line interface, prompting users for necessary inputs like the target domain

CanXploit/StealthScrape

StealthScrape

StealthScrape is a free and open-source tool designed to scrape and grab files of various types (e.g., PDF, XLS, XML, HTML, HTM, PHP, JS, CSS, etc.) from a specified website domain. It is lightweight, customizable, and easy to use.

Features

  • Scrapes files with different extensions like .pdf, .xls, .xml, .html, .htm, .php, .js, .css, and more.
  • Simple terminal interface.
  • Free and open-source.
  • Customizable to fit your needs.

Installation

  1. Clone this repository:

    git clone https://github.com/CanXploit/StealthScrape.git
  2. Navigate into the project directory:

    cd StealthScrape
  3. Install the required dependencies (if any) via pip:

    pip install -r requirements.txt

Usage

  1. After installation, run the tool using the following command:

    python stealthscrape.py
  2. The tool will prompt you to enter the following:

    • Domain URL: Enter the domain you want to scrape (e.g., example.com).
    • Proxy File: Enter the name of the proxy file (optional, if needed for scraping).
  3. Follow the prompts to specify the file extensions you want to grab (e.g., pdf,xls,html,js).

  4. The tool will start scraping the website and download the files into the current directory.
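The extension-matching step described above can be illustrated with a short sketch. This is not StealthScrape's actual code: the names `parse_extensions`, `LinkCollector`, and `find_matching_files` are hypothetical, and the sketch assumes the tool filters same-domain links by file extension. It uses only the Python standard library and works on an HTML string, so it makes no network requests.

```python
import posixpath
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


def parse_extensions(raw):
    """Turn input such as 'pdf, .XLS,js' into {'.pdf', '.xls', '.js'}."""
    extensions = set()
    for part in raw.split(","):
        part = part.strip().lower().lstrip(".")
        if part:  # skip empty entries such as 'pdf,,js'
            extensions.add("." + part)
    return extensions


class LinkCollector(HTMLParser):
    """Collects every href and src attribute value found in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name in ("href", "src") and value:
                self.links.append(value)


def find_matching_files(html, base_url, extensions):
    """Return same-host absolute URLs whose path ends in a wanted extension."""
    collector = LinkCollector()
    collector.feed(html)
    host = urlparse(base_url).netloc
    matches = []
    for link in collector.links:
        absolute = urljoin(base_url, link)  # resolve relative links
        parsed = urlparse(absolute)
        if parsed.netloc != host:  # assumption: stay on the target domain
            continue
        extension = posixpath.splitext(parsed.path)[1].lower()
        if extension in extensions and absolute not in matches:
            matches.append(absolute)
    return matches


if __name__ == "__main__":
    sample = '<a href="/docs/report.pdf">Report</a><a href="/about.html">About</a>'
    wanted = parse_extensions("pdf,xls,html,js")
    for url in find_matching_files(sample, "https://example.com/", wanted):
        print(url)
```

Each URL returned this way would then be downloaded into the current directory, as described in step 4.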

Screenshot

Below is a screenshot showing how StealthScrape works from the terminal:

[Screenshots: StealthScrape Terminal, StealthScrape Terminal2]

Next Updates

Scan downloaded files for important data such as keys, tokens, and other sensitive values.

Videos

ShowCase

The Team

CanXploit
@CanXploit

License

See the LICENSE file for StealthScrape's license terms.

ALL FOR EDUCATIONAL PURPOSES. I DO NOT CONDONE OR PROMOTE HACKING OR GAINING UNFAIR ADVANTAGES ON ANY SITE OR IP.
