A heavy-duty web file browser for cRaZy files.

Go to file

Cyberes 112ab0e08f clarify log item		2023-12-11 21:37:16 -07:00
Doc	Merge dev to master	2023-12-08 22:25:59 -07:00
src	clarify log item	2023-12-11 21:37:16 -07:00
.gitignore	should be pretty good!	2023-07-17 23:20:21 -06:00
LICENSE	Initial commit	2023-07-13 13:54:42 -06:00
README.md	track running crawls and add an admin page, use basic auth for admin, reject crawl if already running for a path, limit max directory crawlers, fix some issues	2023-12-11 18:05:59 -07:00
config.yml.sample	should be pretty good!	2023-07-17 23:20:21 -06:00
todo.txt	update todo	2023-12-11 19:12:00 -07:00

README.md

crazy-file-server

A heavy-duty web file browser for CRAZY files.

The whole schtick of this program is that it caches the directory and file structures so that the server doesn't have to re-read the disk on every request. By doing the processing upfront when the server starts along with some background scans to keep the cache fresh we can keep requests snappy and responsive.

I needed to serve a very large dataset full of small files publicly over the internet in an easy to browse website. The existing solutions were subpar and I found myself having to create confusing Openresty scripts and complex CDN caching to keep things responsive and server load low. I gave up and decided to create my own solution.

You will likely need to store your data on an SSD for this. With an SSD, my server was able to crawl over 6 million files stored in a very complicated directory tree in just 5 minutes.

Features

Automated cache management. Fill the cache when the starts, or as requests come in.
File browsing API.
Download API.
Restrict certain files and directories from the download API to prevent users from downloading your entire 100GB+ dataset.
Frontend-agnostic design.
Basic searching or Elasticsearch integration.

Install

Install Go.
Download the binary or do cd src && go mod tidy && go build

Use

Edit config.yml. It's well commented.
./crazyfs --config /path/to/config.yml. You can use -d for debug mode to see what it's doing.

By default, it looks for your config in the same directory as the executable: ./config.yml or ./config.yaml.

If you're using initial cache and have tons of files to scan you'll need at least 5GB of RAM and will have to wait 10 or so minutes for it to traverse the directory structure. CrazyFS is heavily threaded, so you'll want at least an 8-core machine.

You'll need something line Nginx if you want SSL or HTTP. Also, CrazyFS works great with an HTTP cache in front of it.