Commit 1c0a4b

2024-11-01 08:33:58 Qwas: Save archivebox 使用
/dev/null .. archivebox \344\275\277\347\224\250.md
@@ 0,0 1,77 @@
+ # archivebox 使用
+
+ (未完待续)
+
+ ## 安装 archivebox
+
+ ```sh
+ # create a folder to store your data (can be anywhere)
+ mkdir -p ~/archivebox/data && cd ~/archivebox
+
+ # download the compose file into the directory
+ # curl -fsSL 'https://docker-compose.archivebox.io' > docker-compose.yml
+ curl --proxy http://127.0.0.1:7890 -fsSL 'https://docker-compose.archivebox.io' > docker-compose.yml
+
+ # (shortcut for getting https://raw.githubusercontent.com/ArchiveBox/ArchiveBox/stable/docker-compose.yml)
+
+ # initialize your collection and create an admin user for the Web UI (or set ADMIN_USERNAME/ADMIN_PASSWORD env vars)
+ docker compose run archivebox init
+ docker compose run archivebox manage createsuperuser
+ ```
+
+ ## sonic 全文检索
+
+ ```sh
+ # download the sonic config file into your data folder (e.g. ~/archivebox)
+ # curl -fsSL 'https://raw.githubusercontent.com/ArchiveBox/ArchiveBox/dev/etc/sonic.cfg' > sonic.cfg
+ curl --proxy http://127.0.0.1:7890 -fsSL 'https://raw.githubusercontent.com/ArchiveBox/ArchiveBox/dev/etc/sonic.cfg' > sonic.cfg
+
+ # then uncomment the sonic-related sections in docker-compose.yml
+ vi docker-compose.yml
+
+ # to backfill any existing archive data into the search index, run:
+ docker compose run archivebox update --index-only
+ ```
+
+ ```sh
+ docker compose up -d
+ ```
+
+ 此时 访问 ip:8000 可以浏览页面了
+
+
+
+ ## 安装 chrome
+
+ 抓取需要登录的内容,通过 cookie 设置
+
+ ```sh
+ sudo apt update
+ sudo apt install chromium-browser
+ # or on some systems:
+ sudo apt install chromium
+ ```
+
+ 修改docker-compose.yml
+
+ ```yml
+ services:
+ archivebox:
+ ...
+ volumes:
+ ...
+ - ./data/personas/Default:/data/personas/Default
+ environment:
+ - CHROME_USER_DATA_DIR=/data/personas/Default/chrome_profile
+ - DISPLAY=novnc:0.0
+ novnc:
+ image: theasp/novnc:latest
+ environment:
+ - DISPLAY_WIDTH=1920
+ - DISPLAY_HEIGHT=1080
+ - RUN_XTERM=no
+ ports:
+ - "8080:8080"
+ ```
+
+ 添加 CHROME_USER_DATA_DIR和 DISPLAY
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9