ProxmoxVE

mirror of https://github.com/community-scripts/ProxmoxVE.git synced 2026-03-02 17:05:55 +01:00

Author	SHA1	Message	Date
CanbiZ (MickLesk)	fddc47064d	core: read from /dev/tty in all interactive prompts | fix empty or cropped logs due build process (#12406)	2026-02-28 14:51:26 +01:00
CanbiZ (MickLesk)	a2dc3f44d3	feat: graceful fallback for apt-get update failures (#12386 ) Add apt_update_safe() function that warns instead of aborting when apt-get update fails (e.g. enterprise repo 401 Unauthorized). Shows a helpful hint about disabling the enterprise repo when no subscription is active. Replaces direct apt-get update calls in build.func and install.func.	2026-02-27 14:39:39 +01:00
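The "warn instead of abort" behaviour described in a2dc3f44d3 could look roughly like the sketch below. This is not the code from build.func or install.func: the function name `apt_update_safe_sketch` and the `APT_UPDATE_CMD` override are assumptions, added so the fallback path can be exercised without root or network access.

```bash
#!/usr/bin/env bash
# Hypothetical sketch of a "warn, don't abort" wrapper around apt-get update.
# APT_UPDATE_CMD is an assumption of this example (not from the commit); it
# lets the failure path be tested by substituting a command that always fails.
apt_update_safe_sketch() {
  local cmd="${APT_UPDATE_CMD:-apt-get update}"

  # Intentionally unquoted so "apt-get update" splits into command + argument.
  if $cmd >/dev/null 2>&1; then
    return 0
  fi

  # The update failed (for example a 401 from the enterprise repo).
  # Warn on stderr and let the caller continue with the cached package lists.
  echo "WARN: package index update failed; continuing with cached lists." >&2
  echo "HINT: without a subscription, consider disabling the enterprise repo." >&2
  return 0
}
```

Returning 0 on failure is the point of the design: a script running under `set -e` keeps going, and the hint tells the user what is most likely wrong.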
CanbiZ (MickLesk)	774bbbc6d5	core: Improve error outputs across core functions (#12378 ) * Improve error outputs across core functions * Update tools.func	2026-02-27 13:59:02 +01:00
CanbiZ (MickLesk)	b09f2db2a9	Handle job-control signals and clear tostop Prevent terminal job-control signals from suspending the script during recovery by trapping TSTP, TTIN and TTOU (instead of only TSTP) and restoring them on exit. Also clear the terminal 'tostop' flag in stop_spinner() with `stty -tostop` to avoid background spinner I/O from stopping the process group.	2026-02-25 14:45:11 +01:00
CanbiZ (MickLesk)	dd46dd2d87	core: remove duplicate traps, consolidate error handling and harden signal traps (#12316 ) * fix(zammad): configure Elasticsearch for LXC container startup - Set discovery.type: single-node (required for single-node ES) - Set xpack.security.enabled: false (not needed in local LXC) - Set bootstrap.memory_lock: false (fails in unprivileged LXC) - Add startup wait loop (up to 60s) to ensure ES is ready before Zammad installation continues Fixes #12301-related recurring Elasticsearch startup failures * refactor(api): eliminate duplicate traps, harden error handling & telemetry Phase 1 - Structural: - Remove api_exit_script() and 5 inline traps from build.func - error_handler.func is now the sole trap owner via catch_errors() - Update api.func comment reference (api_exit_script -> on_exit) Phase 2 - Quality: - Add stop_spinner() + cursor restore to error_handler(), on_interrupt(), on_terminate(), on_hangup() to prevent spinner/cursor artifacts - Enhance _send_abort_telemetry() with error text (last 20 log lines), duration calculation, and 2 retry attempts (was fire-and-forget) - Harden json_escape() to also strip DEL (0x7F) character * fix(build): show spinner during post_update_to_api to prevent Ctrl+Z abort post_update_to_api can take up to 33 seconds worst-case (3 curl attempts x 10s timeout + sleep delays). Without any terminal output during this time, users think the script is stuck and press Ctrl+Z, which prevents the recovery menu from ever appearing. Add msg_info spinner before both post_update_to_api calls in the failure path (initial report + final force retry after recovery menu). * fix(build): prevent SIGTSTP from killing recovery dialog - Replace msg_info/stop_spinner with plain echo for telemetry reporting The background spinner process in non-interactive shells (bash -c) can trigger SIGTSTP, stopping the entire process group before the recovery dialog appears. Plain echo avoids this. 
- Add trap '' TSTP at failure path entry to ignore suspension signals Prevents Ctrl+Z or terminal-related SIGTSTP from interrupting the recovery menu. Restored with trap - TSTP before exit. - Root cause: msg_info starts a background process (spinner &) that is not properly detached in non-interactive shells where job control (set -m) is OFF. The disown builtin has no effect without job control, leaving the spinner in the same process group. This can cause terminal I/O conflicts during the 33-second post_update_to_api retry window, resulting in [2]+ Stopped. * fix(test): initialize colors and remove illegal local in test harness - Call load_functions() after sourcing core.func to initialize color/formatting/icon variables (RD, GN, YW, CL, TAB, etc.) - Remove 'local' keyword from top-level scope (not inside function) - Default REPO_SOURCE to ref_api instead of main * chore: remove test-recovery-dialog.sh from branch * Revert "fix(zammad): configure Elasticsearch for LXC container startup" This reverts commit `10e450b72f`. * fix(build): show telemetry status only in verbose mode Telemetry reporting is an implementation detail that doesn't help the user during failure recovery. Wrap echo statements with VERBOSE check so they only appear when verbose mode is enabled.	2026-02-25 14:08:24 +01:00
Tempest	8c0016c0a7	Fix detection of ssh keys (#12230 )	2026-02-25 08:23:14 +01:00
CanbiZ (MickLesk)	0e364adb54	core: fix broken "command not found" after err_trap (#12280 )	2026-02-24 14:22:07 +01:00
CanbiZ (MickLesk)	3c83654666	fix(telemetry): move 'configuring' transition to right after container creation Validation status was persisting through container start, network check, and OS customization. Now transitions to 'configuring' immediately after create_lxc_container returns. Validation only covers storage/template/cluster checks as intended.	2026-02-23 17:11:14 +01:00
CanbiZ (MickLesk)	ae3a249854	Capture lxc-attach output to host log Pipe lxc-attach output through tee into /tmp/.install-capture-${SESSION_ID}.log and use PIPESTATUS[0] to get the real lxc-attach exit code. Prefer a pulled container-side INSTALL_LOG when it exists and is >100 bytes; otherwise fall back to the host-captured terminal log (stripping ANSI codes) and append it to the combined log so get_full_log() can find it. Apply the same capture behavior to the retry path and remove temporary capture files on completion. This makes install output reliable when container-side logging is missing (DNS errors, early crashes, or missing silent() usage).	2026-02-23 17:08:38 +01:00
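The tee/PIPESTATUS technique named in ae3a249854 can be illustrated with a small bash sketch. The helper name `run_and_capture` is hypothetical, and a generic command stands in for lxc-attach; only the `tee` plus `PIPESTATUS[0]` pattern comes from the commit message.

```bash
#!/usr/bin/env bash
# run_and_capture LOGFILE CMD [ARGS...]
# Hypothetical helper: shows the command's output live, mirrors it into
# LOGFILE, and returns the command's own exit status rather than tee's.
run_and_capture() {
  local logfile="$1"
  shift

  "$@" 2>&1 | tee "$logfile"

  # PIPESTATUS[0] holds the status of the first pipeline element (the
  # command). Plain $? would report tee's status, which is almost always 0.
  return "${PIPESTATUS[0]}"
}
```

Without `PIPESTATUS`, a failing install would look successful because the pipeline's status is that of its last element.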
CanbiZ (MickLesk)	a8a1cbcf3e	feat(telemetry): add 'validation' status, fix status transitions, show 20 log lines Status flow is now: installing → validation → configuring → success/failed Changes: - post_progress_to_api() accepts optional status parameter (default: configuring) - build.func: Send 'validation' before storage/template/cluster checks - build.func: Send 'configuring' just before lxc-attach (app install) - build.func: Remove redundant progress pings during container start/network - install.func + alpine-install.func: Accept status parameter in container-side post_progress_to_api() - core.func + vm-core.func: silent() now shows last 20 lines on error (was 10)	2026-02-23 17:01:18 +01:00
CanbiZ (MickLesk)	60f9622998	fix(core): keep host-side logging on BUILD_LOG after INSTALL_LOG export After 'export INSTALL_LOG' in build.func, get_active_logfile() returned the container's INSTALL_LOG path for all host-side logging, causing msg_info/msg_ok/msg_error on the host to write to /root/.install-SESSION.log (the host file, not the container's) instead of BUILD_LOG. This made BUILD_LOG incomplete and get_full_log() unable to send full traces. Fix: Add _HOST_LOGFILE (not exported, invisible to container) so the host always logs to BUILD_LOG. Container still uses INSTALL_LOG as before.	2026-02-23 16:07:13 +01:00
CanbiZ (MickLesk)	691cec80ab	core: Enhance signal handling, reported "status" and logs (#12216 ) * Enhance telemetry, signal handling, and logs Improve failure telemetry and signal handling across the installer: add get_full_log() to collect/strip/truncate install logs and include them in API payloads with a truncated retry; add CONTAINER_INSTALLING flag around lxc-attach and stop containers on abort to avoid orphaned "installing/configuring" records; introduce _send_abort_telemetry() (curl fallback for container context) and _stop_container_if_installing() helpers; centralize and simplify EXIT/ERR/INT/TERM/HUP traps and handlers (including a new on_hangup handler) and update VM scripts to report numeric exit codes. Also ensure best-effort log collection is performed and tweak error categorization for certain signals. * Include full log in error telemetry Use get_full_log (up to 120KB) to populate the error telemetry field so the API receives the full installation trace; fall back to get_error_text (last ~20 lines) if the full log is empty. Removed collection and inclusion of a separate install_log field from the JSON payloads and simplified the retry payloads/comments accordingly. The change ensures error reports contain the complete trace while avoiding duplicate large log fields and keeps graceful failure handling (get_full_log \|\| true). * Anonymize IP addresses in get_full_log Mask IPv4 addresses in logs when collecting full log output: added a sed step that replaces the last two octets with "x.x" to avoid exposing full IPs (GDPR). Also updated the comment to reflect anonymization; existing steps that strip carriage returns and ANSI escape sequences remain in place before truncating with head -c.	2026-02-23 14:30:48 +01:00
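The log clean-up steps listed in 691cec80ab (strip carriage returns, strip ANSI escape sequences, mask the last two IPv4 octets, truncate with `head -c`) might be combined as in the sketch below. The function name `sanitize_log`, the exact regular expressions, and the 122880-byte default (120 KB) are assumptions of this example, not the repository's implementation.

```bash
#!/usr/bin/env bash
# sanitize_log [MAX_BYTES]
# Hypothetical filter: reads a log on stdin, writes a cleaned and truncated
# copy to stdout.
sanitize_log() {
  local max_bytes="${1:-122880}"   # assumed default: 120 KB
  local esc=$'\033' cr=$'\r'

  sed -E \
    -e "s/${cr}//g" \
    -e "s/${esc}\[[0-9;]*[a-zA-Z]//g" \
    -e 's/([0-9]{1,3}\.[0-9]{1,3})\.[0-9]{1,3}\.[0-9]{1,3}/\1.x.x/g' |
    head -c "$max_bytes"
}
```

Note that this simple pattern also masks any other dotted four-number string, such as a four-part version number; a stricter pattern would be needed to avoid that.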
CanbiZ (MickLesk)	69de9fa57e	Improve error handling and logging for LXC builds (#12208 ) Prevent host-side error_handler from being triggered during in-container install/recovery by delaying re-enabling set -Eeuo pipefail and the ERR trap in misc/build.func until after install/recovery completes; add explanatory comments. Update misc/error_handler.func to fall back to BUILD_LOG if container-internal log path is unavailable, show the last 20 log lines when present, refine container vs host detection (check INSTALL_LOG file and /root), copy INSTALL_LOG into /root and write a .failed flag with the exit code for host-side detection, and ensure full-log output and container removal prompt are shown appropriately in host context. Tweak misc/core.func silent() output to include a "Full log" path and adjust formatting.	2026-02-23 13:22:09 +01:00
CanbiZ (MickLesk)	491081ffbf	Add post_progress_to_api lightweight telemetry ping Introduce post_progress_to_api() in misc/api.func — a non-blocking, fire-and-forget curl ping (gated by DIAGNOSTICS and RANDOM_UUID) that updates telemetry status to "configuring". Wire this progress ping into multiple scripts (alpine-install.func, install.func, build.func, core.func) at key milestones (container start, network ready, customization, creation, cleanup) and replace/deduplicate some earlier post_to_api calls. Also update error_handler.func to always report failures immediately via post_update_to_api to ensure failures are captured even before/after container lifecycle.	2026-02-18 16:19:19 +01:00
CanbiZ (MickLesk)	6cc8877852	Add timeouts and prioritize telemetry on exit Prevent hangs when pulling logs from containers by wrapping pct pull calls with timeout (8s) and running ensure_log_on_host under timeout (10s). Always send telemetry (post_update_to_api) before attempting best-effort log collection so status is reported even if log retrieval blocks. Update EXIT/ERR/SIGHUP/SIGINT/SIGTERM traps and consolidate error/interrupt handlers to use the new timeouted log collection. Changes in misc/build.func and misc/error_handler.func.	2026-02-18 13:14:59 +01:00
CanbiZ (MickLesk)	b439960222	core: Execution ID & Telemetry Improvements (#12041 ) * fix: send telemetry BEFORE log collection in signal handlers - Swap ensure_log_on_host/post_update_to_api order in on_interrupt, on_terminate, api_exit_script, and inline SIGHUP/SIGINT/SIGTERM traps - For signal exits (>128): send telemetry immediately, then best-effort log collection - Add 2>/dev/null \|\| true to all I/O in signal handlers to prevent SIGPIPE - Fix on_exit: exit_code=0 now reports 'done' instead of 'failed 1' - Root cause: pct pull hangs on dying containers blocked telemetry updates, leaving 595+ records stuck in 'installing' daily * feat: add execution_id to all telemetry payloads - Generate EXECUTION_ID from RANDOM_UUID in variables() - Export EXECUTION_ID to container environment - Add execution_id field to all 8 API payloads in api.func - Add execution_id to post_progress_to_api in install.func and alpine-install.func - Fallback to RANDOM_UUID when EXECUTION_ID not set (backward compat) * fix: correct telemetry type values for PVE and addon scripts - PVE scripts (tools/pve/): change type 'tool' -> 'pve' - Addon scripts (tools/addon/): fix 4 scripts that wrongly used 'tool' -> 'addon' (netdata, add-tailscale-lxc, add-netbird-lxc, all-templates) - api.func: post_tool_to_api sends type='pve', default fallback 'pve' - Aligns with PocketBase categories: lxc, vm, pve, addon * fix: persist diagnostics opt-in inside containers for addon telemetry - install.func + alpine-install.func: create /usr/local/community-scripts/diagnostics inside the container when DIAGNOSTICS=yes (from build.func export) - Enables addon scripts running later inside containers to find the opt-in - Update init_tool_telemetry default type from 'tool' to 'pve' * refactor: clean up diagnostics/telemetry opt-in system - diagnostics_check(): deduplicate heredoc (was 2x 22 lines), improve whiptail text with clear what/what-not collected, add telemetry + privacy links - diagnostics_menu(): better UX 
with current status, clear enable/disable buttons, note about existing containers - variables(): change DIAGNOSTICS default from 'yes' to 'no' (safe: no telemetry before user consents via diagnostics_check) - install.func + alpine-install.func: persist BOTH yes AND no in container so opt-out is explicit (not just missing file = no) - Fix typo 'menue' -> 'menu' in config file comments * fix: no pre-selection in telemetry dialog, link to telemetry-service README - Add --defaultno so 'No, opt out' is focused by default (user must Tab to Yes) - Change privacy link from discussions/1836 to telemetry-service#privacy--compliance * fix: use radiolist for telemetry dialog (no pre-selection) - Replace --yesno with --radiolist: user must actively SPACE-select an option - Both options start as OFF (no pre-selection) - Cancel/Exit defaults to 'no' (opt-out) * simplify: inline telemetry dialog text like other whiptail dialogs * improve: telemetry dialog with more detail, link to PRIVACY.md - Add what we collect / don't collect sections back to dialog - Link to telemetry-service/docs/PRIVACY.md instead of README anchor - Update config file comment with same link	2026-02-18 10:24:06 +01:00
CanbiZ (MickLesk)	3ce3c6f613	tools/pve: add data analytics / formatting / linting (#12034 ) * core: add progress; fix exit status Introduce post_progress_to_api() in alpine-install.func and install.func to send a lightweight, fire-and-forget telemetry ping (HTTP POST) that updates an existing telemetry record to "configuring" when DIAGNOSTICS=yes and RANDOM_UUID is set. The function is non-blocking (curl -m 5, errors ignored) and is invoked during container setup and after OS updates to signal active progress. Also adjust api_exit_script() in build.func to report success (post_update_to_api "done" "0") for cases where the script exited normally but a completion status wasn't posted, instead of reporting failure. * Safer tools.func load and improved error handling Replace process-substitution sourcing of tools.func with an explicit curl -> variable -> source via /dev/stdin, adding failure messages and a check that expected functions (e.g. fetch_and_deploy_gh_release) are present (misc/alpine-install.func, misc/install.func). Add categorize_error mapping for exit code 10 -> "config" (misc/api.func). Tweak build.func: minor pipeline formatting and change the ERR trap to capture the actual exit code and only call ensure_log_on_host/post_update on non-zero exits, preventing erroneous failure reporting. * tools: add data init and auto-reporting to tools and pve section Introduce telemetry helpers in misc/api.func: _telemetry_report_exit (reports success/failure via post_tool_to_api/post_addon_to_api) and init_tool_telemetry (reads DIAGNOSTICS, starts install timer and installs an EXIT trap to auto-report). Integrate telemetry into many tools/addon and tools/pve scripts by sourcing the remote api.func and calling init_tool_telemetry (guarded with declare -f). Also apply a minor arithmetic formatting tweak in misc/build.func for RECOVERY_ATTEMPT.	2026-02-17 16:36:20 +01:00
CanbiZ (MickLesk)	f07f2cb04e	core: error-handler improvements \| better exit_code handling \| better tools.func source check (#12019 ) * core: add progress; fix exit status Introduce post_progress_to_api() in alpine-install.func and install.func to send a lightweight, fire-and-forget telemetry ping (HTTP POST) that updates an existing telemetry record to "configuring" when DIAGNOSTICS=yes and RANDOM_UUID is set. The function is non-blocking (curl -m 5, errors ignored) and is invoked during container setup and after OS updates to signal active progress. Also adjust api_exit_script() in build.func to report success (post_update_to_api "done" "0") for cases where the script exited normally but a completion status wasn't posted, instead of reporting failure. * Safer tools.func load and improved error handling Replace process-substitution sourcing of tools.func with an explicit curl -> variable -> source via /dev/stdin, adding failure messages and a check that expected functions (e.g. fetch_and_deploy_gh_release) are present (misc/alpine-install.func, misc/install.func). Add categorize_error mapping for exit code 10 -> "config" (misc/api.func). Tweak build.func: minor pipeline formatting and change the ERR trap to capture the actual exit code and only call ensure_log_on_host/post_update on non-zero exits, preventing erroneous failure reporting.	2026-02-17 13:25:17 +01:00
CanbiZ (MickLesk)	c9ecb1ccca	core: smart recovery for failed installs \| extend exit_codes (#11221 ) * feat(build.func): smart error recovery menu for failed installations Replace simple Y/n removal prompt with interactive recovery menu: - Option 1: Remove container and exit (default, auto after 60s timeout) - Option 2: Keep container for debugging - Option 3: Retry installation with verbose mode enabled - Option 4: Retry with 1.5x RAM and +1 CPU core (OOM errors only) Improvements: - Detect OOM errors (exit codes 137, 243) and offer resource increase - Show human-readable error explanation using explain_exit_code() - Recursive rebuild preserves ALL settings from advanced/app.vars/default.vars - Settings preserved: Network (IP, Gateway, VLAN, MTU, Bridge), Features (Nesting, FUSE, TUN, GPU), Storage, SSH keys, Tags, Hostname, etc. - Show rebuild summary before retry (old→new CTID, resources, network) - New container ID generated automatically for rebuilds This helps users recover from transient failures without re-running the entire script manually. * fix(api.func): fix duplicate exit codes and add missing error codes Exit code fixes: - Remove duplicate definitions for codes 243, 254 (Node.js vs DB) - Reassign MySQL/MariaDB to 240-242, 244 (was 241-244) - Reassign MongoDB to 250-253 (was 251-254) New exit codes added (based on GitHub issues analysis): - 6: curl couldn't resolve host (DNS failure) - 7: curl failed to connect (network unreachable) - 22: curl HTTP error (404, 429 rate limit, 500) - 28: curl timeout (very common in download failures) - 35: curl SSL error - 102: APT lock held by another process - 124: Command timeout - 141: SIGPIPE (broken pipe) Also update OOM detection to include exit code 134 (SIGABRT) which is commonly seen in Node.js heap overflow issues. Fixes based on analysis of ~500 GitHub issues. 
* fix(exit-codes): sync error_handler.func and api.func with conflict-free code ranges - Add curl error codes (6, 7, 22, 28, 35) - Add APT lock code (102), timeout (124), signals (134, 141) - Move Python codes: 210-212 → 160-162 (avoid Proxmox conflict) - Move PostgreSQL codes: 231-234 → 170-173 - Move MySQL/MariaDB codes: 241-244 → 180-183 - Move MongoDB codes: 251-254 → 190-193 - Keep Node.js at 243-249, Proxmox at 200-231 - Both files now synchronized with identical mappings * feat(exit-codes): add systemd and build error codes (150-154) - 150: Systemd service failed to start - 151: Systemd service unit not found - 152: Permission denied (EACCES) - 153: Build/compile failed (make/gcc/cmake) - 154: Node.js native addon build failed (node-gyp) Based on issue analysis: 57 service failures, 25 build failures, 22 node-gyp issues * fix(build): restore smart recovery and add OOM/DNS retry paths * feat(build): APT in-place repair, exit 1 subclassification, new exit codes - Add APT/DPKG in-place recovery: detects exit 100/101/102/255 and exit 1 with APT log patterns, offers to repair dpkg state and re-run install script without destroying the container - Add exit 1 subclassification: analyzes combined log to identify root cause (APT, OOM, network, command-not-found) and routes to appropriate recovery option - Add exit 10 hint: shows privileged mode / nesting suggestion - Add exit 127 hint: extracts missing command name from logs - Refactor recovery menu: use named option variables (APT_OPTION, OOM_OPTION, DNS_OPTION) instead of hardcoded option numbers, supports up to 6 dynamic options cleanly - Map missing exit codes in api.func: curl 27/36/45/47/55, signals 129 (SIGHUP) / 131 (SIGQUIT), npm 239 * feat(api+build): map 25 more exit codes, add SIGHUP trap, network/perm hints api.func: - Map 25+ new exit codes that were showing as 'Unknown' in telemetry: curl: 3, 16, 18, 24, 26, 32-34, 39, 44, 46, 48, 51, 52, 57, 59, 61, 63, 79, 92, 95; signals: 125, 132, 144, 146 - Update 
code 8 description (FTP + apk untrusted key) - Update header comment with full supported ranges build.func: - Add SIGHUP trap: reports 'failed/129' to API when terminal is closed, should significantly reduce the 2841 stuck 'installing' records - Add exit 52 (empty reply) and 57 (poll error) to network issue detection for DNS override recovery option - Add exit 125/126 hint: suggests privileged mode for permission errors * fix: sync error_handler fallback, Alpine APK repair, retry limit error_handler.func: - Sync fallback explain_exit_code() with api.func: add 25+ codes that were missing (curl 16/18/24/26/27/32-34/36/39/44-48/51/52/55/57/59/61/63/79/92/95, signals 125/129/131/132/144/146, npm 239, code 3/8) - Ensures consistent error descriptions even when api.func isn't loaded build.func: - Alpine APK repair: detect var_os=alpine and run 'apk fix && apk cache clean && apk update' instead of apt-get/dpkg commands - Show 'Repair APK state' instead of 'APT/DPKG' in menu for Alpine - Retry safety counter: OOM x2 retry limited to max 2 attempts (prevents infinite RAM doubling via RECOVERY_ATTEMPT env var) - Show attempt count in rebuild summary * fix(build): preserve exit code in ERR trap to prevent false exit_code=0 The ERR trap called ensure_log_on_host before post_update_to_api, which reset $? to 0 (success). This caused ~15-20 records/day to be reported as 'failed' with exit_code=0 instead of the actual error code. Root cause chain: 1. Command fails with exit code N → ERR trap fires ($? = N) 2. ensure_log_on_host succeeds → $? becomes 0 3. post_update_to_api 'failed' '$?' → sends 'failed/0' (wrong!) 4. POST_UPDATE_DONE=true → EXIT trap skips the correct code Fix: capture $? into _ERR_CODE before ensure_log_on_host runs.
 * Implement telemetry settings and repo source detection Add telemetry configuration and repository source detection function.	2026-02-17 12:14:46 +01:00
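The root-cause chain at the end of c9ecb1ccca (a helper that succeeds overwrites the shell's last exit status before it is reported) can be reproduced in isolation. In this sketch `collect_logs` and `report` are hypothetical stand-ins for ensure_log_on_host and post_update_to_api, and the handlers are plain functions rather than real ERR traps so they can be called directly.

```bash
#!/usr/bin/env bash
# Hypothetical stand-ins for the real helpers named in the commit message.
collect_logs() { true; }          # always succeeds, so it resets $? to 0
report() { echo "failed/$1"; }    # prints what would be sent to the API

# Buggy order: by the time $? is read, it holds collect_logs' status (0).
on_err_buggy() {
  collect_logs
  report "$?"
}

# Fixed order: save the failing command's status before doing anything else.
on_err_fixed() {
  local err_code=$?   # must be the very first statement in the handler
  collect_logs
  report "$err_code"
}
```

Called as `some_failing_command || on_err_fixed`, the fixed handler reports the real exit code, while the buggy one always reports 0.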
CanbiZ (MickLesk)	2dddeaf966	Call get_lxc_ip in start() before updates (#12015 )	2026-02-17 09:08:09 +01:00
CanbiZ (MickLesk)	896714e06f	core/vm's: ensure script state is sent on script exit (#11991 ) * Ensure API update is sent on script exit Add exit-time telemetry handling across scripts to avoid orphaned "installing" records. Introduce local exit_code capture in api_exit_script and cleanup handlers and, when POST_TO_API_DONE is true but POST_UPDATE_DONE is not, post a final status (marking failures on non-zero exit codes, or marking done/failed in VM cleanups based on exit code). Changes touch misc/build.func, misc/vm-core.func and various vm/-vm.sh cleanup functions to reliably send post_update_to_api on normal or early exits. Update api.func * fix(telemetry): add missing exit codes to explain_exit_code() - Add curl error codes: 4, 5, 8, 23, 25, 30, 56, 78 - Add code 10: Docker/privileged mode required (used in ~15 scripts) - Add code 75: Temporary failure (retry later) - Add BSD sysexits.h codes: 64-77 - Sync error_handler.func fallback with canonical api.func	2026-02-16 17:14:00 +01:00
CanbiZ (MickLesk)	a8977a25d4	fix(build): SIGINT/SIGTERM traps now exit properly - Add 'exit 130' after SIGINT trap handler - Add 'exit 143' after SIGTERM trap handler - Fixes issue where interrupted scripts stayed as 'Installing' in telemetry - Previously the traps only sent the update but didn't terminate the script	2026-02-15 10:46:21 +01:00
CanbiZ (MickLesk)	911f533e6a	fix(cluster): validate container IDs cluster-wide across all nodes (#11906 ) - Query /cluster/resources via pvesh to check all VMs/CTs on ALL nodes - Check /etc/pve/nodes//qemu-server and /etc/pve/nodes//lxc dirs - Handles pmxcfs sync delays that caused sporadic ID conflicts - Remove duplicate validate_container_id/get_valid_container_id definitions - Add max_attempts safeguard to prevent infinite loops	2026-02-14 15:28:47 +01:00
CanbiZ (MickLesk)	cecadf5681	core: improve error reporting with structured error strings and better categorization + output formatting (#11907 ) * fix(telemetry): improve error reporting with structured error strings and better categorization - Add build_error_string() that creates structured format: 'exit_code=N \| description\n---\n<last 20 log lines>' - Fix categorize_error() to map ALL known exit codes: - Added: shell(1,2), proxmox(200-231), service(150-154), database(170-193), runtime(243-249), signal(139,141,143) - Split timeout from network (28 was in both) - Added DPKG(255) to dependency category - Update all API functions to use build_error_string(): post_update_to_api, post_update_to_api_extended, post_tool_to_api, post_addon_to_api - Add ensure_log_on_host() calls to on_exit, on_interrupt, on_terminate handlers to prevent race condition where telemetry reports before container log is pulled to host * fix(ui): improve error output formatting and remove redundant log paths - error_handler: Use msg_info/msg_ok/msg_warn for container cleanup instead of raw echo with manual ANSI codes - error_handler: Add ❓ icon before 'Remove broken container?' prompt - error_handler: Indent log output with TAB for visual consistency - build.func: Use msg_custom for installation log path display - build.func: Use msg_info → msg_ok for container removal flow - build.func: Use msg_warn for 'kept for debugging' message - core.func/vm-core.func: Remove redundant container-internal log path display (📋 View full log) since combined log on host is the canonical location shown after failure	2026-02-14 15:28:30 +01:00
CanbiZ (MickLesk)	1a9bbdd6e7	core: unified logging system with combined logs (#11761 ) * refactor(logging): unified logging system with combined logs - Add log_msg(), log_section(), strip_ansi() helper functions to core.func - Extend msg_ok, msg_error, msg_warn, msg_info, msg_custom to write to log file - Log container settings (default and advanced) to log file - Combine host creation log and container installation log on failure - Use app-specific log path: /tmp/{app}-{ctid}-{session}.log - Add timestamps and section headers in log files - Improve get_error_text() with combined log fallback chain - Add ensure_log_on_host() for trap handlers to pull logs before API reporting * linting	2026-02-14 13:41:09 +01:00
CanbiZ (MickLesk)	f23414a1a8	Retry reporting with fallback payloads (#11885 ) Enhance post_update_to_api to support a "force" mode and robust retry logic: add a 3rd-arg bypass to duplicate suppression, capture a short error summary, and perform up to three POST attempts (full payload, shortened error payload, minimal payload) with HTTP code checks and small backoffs. Mark POST_UPDATE_DONE on success (or after three attempts) to avoid infinite retries. Also invoke post_update_to_api with the "force" flag from cleanup paths in build.func and error_handler.func so a final status update is attempted after cleanup.	2026-02-13 13:49:49 +01:00
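The three-attempt scheme described in f23414a1a8 (full payload, then a shortened one, then a minimal one, with small backoffs) follows a general "degrade the payload on each retry" pattern. The sketch below shows that pattern only: `send_with_fallback`, the sender callback and the `RETRY_DELAY` variable are hypothetical names, and no real HTTP request or HTTP code check is made.

```bash
#!/usr/bin/env bash
# send_with_fallback SENDER PAYLOAD [PAYLOAD...]
# Hypothetical helper: tries each payload in order with SENDER (a command
# that returns 0 on success) and stops at the first success.
send_with_fallback() {
  local sender="$1"
  shift

  while [ "$#" -gt 0 ]; do
    if "$sender" "$1"; then
      return 0
    fi
    shift
    # Back off briefly before the next, smaller payload (if any remain).
    if [ "$#" -gt 0 ]; then
      sleep "${RETRY_DELAY:-1}"
    fi
  done

  # Every payload was rejected; the caller decides whether to give up.
  return 1
}
```

A caller would pass its payloads from largest to smallest, for example `send_with_fallback my_post "$full" "$short" "$minimal"`, where `my_post` is whatever function wraps the actual request.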
CanbiZ (MickLesk)	cc89cdbab1	Copy install log to host before API report Copy the container install log to a host path before reporting a failure to the telemetry API so get_error_text() can read it. Introduce host_install_log and point INSTALL_LOG to the host copy when pulled via pct, move post_update_to_api after the log copy, and update the displayed installation-log path.	2026-02-13 12:29:03 +01:00
CanbiZ (MickLesk)	406d53ea2f	core: remove old Go API and extend misc/api.func with new backend (#11822 ) * Remove Go API and extend misc/api.func Delete the Go-based API (api/main.go, api/go.mod, api/go.sum, api/.env.example) and significantly enhance misc/api.func. The shell telemetry file now includes telemetry configuration, repo source detection, GPU/CPU/RAM detection, expanded explain_exit_code mappings, and refactored post_to_api/post_to_api_vm to send non-blocking telemetry to telemetry.community-scripts.org while respecting DIAGNOSTICS/DEV_MODE and adding richer metadata (cpu/gpu/ram/repo_source). Also updates header/author info and improves privacy/robustness and error handling. * Start install timer and refine error reporting Call start_install_timer during build startup and overhaul exit/error reporting. Changes: - Invoke start_install_timer early in misc/build.func to track install duration. - Update api_exit_script comments to reference PocketBase/api.func and adjust ERR/SIGINT/SIGTERM traps to post numeric exit codes (use $? / 130 / 143) instead of command strings. - Replace the previous explain_exit_code implementation with a conditional fallback: only define explain_exit_code if not already provided (api.func is the canonical source). Expanded and reorganized exit code mappings (curl, timeout, systemd, Node/Python/Postgres/MySQL/MongoDB, Proxmox, etc.). - In error_handler: stop echoing the container log path (host shows combined log), and post a "failed" update to the API with the exit code before offering container cleanup. Rationale: these changes make telemetry more consistent and robust (numeric codes), provide a safe fallback for exit descriptions when api.func isn't loaded, and ensure failures are reported to the API prior to any automatic cleanup. 
* Report install start/failure to telemetry API Add telemetry hooks in misc/build.func: call post_to_api at installation start to capture early or immediately-failing installs, and call post_update_to_api with status "failed" and the install exit code when a container installation fails. This improves visibility into install failures for monitoring/telemetry.	2026-02-12 11:55:13 +01:00
ls-root	b53a731c42	fix(core): respect EDITOR variable for config editing (#11693 ) - Replace hardcoded nano with ${EDITOR:-nano} for file editing - Default to nano if no EDITOR environment variable is set	2026-02-11 16:38:46 +01:00
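The `${EDITOR:-nano}` expansion mentioned in b53a731c42 is standard shell parameter expansion: it yields the value of EDITOR, or `nano` when EDITOR is unset or empty. A minimal illustration, with `pick_editor` as a hypothetical helper name:

```bash
#!/usr/bin/env bash
# Hypothetical helper: prints the editor that would be launched.
# ${EDITOR:-nano} falls back to nano when EDITOR is unset OR empty.
pick_editor() {
  printf '%s\n' "${EDITOR:-nano}"
}
```

A script would then open a file with something like `"$(pick_editor)" "$config_file"`, where `config_file` is again only an illustrative name.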
CanbiZ (MickLesk)	1357a6f26e	fix(msg_menu): redirect menu display to /dev/tty to prevent capture in command substitution	2026-02-09 17:12:56 +01:00
CanbiZ (MickLesk)	7715a02f05	remove whiptail from update scripts for unattended update support (#11712) * Simplify Alpine update scripts to run upgrade Remove interactive whiptail menus, loops and newt dependency checks from ct/alpine-docker.sh, ct/alpine-zigbee2mqtt.sh, and ct/alpine.sh. Each update_script now simply calls header_info, runs $STD apk -U upgrade, displays a success message and exits, simplifying and automating the update flow. * feat(update-scripts): replace whiptail with msg_menu for unattended updates Remove all whiptail dialogs from ct update_script() functions and replace with msg_menu() - a lightweight read-based menu that supports: - PHS_SILENT=1: auto-selects first (default) option for unattended mode - Interactive: numbered menu with 10s timeout and default fallback Converted scripts (whiptail menu → msg_menu): - plex.sh, npmplus.sh, cronicle.sh, meilisearch.sh, node-red.sh - homeassistant.sh, podman-homeassistant.sh - vaultwarden.sh, alpine-vaultwarden.sh - loki.sh, alpine-loki.sh - alpine-grafana.sh, alpine-redis.sh, alpine-valkey.sh - alpine-nextcloud.sh Simplified scripts (removed unnecessary whiptail for single-action updates): - alpine.sh, alpine-docker.sh, alpine-zigbee2mqtt.sh Special handling: - gitea-mirror.sh: replaced yesno/msgbox with read -rp confirmations, exit 75 in silent mode for major version upgrades requiring interaction - vaultwarden.sh/alpine-vaultwarden.sh: passwordbox replaced with read -r -s -p, skipped in silent mode with warning - nginxproxymanager.sh: exit 1 → exit 75 for disabled script Infrastructure: - Added msg_menu() helper to misc/build.func - Added exit code 75 handling in update-apps.sh (skip, not fail) Closes #11620 * refactor(update-scripts): remove menus where sequential updates suffice - alpine-nextcloud: add apk upgrade as the update action (was missing) - meilisearch: run meilisearch + UI updates sequentially (like bar-assistant) - npmplus: run alpine upgrade + docker pull sequentially, no menu - vaultwarden: update VaultWarden + Web-Vault sequentially, remove admin token option (interactive-only, not suitable for unattended updates) - alpine-vaultwarden: just run apk upgrade, remove admin token menu	2026-02-09 11:05:31 +01:00
Chris	bb0188b38c	[Fix] build.func: QOL grammar adjustment for Creating LXC message (#11633)	2026-02-06 20:55:35 +01:00
Michel Roegl-Brunner	47d63e92bf	build.func: Replace storage variable with searchdomain variable (#11322)	2026-01-29 13:29:25 +01:00
Alexander	cca0d9e584	Fix whiptail menu loop when other interfaces are present (#11237)	2026-01-28 17:35:24 +01:00
Lavacano	bac7f07a74	The added sed command s/^[- ]*// removes any leading dashes or spaces from the description (#11285)	2026-01-28 17:31:26 +01:00
CanbiZ (MickLesk)	910723c745	Revert "fix(build): use pct mount to fix Debian 13 root ownership" This reverts commit `6267250e49`.	2026-01-28 14:54:30 +01:00
CanbiZ (MickLesk)	6267250e49	fix(build): use pct mount to fix Debian 13 root ownership	2026-01-28 14:54:15 +01:00
CanbiZ (MickLesk)	b35437c391	fix(build): fix Debian 13 root ownership from host before customization	2026-01-28 14:44:00 +01:00
CanbiZ (MickLesk)	ff4f5f6a0a	core: update dynamic values in LXC profile on update_motd_ip (#11268) * feat(build.func): update dynamic values in LXC profile on update_motd_ip - Updates only OS/Hostname/IP lines in /etc/profile.d/00_lxc-details.sh - Checks if values changed before updating (avoids unnecessary I/O) - Preserves user customizations (app name, GitHub link, custom lines) - Only updates if file exists and contains 'community-scripts' marker - Fixes outdated OS version display after in-place upgrades (e.g., Bookworm → Trixie) - Now reads OS name/version from /etc/os-release at login time Fixes community-scripts/ProxmoxVE issue with static MOTD after OS upgrade * add update_motd_ip in routine	2026-01-28 13:26:20 +01:00
CanbiZ (MickLesk)	fa29f74c31	feat(build.func): add nesting warning for systemd-based distributions (#11208)	2026-01-26 22:21:09 +01:00
CanbiZ (MickLesk)	eb596d2364	core: add IPv6 fallback support to get_current_ip functions | add check for SSH_KEYS_FILE in user_defaults (#11067) * fix(core): add IPv6 fallback support to get_current_ip functions Fixes IP detection in IPv6-only environments by adding fallback: - Try IPv4 first (existing behavior) - Fall back to IPv6 via interface lookup (eth0 scope global) - Fall back to IPv6 routing with Google/Cloudflare DNS targets Closes #issue-ipv6-only-environment * fix(build): use SSH_AUTHORIZED_KEY from user defaults when no SSH_KEYS_FILE exists When using 'User Defaults' with var_ssh_authorized_key set in default.vars, the SSH key was not being installed because install_ssh_keys_into_ct() only checked for SSH_KEYS_FILE (which is only created in Advanced Settings). Now the function also checks SSH_AUTHORIZED_KEY and creates a temporary SSH_KEYS_FILE if needed. Fixes #11062	2026-01-23 09:05:17 +01:00
CanbiZ (MickLesk)	7395a44277	Run TypeScript compilation in Joplin Server scripts Added 'yarn run tsc' to both update and install scripts for Joplin Server to ensure TypeScript sources are compiled. Also removed an unused variable from build.func for code cleanup.	2026-01-21 13:07:20 +01:00
CanbiZ (MickLesk)	fe2384a2fa	Initialize variables in create_lxc_container Added initialization for ONLINE_TEMPLATE and ONLINE_TEMPLATES arrays at the start of the create_lxc_container function to ensure variables are defined before use.	2026-01-21 12:42:19 +01:00
CanbiZ (MickLesk)	5384adf0c3	Remove unnecessary set +u/set -u in create_lxc_container only as test exist	2026-01-21 12:40:55 +01:00
CanbiZ (MickLesk)	ed18776710	fix unbound var: ONLINE_TEMPLATES	2026-01-21 12:39:57 +01:00
CanbiZ (MickLesk)	88557d53f4	core: allow empty tags & improve template search (#11020)	2026-01-21 12:01:34 +01:00
CanbiZ (MickLesk)	cb2141ebe2	Revert "Revert "core: add retry logic for template lock in LXC container crea…" (#11013) This reverts commit `7699f4f6ad`.	2026-01-20 23:41:53 +01:00
CanbiZ (MickLesk)	7699f4f6ad	Revert "core: add retry logic for template lock in LXC container creation (#1…" (#11011) This reverts commit `d71f24bddb`.	2026-01-20 23:35:39 +01:00
CanbiZ (MickLesk)	d71f24bddb	core: add retry logic for template lock in LXC container creation (#11002)	2026-01-20 22:36:32 +01:00
CanbiZ (MickLesk)	6dd5fbd7da	core: add input validations for several functions (#10995)	2026-01-20 15:03:14 +01:00
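
The sed expression quoted in commit bac7f07a74 above, `s/^[- ]*//`, can be tried in isolation. This is a minimal sketch only: the function name `strip_leading_dashes` and the sample strings are assumptions made for illustration and are not taken from the repository.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not from the repository): applies the expression quoted
# in the commit message to strip leading dashes and spaces from a description.
strip_leading_dashes() {
  # ^[- ]* matches any run of '-' or ' ' anchored at the start of the line;
  # dashes and spaces later in the string are left untouched.
  printf '%s\n' "$1" | sed 's/^[- ]*//'
}

strip_leading_dashes "- - Media server"   # prints: Media server
strip_leading_dashes "plain text"         # prints: plain text
```

Because the pattern is anchored with `^`, only the leading run is removed, so a description such as `x - y` passes through unchanged.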

329 Commits