Infrastructure-as-Code (IaC) blueprint
Here is the exact step-by-step procedure for provisioning a brand new hardware box:
Phase 1: Base Operating System Installation
Flash the OS: Download the Ubuntu Server 24.04 LTS ISO from the official Ubuntu website and flash it to a bootable USB drive (using a tool like Rufus or BalenaEtcher). Install Ubuntu: Boot the new hardware from the USB drive. During the installation: Select the default Ubuntu Server (do not install a desktop environment/GUI; it consumes valuable RAM and VRAM). Ensure you check the box to Install OpenSSH server so you can connect to it remotely. Set up your primary networking (Ethernet or Wi-Fi).
Phase 2: Execute the Appliance Blueprint
Transfer the Script: Securely copy the provision_appliance.sh script to your new server using SCP or SFTP: Execute as Root: SSH into the new server, make the script executable, and run it with superuser privileges: Reboot: Once the script completes successfully, reboot the server (sudo reboot). This ensures the machine boots with the newly built Linux Kernel modules, the proprietary NVIDIA drivers, and the optimized Wi-Fi 7 network stack (sysctl changes).
Phase 3: Application Code Deployment
Deploy the Backend: Clone your FastAPI Python backend code into the directory specified by the blueprint: /opt/orphey/backend. Deploy the Database/Cache Subsystems: Ensure PostgreSQL and Redis are initialized. Configure your Qdrant vector database storage paths. Download AI Weights: Download your target LLM weights (e.g., Llama 3) for the vLLM engine to load into the RTX 5090's VRAM.
Commentaires
Enregistrer un commentaire