Skip to content

Commit 0d3dab5

Browse files
committed
2024-10 sdarwin summary
1 parent 129c5a3 commit 0d3dab5

File tree

1 file changed

+68
-0
lines changed

1 file changed

+68
-0
lines changed

_posts/2024-10-21-SamsQ3Update.md

Lines changed: 68 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,68 @@
1+
---
2+
layout: post
3+
nav-class: dark
4+
categories: sam
5+
title: Cloud and Infrastructure Update Q3 2024
6+
author-id: sam
7+
---
8+
9+
### Boost release process boostorg/release-tools
10+
11+
In the previous quarter, publish_release.py included features to support the Fastly CDN at archives.boost.io. This quarter, that functionality was put into action with the release of Boost 1.86.0, and it was a chance to fine-tune and improve the script. More error checking. Adding a preflight phase to test SSH. Adjusting the publish_release.py script to stage windows executables for Tom Kent, so they are relocated to a publicly visible folder during a release.
12+
Generally the file upload target is AWS S3, and from there the CDN origin servers download the archives.
13+
Revamping build_docs scripts: add python venv to mac and windows. Support macos-14.
14+
Briefly investigating docca - python issue that affected boost builds.
15+
Deployed 24.04 version of docker image for the main boostorg/boost jobs. Fixed zip and 7z failures appearing on 24.04.
16+
17+
### Boost website boostorg/website-v2
18+
19+
Composed a cost inventory spreadsheet of the new infrastructure. Debugging an outage of the site that was traced back to redis -> django-health-check -> celery. Frank Wiles from Revsys ultimately solved this puzzle by adjusting a celery configuration variable.
20+
21+
Wrote local website development bootstrap scripts that will install all prerequisites for local development and then even launch docker-compose. Versions for mac, windows, linux. Updated the corresponding documentation, such that it's equally feasible to go through the steps manually to see what's being installed and then launch docker-compose from the command-line.
22+
23+
Researched how to selectively purge the Fastly CDN cache - and specifically applying that to /release/ on boost.io.
24+
25+
### Mailman project
26+
27+
Revising runbook (steps to go-live in the future).
28+
Incorporated Greg's changes to install a boost.io header on the mailing lists. Reduced and consolidated those files. Created ansible deployment for that feature. Now temporarily removing customizations before deployment. They may be returned later.
29+
Upgraded the operating system on all mm3 test instances. Tested. Switched the search engine from Elastic Search to Xapian, which is better supported in terms of the Django modules.
30+
Submitted upstream pull requests (merged) which
31+
- document an improved Xapian installation method
32+
- further document how the core and web components interact in terms of the db
33+
34+
### Slack
35+
36+
Discussing slackbot implementations with Kenneth.
37+
38+
### wowbagger
39+
40+
Documented and published issues about the various problems on the legacy web server.
41+
Contributed to Kenneth's docker-compose strategy for the original boost.org website to allow local development via docker-compose, by downloading boost archives so the local environment is functional.
42+
Worked with Rob to include Plausible analytics on boost.org.
43+
44+
### Jenkins
45+
46+
Investigate/repair doc builds of mrdocs, http_proto.
47+
Modify doc builds of beast, url.
48+
Generated PR doc builds of safe-cpp. Added a GHA step to render the html upon commit.
49+
Upgraded the Jenkins executable, and plugins.
50+
Set up previews of boostlook. master/develop and PRs.
51+
52+
### JSON Benchmarks
53+
54+
After experimenting on a Hetzner server, switched JSON Benchmarks to a new Xeon processor from KnownHost. Intel core processors are aimed at the consumer market while Xeon is a server architecture and is more consistent when running benchmark tests. Replaced Jenkins runner, canceled previous server.
55+
56+
### GHA
57+
58+
Debugging certain boost library jobs. Also, with the hosted runners, determined there was a systematic problem that the bootprocess was timing out too quickly. Adjusted terraform settings and redeployed. Would be good to propose a PR upstream to terraform: the timeout is too fast.
59+
Enabled billing for math-cuda gpu tests.
60+
Macminivault billing issues.
61+
Resizing terraform runner Windows 2022, to add 30GB more disk space, and Pagefile (memory).
62+
Built new Ubuntu runners, newer kernel, adjusting OS versions on boostorg/unordered GHA to fix sanitizers.
63+
64+
### Drone
65+
66+
Assisted developers in debugging jobs.
67+
Scripted docker image cleanup on drone instances.
68+
Installed a cron job to clear the autoscaler, solving an issue that occasionally jobs get stuck in pending mode, preventing the instances from scaling.

0 commit comments

Comments
 (0)