From: Alyssa Ross <hi@alyssa.is>
To: Demi Marie Obenour <demiobenour@gmail.com>
Cc: Spectrum OS Development <devel@spectrum-os.org>
Subject: Re: [PATCH 2/4] Generate makefile file lists from a script
Date: Mon, 08 Sep 2025 11:59:54 +0200
Message-ID: <87zfb5i7s5.fsf@alyssa.is>
In-Reply-To: <20250903-genfiles-v1-2-cc993fcb1e4c@gmail.com>

Demi Marie Obenour <demiobenour@gmail.com> writes:

> The script will always get them right, whereas humans (the author of
> this commit included) generally will not.
>
> Signed-off-by: Demi Marie Obenour <demiobenour@gmail.com>

I like the idea!

> ---
>  Documentation/development/built-in-vms.adoc |   7 ++
>  host/rootfs/Makefile                        | 107 ++------------------------
>  host/rootfs/file-list.mk                    | 100 ++++++++++++++++++++++++
>  img/app/Makefile                            |  74 +++---------------
>  img/app/file-list.mk                        |  65 ++++++++++++++++
>  lib/common.mk                               |   1 +
>  scripts/genfiles.awk                        | 115 ++++++++++++++++++++++++++++
>  vm/sys/net/Makefile                         |  51 +++---------
>  vm/sys/net/file-list.mk                     |  41 ++++++++++
>  9 files changed, 357 insertions(+), 204 deletions(-)
>
> diff --git a/Documentation/development/built-in-vms.adoc b/Documentation/development/built-in-vms.adoc
> index e90009ee5a3c2c254a7ae11e36121576b819eee7..82d78705a6020bbdb06fbc123a32dbdd6fd50085 100644
> --- a/Documentation/development/built-in-vms.adoc
> +++ b/Documentation/development/built-in-vms.adoc
> @@ -44,6 +44,13 @@ NOTE: As a special convenience, it's not necessary to run `make clean`
>  if the only change to the Nix files is modifying the packages
>  installed in the VM.
>  
> +The list of files used for the VM image is stored in a separate file,
> +`file-lists.mk`.  You can edit it manually when developing.  However,
> +after each commit that adds or removes files from it, you should run
> +`make update-file-list`, which will regenerate it from the output of
> +`git ls-files`.  Any changes you made will be lost.  This ensures
> +that the file lists are always in sync with the git repository.
> +

TBH editing it manually and then losing your changes is probably going
to be more of a footgun than anything else.  You can get the same result
by staging new files and rerunning the script, so I'd avoid mentioning
the manual editing option, especially given you have a "DO NOT EDIT"
comment.
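
A minimal sketch of why staging is enough, assuming a throwaway
repository and a hypothetical file name (image/etc/example.conf is made
up for illustration).  git ls-files reads the index, so a staged file
shows up in its output before any commit exists:

```shell
# Throwaway repository, for illustration only.
tmp=$(mktemp -d)
git -C "$tmp" init -q
mkdir -p "$tmp/image/etc"
: > "$tmp/image/etc/example.conf"   # hypothetical new file

# Untracked and unstaged: ls-files does not list it yet.
before=$(git -C "$tmp" ls-files -- image)

# Staging puts it in the index, which is what ls-files reports.
git -C "$tmp" add image/etc/example.conf
after=$(git -C "$tmp" ls-files -- image)

printf 'before: [%s]\nafter:  [%s]\n' "$before" "$after"
```

In the real tree the equivalent would be to `git add` the new file and
then rerun the generator, with no manual edit of the generated file.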

> -$(dest): ../../scripts/make-erofs.sh $(PACKAGES_FILE) $(addprefix image/,$(FILES)) $(BUILD_FILES) build/empty build/fifo
> +$(dest): ../../scripts/make-erofs.sh $(PACKAGES_FILE) $(addprefix image/,$(FILES)) $(BUILD_FILES) build/empty build/fifo file-list.mk

Given that we don't include Makefile as a dependency, it probably
doesn't make sense to depend on other included Makefile fragments
either?

> @@ -207,6 +111,11 @@ debug:
>  	    $(VMLINUX)
>  .PHONY: debug
>  
> +update-file-list:
> +	../../scripts/genfiles.awk image > file-list.mk
> +
> +.PHONY: update-file-list
> +

Given this doesn't use any features of Make, it probably makes more
sense to just run the script directly.  It could output into
file-list.mk by default for ergonomics.
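
One possible shape for that, as a sketch only: a small wrapper that
writes to file-list.mk unless told otherwise.  The function name, the
GENERATOR variable and the temporary-file handling are all assumptions
made for illustration, not part of the patch:

```shell
# update_file_list DIR [OUTPUT]
# Runs the generator on DIR and writes the result to OUTPUT
# (default: file-list.mk in the current directory).
# GENERATOR is a hypothetical override so the sketch can be tried
# without the real scripts/genfiles.awk.
update_file_list() {
    dir=$1
    out=${2:-file-list.mk}
    # Write to a temporary file first so a failed run does not leave
    # a truncated file list behind.
    tmp=$(mktemp "$out.XXXXXX") || return 1
    if "${GENERATOR:-../../scripts/genfiles.awk}" "$dir" > "$tmp"; then
        mv "$tmp" "$out"
    else
        rm -f "$tmp"
        return 1
    fi
}
```

Usage would then be `update_file_list image` from the directory that
holds the Makefile, with no make target involved.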

>  run: build/live.img $(EXT_FS) build/rootfs.verity.roothash
>  	@set -x && \
>  	ext="$$(mktemp build/spectrum-rootfs-extfs.XXXXXXXXXX.img)" && \
> diff --git a/host/rootfs/file-list.mk b/host/rootfs/file-list.mk
> new file mode 100644
> index 0000000000000000000000000000000000000000..0817887d0bb25ab47e777f6a130a3b6214b25f0f
> --- /dev/null
> +++ b/host/rootfs/file-list.mk
> @@ -0,0 +1,100 @@
> +# SPDX-License-Identifier: CC0-1.0
> +# SPDX-FileCopyRightText: Not Copyrightable (machine-written)

SPDX-FileCopy*r*ightText, and should probably say that you're the owner,
for consistency with e.g. lib/nixpkgs.default.nix, which is also
generated.  You at least made the template.

> +# Generated by scripts/genfile.awk, DO NOT EDIT!
> +override FILES ::= \

Our Makefiles are POSIX.  (Mostly because it's the only sensible way to
draw the line, and keep all the really advanced, easy-to-misuse GNU
stuff out.)
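
For illustration, assuming the objection is to the GNU `override`
directive and that a plain `=` macro assignment is acceptable, the
generator's output step could look roughly like this.  emit_macro is a
hypothetical helper, not something in the patch:

```shell
# emit_macro NAME
# Reads one path per line on stdin and prints a make macro assignment
# using plain "=", with one backslash-continued path per line.
emit_macro() {
    printf '%s =' "$1"
    while IFS= read -r path; do
        # Backslash-newline continuation, then a tab-indented path.
        printf ' \\\n\t%s' "$path"
    done
    printf '\n'
}

printf 'etc/fstab\netc/passwd\n' | emit_macro FILES
```

This prints `FILES = \` followed by one tab-indented path per line,
which avoids both `override` and `::=`.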

> diff --git a/lib/common.mk b/lib/common.mk
> index 277c3544036d9a9057f8ba4ad37fe2207548cc59..0a03ff440cc671264d2b859a2ae048db9252d047 100644
> --- a/lib/common.mk
> +++ b/lib/common.mk
> @@ -1,5 +1,6 @@
>  # SPDX-License-Identifier: EUPL-1.2+
>  # SPDX-FileCopyrightText: 2021, 2023, 2025 Alyssa Ross <hi@alyssa.is>
> +# SPDX-FileCopyrightText: 2025 Demi Marie Obenour <demiobenour@gmail.com>
>  
>  BACKGROUND = background
>  CPIO = cpio

This change looks like an accident?

> diff --git a/scripts/genfiles.awk b/scripts/genfiles.awk
> new file mode 100755
> index 0000000000000000000000000000000000000000..62863e78f157f1d9a0f6dbdb0f4380db9c9d48cb
> --- /dev/null
> +++ b/scripts/genfiles.awk
> @@ -0,0 +1,115 @@
> +#!/usr/bin/env -S LC_ALL=C LANGUAGE=C awk -E
> +# SPDX-License-Identifier: EUPL-1.2+
> +# SPDX-FileCopyrightText: 2025 Demi Marie Obenour <demiobenour@gmail.com>
> +function check_status(status) {
> +	if (status < 0) {
> +		printf "FATAL: getline: %s\n", status > "/dev/stderr";
> +		exit 1;
> +	}
> +	return status;
> +}
> +
> +function check_close(value,    status) {
> +	status = check_status(close(value));
> +	if (status != 0) {
> +		printf "FATAL: command exited with status %d\n", status > "/dev/stderr";
> +		exit status;
> +	}
> +}
> +
> +function shell_quote(command) {
> +	gsub(/'/, "'\\\\&'", command);
> +	return ("'" command "'");
> +}
> +
> +function get(command,          line, path, array_index, inode_type, mode, modes, symlink_count, symlinks, file_count, files, rc_count, rc_files, is_license, is_rc) {
> +	file_count = 0;
> +	symlink_count = 0;
> +	rc_count = 0;
> +	modes["120000"] = "symlink";
> +	modes["040644"] = "directory";
> +	modes["040755"] = "directory";
> +	modes["100644"] = "regular";
> +	modes["100755"] = "regular";
> +	print "# SPDX-License-Identifier: CC0-1.0";
> +	print "# SPDX-FileCopyRightText: Not Copyrightable (machine-written)";
> +	print "# Generated by scripts/genfile.awk, DO NOT EDIT!";
> +	while (check_status(command | getline line)) {
> +		if (line !~ /^[0-7]{6}\t/) {
> +			# this is a git bug
> +			print "FATAL: git ls-files output didn't start with a valid mode" > "/dev/stderr";
> +			exit 1;
> +		}
> +		path = substr(line, 8);
> +		if (path !~ /^[ -~]+$/) {
> +			# also a git bug
> +			print "FATAL: git ls-files didn't quote properly" > "/dev/stderr";
> +			exit 1;
> +		}
> +		if (path ~ /^\/|((^|\/)\.{0,2}($|\/))/) {
> +			# also a git bug
> +			printf "FATAL: git ls-files output non-canonical path '%s'\n", path > "/dev/stderr";
> +			exit 1;
> +		}
> +		if (path !~ /^[[:alnum:]_.+@/-]+$/) {
> +			printf "FATAL: filename '%s' has forbidden characters\n", path > "/dev/stderr";
> +			exit 1;
> +		}

I feel like this could be a lot nicer if we ran git ls-files outside
awk, and could then use awk's nice top-level pattern matching syntax?
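
A rough sketch of what that might look like, not a full port of the
patch: the input arrives on stdin as "<mode><TAB><path>" lines, and
each check becomes a top-level pattern-action rule.  The function name
and the "file"/"rc"/"link" output tags are made up for illustration,
and the canonical-path check and the symlink-specific checks from the
patch are left out for brevity:

```shell
# classify_paths: read "<mode>\t<path>" lines on stdin and print
# "<kind>\t<path>", where kind is file, rc or link.
# Exits non-zero on the first line that fails a check.
classify_paths() {
    awk '
        BEGIN { FS = "\t"; OFS = "\t" }
        function die(msg) { print "FATAL: " msg > "/dev/stderr"; exit 1 }

        # Validation rules: each one is just a pattern and an action.
        $1 !~ /^[0-7][0-7][0-7][0-7][0-7][0-7]$/ { die("invalid mode in: " $0) }
        $2 !~ /^[A-Za-z0-9_.+@\/-]+$/ { die("forbidden characters in: " $2) }

        # Classification rules, first match wins.
        $1 ~ /^100/ && $2 ~ /\.license$/ { next }
        $1 ~ /^100/ && $2 ~ /^etc\/s6-rc\// { print "rc", $2; next }
        $1 ~ /^100/ { print "file", $2; next }
        $1 == "120000" { print "link", $2; next }
        { die("unknown mode " $1 " for " $2) }
    '
}
```

It would be fed by the same git invocation the patch already uses, for
example:

    tab=$(printf '\t')
    git -C image ls-files "--format=%(objectmode)${tab}%(path)" -- . | classify_paths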

> +		mode = modes[substr(line, 1, 6)];
> +		is_license = path ~ /\.license$/;
> +		is_rc = path ~ /^etc\/s6-rc\//;
> +		if (mode == "regular") {
> +			if (is_license) {
> +				continue;
> +			}
> +			if (is_rc) {
> +				rc_count += 1;
> +				rc_files[rc_count] = path;
> +			} else {
> +				file_count += 1;
> +				files[file_count] = path;
> +			}
> +			continue;
> +		}
> +		if (mode == "symlink") {
> +			if (is_rc) {
> +				printf "FATAL: symlink in s6-rc-compile input: %s\n", path;
> +				exit 1;
> +			}
> +			symlink_count += 1;
> +			symlinks[symlink_count] = path;
> +		} else if (mode != "directory") {
> +			printf "FATAL: file %s has unknown mode %s\n", path, substr(line, 1, 6) > "/dev/stderr";
> +			exit 1;
> +		}
> +		if (is_license) {
> +			printf "FATAL: %s (type %s) ends in .license\n", path, mode > "/dev/stderr";
> +			exit 1;
> +		}
> +	}
> +	check_close(command);
> +
> +	printf "override FILES ::=";
> +	for (array_index = 1; array_index <= file_count; array_index += 1) {
> +		printf " \\\n\t%s", files[array_index];
> +	}
> +	printf ("\n\n" \
> +"# These are separate because they need to be included, but putting\n" \
> +"# them as make dependencies would confuse make.\n" \
> +"override LINKS ::=");
> +	for (array_index = 1; array_index <= symlink_count; array_index += 1) {
> +		printf " \\\n\t%s", symlinks[array_index];
> +	}
> +	printf "\n\noverride S6_RC_FILES ::=";
> +	for (array_index = 1; array_index <= rc_count; array_index += 1) {
> +		printf " \\\n\t%s", rc_files[array_index];
> +	}
> +	printf "\n"
> +}
> +
> +BEGIN {
> +	RS = "\n";
> +	FS = "\t";
> +	get("set -euo pipefail && { git -c core.quotePath=true -C " shell_quote(ARGV[1]) " ls-files '--format=%(objectmode)\t%(path)' -- .|sort -t '\t' -k 2;}");
> +	exit 0;
> +}
