patches and low-level development discussion
 help / color / mirror / code / Atom feed
From: Alyssa Ross <hi@alyssa.is>
To: Demi Marie Obenour <demiobenour@gmail.com>
Cc: Spectrum OS Development <devel@spectrum-os.org>
Subject: Re: [PATCH v4] Generate file lists from a script
Date: Thu, 25 Sep 2025 13:22:55 +0200	[thread overview]
Message-ID: <87tt0qbwts.fsf@alyssa.is> (raw)
In-Reply-To: <20250921-genfiles-v4-1-4375bda78707@gmail.com>

[-- Attachment #1: Type: text/plain, Size: 5074 bytes --]

Demi Marie Obenour <demiobenour@gmail.com> writes:

> Right now, the makefiles in host/rootfs, vm/sys/net, and img/app have
> manually-maintained lists of files and symlinks.  These duplicate the
> information in the git repository and can easily get out of sync or
> cause unnecessary merge conflicts.  Fix all of these issues by having
> the git repository be the source of truth, and using a script to
> generate the file lists.  Developers can regenerate the lists before
> every commit, or even add a git hook to do that.
>
> Signed-off-by: Demi Marie Obenour <demiobenour@gmail.com>

> diff --git a/scripts/genfiles.awk b/scripts/genfiles.awk
> new file mode 100644
> index 0000000000000000000000000000000000000000..891ad162ea9748e275f7a048db3acbd9e7895a9b
> --- /dev/null
> +++ b/scripts/genfiles.awk
> @@ -0,0 +1,90 @@
> +# SPDX-License-Identifier: EUPL-1.2+
> +# SPDX-FileCopyrightText: 2021-2024 Alyssa Ross <hi@alyssa.is>
> +# SPDX-FileCopyrightText: 2025 Demi Marie Obenour <demiobenour@gmail.com>
> +BEGIN {
> +	RS = "\n";
> +	FS = "\t";
> +	modes["120000"] = "symlink";
> +	modes["100644"] = "regular";
> +	modes["100755"] = "regular";
> +}
> +
> +function fail(msg) {
> +	exit_code = 1;

This line doesn't do anything now, right?

> +	print msg > "/dev/stderr";
> +	exit 1;
> +}
> +
> +done { fail("Junk after DONE", 1); }
> +
> +$0 == "DONE" {
> +	done = 1;
> +	next;
> +}
> +
> +# Extract data from built-in variables.
> +{
> +	filename = $2;
> +	raw_mode = $1;
> +	# awk autocreates empty string entries if the key is invalid,
> +	# but the code exits in this case so that is okay.
> +	mode = modes[raw_mode];
> +}
> +
> +filename !~ /^[[:alnum:]_.+@/-]+$/ {
> +	fail("filename '" filename "' has forbidden characters");
> +}
> +
> +# Skip license files
> +/\.license$/ { next }
> +
> +filename ~ /^image\/etc\/s6-rc\// {
> +	if (mode != "regular") {
> +		fail("s6-rc-compile input '" filename "' isn't a regular file");
> +	}
> +	rc_files[rc_count++] = filename;
> +	next;
> +}
> +
> +mode == "symlink" {
> +	symlinks[symlink_count++] = filename;
> +	next;
> +}
> +
> +mode == "regular" {
> +	files[file_count++] = filename;
> +	next;
> +}
> +
> +{ fail("File '" filename "' is not regular file or symlink (mode " raw_mode ")"); }
> +
> +END {
> +	if (exit_code) {
> +		exit exit_code;
> +	}
> +	if (!done) {
> +		fail("Did not receive DONE line");
> +	}
> +	printf ("# SPDX-License-Identifier: EUPL-1.2+\n" \
> +"# SPDX-FileCopyrightText: 2021-2024 Alyssa Ross <hi@alyssa.is>\n" \
> +"# Generated by scripts/genfile.sh.  Any changes will be overwritten.\n" \
> +"FILES =") > out_file;
> +	for (array_index = 0; array_index < file_count; array_index += 1) {
> +		printf " \\\n\t%s", files[array_index] > out_file;
> +	}
> +	printf ("\n\n" \
> +"# These are separate because they need to be included, but putting\n" \
> +"# them as make dependencies would confuse make.\n" \
> +"LINKS =") > out_file;
> +	for (array_index = 0; array_index < symlink_count; array_index += 1) {
> +		printf " \\\n\t%s", symlinks[array_index] > out_file;
> +	}
> +	printf "\n\nS6_RC_FILES =" > out_file;
> +	for (array_index = 0; array_index < rc_count; array_index += 1) {
> +		printf " \\\n\t%s", rc_files[array_index] > out_file;
> +	}
> +	print > out_file;
> +	if (close(out_file)) {
> +		fail("Cannot close output file: " ERRNO);
> +	}
> +}
> diff --git a/scripts/genfiles.sh b/scripts/genfiles.sh
> new file mode 100755
> index 0000000000000000000000000000000000000000..65e8b56654448f4c9529e00807e68adb0bcfefbf
> --- /dev/null
> +++ b/scripts/genfiles.sh
> @@ -0,0 +1,28 @@
> +#!/bin/sh --
> +set -euo pipefail
> +export LC_ALL=C LANGUAGE=C
> +# shell strips trailing newlines, so add something after the newline
> +dir=$(git rev-parse --show-toplevel && echo a)
> +cd "${dir%'
> +a'}"

What's this for?  In case the directory name ends with a newline?  All
sorts of stuff is going to break if somebody decides to do this.  We
don't need to go out of our way to accomodate it.

> +for i in host/rootfs img/app vm/sys/net; do
> +    output_file=$i/file-list.mk
> +    {
> +	git -C "$i" -c core.quotePath=true ls-files $'--format=%(objectmode)\t%(path)' -- image |
> +	sort -t $'\t' -k 2
> +	echo DONE

I still don't understand what the DONE is for.  Can you describe a
circumstance in which it would be necessary?

> +    } |
> +    awk -v "out_file=$output_file.tmp" -f scripts/genfiles.awk

This was unresolved from last time too.  This could just be stdout and a
simpler awk script.  If you really want to make sure a temporary file
isn't left around if something goes wrong, you could trap EXIT, but it's
also just really not a big deal.

> +    if [ -f "$output_file" ]; then
> +	    # Avoid changing output file if it is up to date, as that
> +	    # would cause unnecessary rebuilds.
> +	    if cmp -s -- "$output_file.tmp" "$output_file"; then
> +		    rm -- "$output_file.tmp"
> +		    continue
> +	    else
> +		    astatus=$?
> +		    if [ "$astatus" != 1 ]; then exit "$astatus"; fi
> +	    fi
> +    fi
> +    mv -- "$output_file.tmp" "$output_file"
> +done

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 227 bytes --]

  reply	other threads:[~2025-09-25 11:23 UTC|newest]

Thread overview: 51+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-09-04  1:56 [PATCH 0/4] Generate file lists from a script Demi Marie Obenour
2025-09-04  1:56 ` [PATCH 1/4] Move all files for the image into a subdirectory Demi Marie Obenour
2025-09-04  1:56 ` [PATCH 2/4] Generate makefile file lists from a script Demi Marie Obenour
2025-09-08  9:59   ` Alyssa Ross
2025-09-08 18:45     ` Demi Marie Obenour
2025-09-09 14:51       ` Alyssa Ross
2025-09-04  1:56 ` [PATCH 3/4] Common make rules for building erofs images Demi Marie Obenour
2025-09-08 10:01   ` Alyssa Ross
2025-09-08 18:53     ` Demi Marie Obenour
2025-09-09 14:56       ` Alyssa Ross
2025-09-04  1:56 ` [PATCH 4/4] Use /etc/s6-rc/compiled for compiled s6-rc directory Demi Marie Obenour
2025-09-10  5:29 ` [PATCH v2 0/3] Generate file lists from a script Demi Marie Obenour
2025-09-10  5:29   ` [PATCH v2 1/3] Move all files for the image into a subdirectory Demi Marie Obenour
2025-09-10 18:58     ` Alyssa Ross
2025-09-11 12:21       ` Demi Marie Obenour
2025-09-10  5:29   ` [PATCH v2 2/3] Generate makefile file lists from a script Demi Marie Obenour
2025-09-10  5:29   ` [PATCH v2 3/3] Common make rules for building erofs images Demi Marie Obenour
2025-09-11 12:47   ` [PATCH v3 0/4] Generate file lists from a script Demi Marie Obenour
2025-09-11 12:47     ` [PATCH v3 1/4] Do not ignore errors from tar Demi Marie Obenour
2025-09-17 11:48       ` Alyssa Ross
2025-09-18  2:45         ` Demi Marie Obenour
2025-09-19  7:46           ` Alyssa Ross
2025-09-30 12:59             ` Alyssa Ross
2025-09-19  7:55       ` Alyssa Ross
2025-09-19 19:03         ` Demi Marie Obenour
2025-09-11 12:47     ` [PATCH v3 2/4] Move all files for the image into a subdirectory Demi Marie Obenour
2025-09-17 12:30       ` Alyssa Ross
2025-09-17 12:39       ` Alyssa Ross
2025-09-17 13:03       ` Alyssa Ross
2025-09-11 12:47     ` [PATCH v3 3/4] Generate makefile file lists from a script Demi Marie Obenour
2025-09-11 12:47     ` [PATCH v3 4/4] Common make rules for building erofs images Demi Marie Obenour
2025-09-21  2:23   ` [PATCH v3] Generate file lists from a script Demi Marie Obenour
2025-09-21  8:47     ` Alyssa Ross
2025-09-21 16:51       ` Demi Marie Obenour
2025-09-21 17:07         ` Alyssa Ross
2025-09-21 17:24     ` [PATCH v4] " Demi Marie Obenour
2025-09-25 11:22       ` Alyssa Ross [this message]
2025-09-26 16:31       ` [PATCH v5] " Demi Marie Obenour
2025-09-27  8:19         ` Alyssa Ross
2025-09-27  8:42           ` Demi Marie Obenour
2025-09-27 16:22         ` [PATCH v6] " Demi Marie Obenour
2025-09-29  8:12           ` Alyssa Ross
2025-09-29 17:20             ` Demi Marie Obenour
2025-09-29 17:18           ` [PATCH v7] " Demi Marie Obenour
2025-10-01  9:20             ` Alyssa Ross
2025-10-01  9:24               ` Demi Marie Obenour
2025-10-01  9:35             ` Alyssa Ross
2025-10-01 18:30             ` [PATCH v8] " Demi Marie Obenour
2025-10-02  9:46               ` Alyssa Ross
2025-10-02 17:37               ` [PATCH v9] " Demi Marie Obenour
2025-10-03  9:04                 ` Alyssa Ross

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=87tt0qbwts.fsf@alyssa.is \
    --to=hi@alyssa.is \
    --cc=demiobenour@gmail.com \
    --cc=devel@spectrum-os.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://spectrum-os.org/git/crosvm
	https://spectrum-os.org/git/doc
	https://spectrum-os.org/git/mktuntap
	https://spectrum-os.org/git/nixpkgs
	https://spectrum-os.org/git/spectrum
	https://spectrum-os.org/git/ucspi-vsock
	https://spectrum-os.org/git/www

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).