(originally posted here: https://zig.news/liyu1981/play-with-new-comptime-var-rule-of-zig-0120-333k)
zig 0.12.0
is released, WoW! But as it is still on road to 1.0, there are plenty of breaking changes. One of them I want to explore more here is Comptime Memory Changes
. Or simply I call it new comptime var rule 🙂 It is a change will help zig
going to better next phase of incremental compiling while eliminating some tricky bugs according to the release note.
First what has changed in simple words?
I encourage you to read the official release note, but if you are hurry, simply below code will not compile in 0.12.0
(but it will compile in 0.11.0
)
pub fn main() !u8 {
comptime var x: u32 = 123; // mark 1
var ptr: *const u32 = undefined;
ptr = &x;
if (ptr.* != 123) return error.TestFailed;
return 0;
}
The reason for this will not compile because ptr
is a runtime known item, while x
is a comptime know variable, and make ptr
point to x
now is not allowed. Further reason is, if allowing this happening, because x
is variable, so zig
will not know when to stop comptime execution, so that zig
will not know when can start generating runtime code for ptr
becuase it depends on x
‘s comptime.
The solution is let us break the tie, tell zig
exactly when x
is finalized, like something below (see mark 1 changes). A small note is that when write a comptime const, we have to move the comptime
keyword to right hand value side and manually specify the type.
pub fn main() !u8 {
const x = comptime @as(u32, 123); // mark 1 var ptr: *const u32 = undefined;
ptr = &x; // mark 1
if (ptr.* != 123) return error.TestFailed;
return 0;
}
A more practical example and the pattern
Dealing with integer can illustrate the problem but will never the case in real life problem solving. So let us look at below example
const std = @import("std");
fn formatName(writer: anytype, name: anytype) !void {
const fmt = comptime brk: {
var fmt_buf: [32 * 4]u8 = undefined;
var fmt_len: usize = 0;
for (0..name.len) |i| {
if (i != 0) {
fmt_buf[fmt_len] = ' ';
fmt_len += 1;
}
@memcpy(fmt_buf[fmt_len .. fmt_len + 3], "{s}");
fmt_len += 3;
}
break :brk fmt_buf[0..fmt_len];
};
try writer.print(fmt, name);
}
pub fn main() !u8 {
var buf = std.ArrayList(u8).init(std.heap.page_allocator);
defer buf.deinit();
try formatName(buf.writer(), .{ "Samus", "Aran" });
std.debug.print("{s}\n", .{buf.items});
return 0;
}
The code should be simple enough to understand. It formats a name tuple by constructing a format string based on comptime known name tuple. But if we compie this, we will see
/home/yli/.asdf/installs/zig/0.12.0/lib/std/io.zig:324:48: error: runtime value contains reference to comptime var
return @errorCast(self.any().print(format, args));
^~~~~~
/home/yli/.asdf/installs/zig/0.12.0/lib/std/io.zig:324:48: note: comptime var pointers are not available at runtime
example3.zig:19:21: note: called from here
try writer.print(fmt, name);
~~~~~~~~~~~~^~~~~~~~~~~
referenced by:
main: example3.zig:45:19
callMain: /home/yli/.asdf/installs/zig/0.12.0/lib/std/start.zig:511:32
remaining reference traces hidden; use '-freference-trace' to see all reference traces
The last trace zig
gives us is that fmt
is referencing comptime var, but which var
, it does not say. So we have to look back into the comptime block generating fmt
, to figure out that it should be problem of fmt_buf
.
The how to fix this, seems not that obvious, because fmt_buf
is an array with capacity larger than its content, so though looks like stupid, copy again fmt_buf
will work. So fix above code like below
const std = @import("std");
fn formatName(writer: anytype, name: anytype) !void {
const fmt = comptime brk: {
var fmt_buf: [32 * 4]u8 = undefined;
var fmt_len: usize = 0;
for (0..name.len) |i| {
if (i != 0) {
fmt_buf[fmt_len] = ' ';
fmt_len += 1;
}
@memcpy(fmt_buf[fmt_len .. fmt_len + 3], "{s}");
fmt_len += 3;
}
var fmt_final: [fmt_len]u8 = undefined; // mark 1
for (0..fmt_len) |i| fmt_final[i] = fmt_buf[i];
break :brk fmt_final;
};
try writer.print(&fmt, name);
}
pub fn main() !u8 {
var buf = std.ArrayList(u8).init(std.heap.page_allocator);
defer buf.deinit();
try formatName(buf.writer(), .{ "Samus", "Aran" });
std.debug.print("{s}\n", .{buf.items});
return 0;
}
The extra copy to fmt_final
code start from mark 1, will help zig
to realize that it can throw fmt_buf
and finalized with fmt_final
as it is the only thing returned outside that block. after that everything is const known in comptime so it will be happy to continue.
With this example, we may have observed a pattern: if we need to compute something dynamically in comptime (so that we will use var
), there will need a finalizing phase like what we did above: copy the dynamic var to const. Release notes also gave good explanation of this pattern, but hopefully with above example, it helps us all to understand in practical scenario.
Next let us look at an even tricky example
example when zig
will not point us to the obvious spot of problem
const std = @import("std");
fn getFirstName() []const u8 {
comptime var buf: [5]u8 = undefined;
@memcpy(&buf, "hello");
return &buf;
}
fn getLastName() []const u8 {
comptime var buf: [5]u8 = undefined;
@memcpy(&buf, "world");
const final_name = buf;
return &final_name;
}
const Name = struct {
first: []const u8,
last: []const u8,
};
fn getName() []const []const u8 {
comptime {
var names: [2][]const u8 = undefined;
names[0] = getFirstName();
names[1] = getLastName();
const names_const = names;
return &names_const;
}
}
pub fn main() !u8 {
std.debug.print("{s} {s}\n", .{ getName()[0], getName()[1] });
return 0;
}
The final example is another simple program try to generate a Name
struct in compile time and then print it. You may ask why it makes sense? It is a simplified version of program generating complex structs in comptime, like genearting database table definition info from struct which will later be used for helping generating SQL. But with that scenario there is unnecessary noises so I created this example.
Compile above example we will get:
example2.zig:27:9: error: function called at runtime cannot return value at comptime
return &names_const;
^~~~~~~~~~~~~~~~~~~
referenced by:
main: example2.zig:32:51
callMain: /home/yli/.asdf/installs/zig/0.12.0/lib/std/start.zig:511:32
remaining reference traces hidden; use '-freference-trace' to see all reference traces
This time, zig
has pointed names_const
referencing comptime variables, but what we did is following the above pattern: we create an array, copy the values then assign to a const and return. So we have to follow the trace to find out var names
is the problem, and then follow to names[0]
which points to getFirstName
and then inside getFirstName
find actually must be the buf
we returned violated the comptime var rule. And then we can fix the problem like below
// only show getFirstName here
fn getFirstName() []const u8 {
comptime var buf: [5]u8 = undefined;
@memcpy(&buf, "hello");
const final_name = buf;
return &final_name;
}
zig
is not pointing us to the real location of where is wrong probably because when we use the array to store the values in comptime, it has lost the stack (because of array). I do not know whether this should be fixed (seems not very necessary), but it is confusing when we see above compile errors as it has not gave us the correct location. Hopefully with this example, this kind of error can be more easily addressed as we now know how to trace.
That’s all for now in playing with zig
‘s new comptime var rule. Hopefully this will help you!