I am writing code that needs to format a string, and I want to avoid buffer overruns.
I know that if vsnprintf
is available (C99 onwards) we can do:
char* formatString(const char *format, ...)
{
char* result = NULL;
va_list ap;
va_start(ap, format);
/* Get the size of the formatted string by getting vsnprintf return the
* number of remaining characters if we ask it to write 0 characters */
int size = vsnprintf(NULL, 0, format, ap);
if (size > 0)
{
/* String formatted just fine */
result = (char *) calloc(size + 1, sizeof(char));
vsnprintf(result, size + 1, format, ap);
}
va_end(ap);
return result;
}
I can't figure out a way of doing something similar in C90 (without vsnprintf
). If it turns out to not be possible without writing extremely complex logic I'd be happy to set a maximum length for the result, but I'm not sure how that could be achieved either without risking a buffer overrun.
Pre-C99 affords no simply solution to format strings with a high degree of safety of preventing buffer overruns.
It is those pesky "%s"
, "%[]"
, "%f"
format specifiers that require so much careful consideration with their potential long output. Thus the need for such a function. @Jonathan Leffler
To do so with those early compilers obliges code to analyze format
and the arguments to find the required size. At that point, code is nearly there to making you own complete my_vsnprintf()
. I'd seek existing solutions for that. @user694733.
Even with C99, there are environmental limits for *printf()
.
The number of characters that can be produced by any single conversion shall be at least 4095. C11dr §7.21.6.1 15
So any code that tries to char buf[10000]; snprintf(buf, sizeof buf, "%s", long_string);
risks problems even with a sufficient buf[]
yet with strlen(long_string) > 4095
.
This implies that a quick and dirty code could count the %
and the format length and make the reasonable assumption that the size needed does not exceed:
size_t sz = 4095*percent_count + strlen(format) + 1;
Of course further analysis of the specifiers could lead to a more conservative sz
. Continuing down this path we end at writing our own my_vsnprintf()
.
Even with your own my_vsnprintf()
the safety is only so good. There is no run-time check that the format
(which may be dynamic) matches the following arguments. To do so requires a new approach.
Cheeky self advertisement for a C99 solution to insure matching specifiers and arguments: Formatted print without the need to specify type matching specifiers using _Generic.