How it started

#include <stdio.h>

int main(void)
{
    int sum = 0, i = 0;
    char input[5];

    while (1) {
        sum = 0;
        scanf("%s", input);
        for (i = 0; input[i] != '\0'; i++)
            sum = sum*10 + input[i] - '0';
        printf("input=%d\n", sum);
    }
    return 0;
}

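I ran into the code above while reading Chapter 10 (gdb), Section 3 "Watchpoints" of Linux C编程一站式学习. Its purpose is simple: convert a string of digits read from the input device into an integer and print it. The book exposes the problem with the following run: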

$ ./main
123
input=123
67
input=67
12345
input=123407

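The book's explanation: in memory, the local variable i sits right after input[4], so input[5] refers to i. Typing 12345 therefore assigns the elements of input, and then i, as follows: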

input[0] = '1';
input[1] = '2';
input[2] = '3';
input[3] = '4';
input[4] = '5';
i = '\0';

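Here i receives '\0' because scanf("%s") stores a terminating '\0' after the characters it reads. The %s conversion also stops at the first whitespace character, so the trailing \n is not stored.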
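To make the terminator behaviour concrete, here is a minimal sketch that runs the same %s conversion through sscanf on an in-memory string instead of the keyboard. The helper name bytes_stored_by_percent_s and the 16-byte buffer are assumptions made for this illustration; they are not part of the book's program.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Reads one whitespace-delimited token from text into buf (capacity 16).
   Returns how many bytes %s stored, counting the terminating '\0',
   or 0 if no token could be read. */
size_t bytes_stored_by_percent_s(const char *text, char buf[16])
{
    assert(text != NULL && buf != NULL);

    /* Fill the buffer with a marker byte so the stored '\0' is visible. */
    memset(buf, 0x7f, 16);

    /* The width 15 leaves room for the terminator in a 16-byte buffer. */
    if (sscanf(text, "%15s", buf) != 1)
        return 0;

    return strlen(buf) + 1;
}
```

For the text "12345\n" the helper reports 6 bytes: five digits plus the terminator, with the newline left unread. Six bytes is one more than input[5] can hold, which is exactly the overflow discussed here.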

Now look at the loop:

for (i = 0; input[i] != '\0'; i++)

The loop condition is input[i] != '\0', but with this input the array itself contains no '\0', so the loop reads past the end of the array.

The book then steps through the program in GDB. Part of the session:

(gdb) n
11      for (i = 0; input[i] != '\0'; i++)
(gdb) p sum
$3 = 12345
(gdb) n
12        sum = sum*10 + input[i] - '0';
(gdb) x/7b input
0xbfb8f0a7:  0x31  0x32  0x33  0x34  0x35  0x05  0x00

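The byte at input[5] is the lowest byte of i, which holds 0x05 at this point, and the byte after it is 0x00. The loop therefore runs one extra iteration, computing 12345*10 + 0x05 - '0' = 123407, and then stops.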
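This arithmetic can be replayed without relying on undefined behaviour. The sketch below models the frame the book describes as a plain byte array: five bytes for input followed immediately by the bytes of i. The names simulate, load_i and store_i are hypothetical, and the result assumes a little-endian machine with a 4-byte int, as on x86.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Model of the frame: input[0..4], then the bytes of i. */
enum { INPUT_LEN = 5, FRAME_LEN = INPUT_LEN + (int)sizeof(int) };

/* Read the modelled variable i out of the frame. */
static int load_i(const unsigned char *frame)
{
    int i;
    memcpy(&i, frame + INPUT_LEN, sizeof i);
    return i;
}

/* Write the modelled variable i back into the frame. */
static void store_i(unsigned char *frame, int i)
{
    memcpy(frame + INPUT_LEN, &i, sizeof i);
}

/* Replays scanf("%s", input) followed by the summing loop. */
int simulate(const char *typed)
{
    unsigned char frame[FRAME_LEN] = {0};
    int sum = 0;
    size_t n;

    assert(typed != NULL);

    /* Like %s: copy the characters and the terminator without checking
       the size of input. The copy is capped at the size of the model so
       that this sketch itself stays in bounds. */
    n = strlen(typed) + 1;
    if (n > (size_t)FRAME_LEN)
        n = (size_t)FRAME_LEN;
    memcpy(frame, typed, n);

    /* for (i = 0; input[i] != '\0'; i++) with i living inside the frame. */
    store_i(frame, 0);
    while (load_i(frame) < FRAME_LEN && frame[load_i(frame)] != '\0') {
        sum = sum * 10 + frame[load_i(frame)] - '0';
        store_i(frame, load_i(frame) + 1);
    }
    return sum;
}
```

Under those assumptions simulate("12345") returns 123407, while inputs of at most four characters come back unchanged because their terminator still fits inside input.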

But is it really that simple?

Spotting the problem

When I debugged the code above in CLion, it behaved perfectly normally!

12345
input=12345
114514
input=114514
1919810
input=1919810
^C

What's going on?

I looked at the previous page of the book and found that the author compiled and ran the program like this:

$ gcc main.c -g -o main
$ ./main

Fine, I'll do the same.

?

Strange: everything still works. Is this bug really that fragile?

At this point I was a bit lost, so I checked my GCC and Linux versions:

$ uname -a
Linux sakimidare-arch 6.16.2-arch1-1 #1 SMP PREEMPT_DYNAMIC Wed, 20 Aug 2025 21:43:45 +0000 x86_64 GNU/Linux

$ gcc --version
gcc (GCC) 15.2.1 20250813
Copyright © 2025 Free Software Foundation, Inc.
本程序是自由软件；请参看源代码的版权声明。本软件没有任何担保；
包括没有适销性和某一专用目的下的适用性担保。

All right, how about another compiler? Surely the bug hasn't simply vanished on modern machines...

$ clang main.c -o main
$ ./main
12345
input=123407
666666
input=666617
123456789
input=123407
114514
input=114467

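There's the bug! Sometimes code that fails is more instructive than code that happens to work but is unreliable.

People often joke that once code runs, you should leave it alone. In production that is probably an unavoidable compromise. Everyone has to make a living, and nobody can spend a lifetime studying a piece of code that already works. If all your time goes into studying one piece of code, the efficiency the program gains may not be worth the time invested.

But we are not in production; we are learning. If, while learning and trying to understand a language, we don't dig into what each line means and instead numb ourselves with "it runs, so leave it alone", our programs will inevitably carry hazards we cannot see. In production this shows up as the excuse "it works on my machine, your environment must be misconfigured"; in learning it shows up as the bad habit of knowing that something works without knowing why.

These seemingly trivial bits of laziness quietly add up to gaps in our thinking and leave us unused to examining a problem from all sides. Over the years, that only leads to regret.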

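Why does this happen?

OK, OK, that was a digression. Let's see why the programs produced by these two compilers behave differently.

As we know, getting from a .c source file to an executable takes four steps:

Preprocessing
Compilation
Assembling
Linking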

0. Preprocessing

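Preprocessing means:

expanding header files for #include;
replacing macros for #define;
evaluating the conditional-compilation directives #if, #ifdef and #ifndef;
removing comments.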
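As a small illustration of those four jobs, the sketch below packs one of each into a few lines. The names BUF_LEN, SQUARE, USE_LARGE_BUFFER, CAPACITY and capacity_squared are made up for this example and do not appear in main.c.

```c
#include <assert.h>            /* #include: the header text is pasted in here */

#define BUF_LEN 5              /* object-like macro: replaced by the token 5 */
#define SQUARE(x) ((x) * (x))  /* function-like macro: expanded textually */

/* Conditional compilation: only one of the two definitions survives.
   USE_LARGE_BUFFER is not defined here, so the #else branch is kept. */
#ifdef USE_LARGE_BUFFER
enum { CAPACITY = 64 };
#else
enum { CAPACITY = BUF_LEN };
#endif

/* static_assert comes from <assert.h> (C11) and is checked at compile time. */
static_assert(CAPACITY > 0, "capacity must be positive");

int capacity_squared(void)
{
    /* After preprocessing this line reads: return ((CAPACITY) * (CAPACITY));
       and this comment is gone. */
    return SQUARE(CAPACITY);
}
```

Running the file through the -E option shows the result of each step: the header contents appear in place of the #include line, the macros are expanded, the unused branch disappears, and the comments are removed.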

Let's preprocess main.c with both gcc and clang and see how the resulting files differ!

$ gcc -E main.c -o gcc.i
$ clang -E main.c -o clang.i

The main() part turns out to be identical in both files:

int main(void)
{
    int sum = 0, i = 0;
    char input[5];

    while (1) {
        sum = 0;
        scanf("%s", input);
        for (i = 0; input[i] != '\0'; i++)
            sum = sum*10 + input[i] - '0';
        printf("input=%d\n", sum);
    }
    return 0;
}

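That matches our expectation: preprocessing only performs textual substitution and does not change the function's logic.

Above main() there are more than eight hundred lines of code, and they differ between the two compilers. We'll set that aside for now, because the bug depends mainly on input and i being adjacent in memory; intuitively, the header files are not where the problem lies.

Let's put these two files aside and move on.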

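1. Compilation

NOTE
The assembly here is x86-64 in GNU assembler syntax for Linux; it cannot be run directly on Windows.

Compilation translates the preprocessed C code into assembly. This step includes syntax checking, semantic analysis, optimization and so on, so we have good reason to suspect that the two compilers did something different here and produced assembly with different behavior.

Let's go!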

$ gcc -S gcc.i -o gcc.s
$ clang -S clang.i -o clang.s

Here are both files in full, gcc.s first and then clang.s:

  .file  "main.c"
  .text
  .section  .rodata
.LC0:
  .string  "%s"
.LC1:
  .string  "input=%d\n"
  .text
  .globl  main
  .type  main, @function
main:
.LFB0:
  .cfi_startproc
  pushq  %rbp
  .cfi_def_cfa_offset 16
  .cfi_offset 6, -16
  movq  %rsp, %rbp
  .cfi_def_cfa_register 6
  subq  $32, %rsp
  movq  %fs:40, %rax
  movq  %rax, -8(%rbp)
  xorl  %eax, %eax
  movl  $0, -24(%rbp)
  movl  $0, -20(%rbp)
.L4:
  movl  $0, -24(%rbp)
  leaq  -13(%rbp), %rax
  leaq  .LC0(%rip), %rdx
  movq  %rax, %rsi
  movq  %rdx, %rdi
  movl  $0, %eax
  call  __isoc23_scanf@PLT
  movl  $0, -20(%rbp)
  jmp  .L2
.L3:
  movl  -24(%rbp), %edx
  movl  %edx, %eax
  sall  $2, %eax
  addl  %edx, %eax
  addl  %eax, %eax
  movl  %eax, %edx
  movl  -20(%rbp), %eax
  cltq
  movzbl  -13(%rbp,%rax), %eax
  movsbl  %al, %eax
  addl  %edx, %eax
  subl  $48, %eax
  movl  %eax, -24(%rbp)
  addl  $1, -20(%rbp)
.L2:
  movl  -20(%rbp), %eax
  cltq
  movzbl  -13(%rbp,%rax), %eax
  testb  %al, %al
  jne  .L3
  movl  -24(%rbp), %eax
  leaq  .LC1(%rip), %rdx
  movl  %eax, %esi
  movq  %rdx, %rdi
  movl  $0, %eax
  call  printf@PLT
  jmp  .L4
  .cfi_endproc
.LFE0:
  .size  main, .-main
  .ident  "GCC: (GNU) 15.2.1 20250813"
  .section  .note.GNU-stack,"",@progbits

  .file  "main.c"
  .text
  .globl  main                            # -- Begin function main
  .p2align  4
  .type  main,@function
main:                                   # @main
  .cfi_startproc
# %bb.0:
  pushq  %rbp
  .cfi_def_cfa_offset 16
  .cfi_offset %rbp, -16
  movq  %rsp, %rbp
  .cfi_def_cfa_register %rbp
  subq  $32, %rsp
  movl  $0, -4(%rbp)
  movl  $0, -8(%rbp)
  movl  $0, -12(%rbp)
.LBB0_1:                                # =>This Loop Header: Depth=1
                                        #     Child Loop BB0_2 Depth 2
  movl  $0, -8(%rbp)
  leaq  -17(%rbp), %rsi
  leaq  .L.str(%rip), %rdi
  movb  $0, %al
  callq  __isoc99_scanf@PLT
  movl  $0, -12(%rbp)
.LBB0_2:                                #   Parent Loop BB0_1 Depth=1
                                        # =>  This Inner Loop Header: Depth=2
  movslq  -12(%rbp), %rax
  movsbl  -17(%rbp,%rax), %eax
  cmpl  $0, %eax
  je  .LBB0_5
# %bb.3:                                #   in Loop: Header=BB0_2 Depth=2
  imull  $10, -8(%rbp), %eax
  movslq  -12(%rbp), %rcx
  movsbl  -17(%rbp,%rcx), %ecx
  addl  %ecx, %eax
  subl  $48, %eax
  movl  %eax, -8(%rbp)
# %bb.4:                                #   in Loop: Header=BB0_2 Depth=2
  movl  -12(%rbp), %eax
  addl  $1, %eax
  movl  %eax, -12(%rbp)
  jmp  .LBB0_2
.LBB0_5:                                #   in Loop: Header=BB0_1 Depth=1
  movl  -8(%rbp), %esi
  leaq  .L.str.1(%rip), %rdi
  movb  $0, %al
  callq  printf@PLT
  jmp  .LBB0_1
.Lfunc_end0:
  .size  main, .Lfunc_end0-main
  .cfi_endproc
                                        # -- End function
  .type  .L.str,@object                  # @.str
  .section  .rodata.str1.1,"aMS",@progbits,1
.L.str:
  .asciz  "%s"
  .size  .L.str, 3

  .type  .L.str.1,@object                # @.str.1
.L.str.1:
  .asciz  "input=%d\n"
  .size  .L.str.1, 10

  .ident  "clang version 20.1.8"
  .section  ".note.GNU-stack","",@progbits
  .addrsig
  .addrsig_sym __isoc99_scanf
  .addrsig_sym printf

First, the relevant part of gcc.s:

.LFB0:
  .cfi_startproc
  pushq  %rbp
  .cfi_def_cfa_offset 16
  .cfi_offset 6, -16
  movq  %rsp, %rbp
  .cfi_def_cfa_register 6
  subq  $32, %rsp           # reserve 32 bytes of stack for locals
  movq  %fs:40, %rax        # load the stack-protector guard value
  movq  %rax, -8(%rbp)      # and store it at -8(%rbp)
  xorl  %eax, %eax
  movl  $0, -24(%rbp)       # sum = 0;
  movl  $0, -20(%rbp)       # i = 0;
.L4:
  movl  $0, -24(%rbp)       # sum = 0;
  leaq  -13(%rbp), %rax     # input[0] is at -13(%rbp)
    ...

Now let's see how clang.s handles it:

main:                                   # @main
  .cfi_startproc
# %bb.0:
  pushq  %rbp
  .cfi_def_cfa_offset 16
  .cfi_offset %rbp, -16
  movq  %rsp, %rbp
  .cfi_def_cfa_register %rbp
  subq  $32, %rsp       # reserve 32 bytes of stack for locals
  movl  $0, -4(%rbp)    # slot for main's return value, unused in this program
  movl  $0, -8(%rbp)    # sum = 0;
  movl  $0, -12(%rbp)   # i = 0;
.LBB0_1:                                # =>This Loop Header: Depth=1
                                        #     Child Loop BB0_2 Depth 2
  movl  $0, -8(%rbp)    # sum = 0;
  leaq  -17(%rbp), %rsi # input[0] is at -17(%rbp)
    ...

Now it's clear!

Let's draw the two stack layouts, with every offset taken relative to %rbp:

GCC stack

Offset	Variable
-8	stack-protector guard value (8 bytes, -8 to -1)
-9	input[4]
-10	input[3]
-11	input[2]
-12	input[1]
-13	input[0]
…	…
-20	i
-24	sum

Clang stack

Offset	Variable
-8	sum
-12	i
-13	input[4]
-14	input[3]
-15	input[2]
-16	input[1]
-17	input[0]

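So we reach the following conclusion:

GCC places i and sum below input[0] with a gap in between, so no variable follows input[4] that could affect the loop. What does follow it is the stack-protector guard value that the prologue loads from %fs:40 and stores at -8(%rbp). Overflowing input therefore lands on that guard value rather than on i, and in the runs above the loop still found a terminating zero byte in time, which is why nothing visibly went wrong. Clang instead places i directly after input[4], so the overflow lands on the loop counter.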
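The same layout question can also be asked from inside a C program instead of by reading assembly. The sketch below is only a rough probe: byte_distance and report_layout are hypothetical helpers, the address arithmetic goes through intptr_t and is therefore implementation-defined, and taking the address of a local may itself change where the compiler places it.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>

/* Signed distance in bytes from `from` to `to`, computed on the integer
   values of the addresses. This avoids subtracting pointers that point
   into different objects, which the C standard leaves undefined. */
ptrdiff_t byte_distance(const void *from, const void *to)
{
    assert(from != NULL && to != NULL);
    return (ptrdiff_t)((intptr_t)to - (intptr_t)from);
}

/* Prints where i and sum ended up relative to input[0] in this build. */
void report_layout(void)
{
    int sum = 0, i = 0;
    char input[5] = "";

    printf("i   is at input%+td\n", byte_distance(input, &i));
    printf("sum is at input%+td\n", byte_distance(input, &sum));
}
```

A negative distance for i means the array sits above the counter, as in the GCC table; a distance of +5 means i starts right after input[4], as in the Clang table. The numbers printed by a particular build may differ from both.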

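2. Assembling

3. Linking

Oops, these two headings have nothing to do with this post; I only added them so the table of contents looks nicer (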

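Verifying the hypothesis

With GCC, let's look at input[-7]. Since input[0] is at -13 and -13 - 7 = -20, that should be i! Change the program to: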

#include <stdio.h>

int main(void)
{
    int sum = 0, i = 0;
    char input[5];

    while (1) {
        sum = 0;
        scanf("%s", input);
        for (i = 0; input[i] != '\0'; i++)
            sum = sum*10 + input[i] - '0';
        printf("input=%d\n", sum);
        printf("%d\n", input[-7]);
    }
    return 0;
}

Run it:

$ gcc test.c -o test
$ ./test
12345
input=12345
5

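Success! As expected, input[-7] is i. More precisely, it is the lowest byte of i, which equals i here because i is 5 when the loop ends.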
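Because input[-7] is a single char, it can only ever show one byte of the 4-byte i. A minimal sketch of that relationship follows; byte_of_int is a hypothetical helper, and the byte order it reveals assumes a little-endian machine such as x86-64.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Returns byte number k (0 = lowest address) of the in-memory
   representation of value. */
unsigned char byte_of_int(int value, size_t k)
{
    unsigned char bytes[sizeof value];

    assert(k < sizeof value);
    memcpy(bytes, &value, sizeof value);  /* copy the object representation */
    return bytes[k];
}
```

On a little-endian machine byte_of_int(5, 0) is 5 and byte_of_int(5, 1) is 0, the same 0x05 0x00 pair that appears after the digits in the book's GDB dump. For a counter of 300 the lowest byte would be 44, so a one-byte peek like input[-7] only matches i while i stays below 256.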

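How to avoid the risk?

Did you pass -O0? Did you pass -fsanitize=address? Add them now!

So said a friend in a group chat. All right, let's compile once more with these two flags: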

$ gcc -O0 -fsanitize=address test.c -o test
$ ./test
1234567
=================================================================
==16748==ERROR: AddressSanitizer: stack-buffer-overflow on address 0x7b4c18f00025 at pc 0x7f4c1ba6e51d bp 0x7ffdce2744c0 sp 0x7ffdce273c48
WRITE of size 8 at 0x7b4c18f00025 thread T0
    #0 0x7f4c1ba6e51c in scanf_common /usr/src/debug/gcc/gcc/libsanitizer/sanitizer_common/sanitizer_common_interceptors_format.inc:342
    #1 0x7f4c1ba8edee in __isoc23_vscanf /usr/src/debug/gcc/gcc/libsanitizer/sanitizer_common/sanitizer_common_interceptors.inc:1554
    #2 0x7f4c1ba8f5f5 in __isoc23_scanf /usr/src/debug/gcc/gcc/libsanitizer/sanitizer_common/sanitizer_common_interceptors.inc:1584
    #3 0x564e475b9254 in main (/home/sakimidare/CLionProjects/c_study/test+0x1254) (BuildId: 35fabe8824d13d3c1ca4e2836107a3b16992a4a9)
    #4 0x7f4c1b627674  (/usr/lib/libc.so.6+0x27674) (BuildId: 4fe011c94a88e8aeb6f2201b9eb369f42b4a1e9e)
    #5 0x7f4c1b627728 in __libc_start_main (/usr/lib/libc.so.6+0x27728) (BuildId: 4fe011c94a88e8aeb6f2201b9eb369f42b4a1e9e)
    #6 0x564e475b90d4 in _start (/home/sakimidare/CLionProjects/c_study/test+0x10d4) (BuildId: 35fabe8824d13d3c1ca4e2836107a3b16992a4a9)

Address 0x7b4c18f00025 is located in stack of thread T0 at offset 37 in frame
    #0 0x564e475b91b8 in main (/home/sakimidare/CLionProjects/c_study/test+0x11b8) (BuildId: 35fabe8824d13d3c1ca4e2836107a3b16992a4a9)

  This frame has 1 object(s):
    [32, 37) 'input' (line 6) <== Memory access at offset 37 overflows this variable
HINT: this may be a false positive if your program uses some custom stack unwind mechanism, swapcontext or vfork
      (longjmp and C++ exceptions *are* supported)
SUMMARY: AddressSanitizer: stack-buffer-overflow (/home/sakimidare/CLionProjects/c_study/test+0x1254) (BuildId: 35fabe8824d13d3c1ca4e2836107a3b16992a4a9) in main
Shadow bytes around the buggy address:
  0x7b4c18effd80: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x7b4c18effe00: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x7b4c18effe80: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x7b4c18efff00: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x7b4c18efff80: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
=>0x7b4c18f00000: f1 f1 f1 f1[05]f3 f3 f3 00 00 00 00 00 00 00 00
  0x7b4c18f00080: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x7b4c18f00100: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x7b4c18f00180: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x7b4c18f00200: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x7b4c18f00280: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
Shadow byte legend (one shadow byte represents 8 application bytes):
  Addressable:           00
  Partially addressable: 01 02 03 04 05 06 07
  Heap left redzone:       fa
  Freed heap region:       fd
  Stack left redzone:      f1
  Stack mid redzone:       f2
  Stack right redzone:     f3
  Stack after return:      f5
  Stack use after scope:   f8
  Global redzone:          f9
  Global init order:       f6
  Poisoned by user:        f7
  Container overflow:      fc
  Array cookie:            ac
  Intra object redzone:    bb
  ASan internal:           fe
  Left alloca redzone:     ca
  Right alloca redzone:    cb
==16748==ABORTING

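As you can see, these flags do help: AddressSanitizer catches the out-of-bounds write at run time and aborts the program instead of letting it corrupt the stack silently.

Still, the most effective approach is to think through every case in advance and guard against any bug that might appear! ~~(a tester walks into a bar and orders fried rice.txt)~~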
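One way to think it through in advance, sketched here as a suggestion rather than as the book's fix, is to give %s a field width so it can never store more than the array holds, and to reject input that does not fit. The function parse_small_int is hypothetical; it works on a string through sscanf so that it can be tried without a keyboard.

```c
#include <assert.h>
#include <ctype.h>
#include <stdio.h>
#include <stdlib.h>

/* Parses a non-negative decimal integer of at most 4 digits from line.
   Returns 1 and stores the value in *out on success, 0 otherwise. */
int parse_small_int(const char *line, int *out)
{
    char input[5];      /* same size as in the article */
    char *end;
    long value;
    int consumed = 0;

    assert(line != NULL && out != NULL);

    /* The width 4 plus the terminator fits exactly into input[5].
       %n records how many characters of line were consumed. */
    if (sscanf(line, "%4s%n", input, &consumed) != 1)
        return 0;

    /* If the token continues past 4 characters, it was too long. */
    if (line[consumed] != '\0' && !isspace((unsigned char)line[consumed]))
        return 0;

    /* strtol reports where parsing stopped, so stray characters are caught. */
    value = strtol(input, &end, 10);
    if (end == input || *end != '\0' || value < 0)
        return 0;

    *out = (int)value;
    return 1;
}
```

With this check "123\n" parses to 123, while "12345" is rejected instead of spilling into a neighbouring variable. In the original loop, changing the format string to "%4s" would by itself bound the write, although the leftover characters would then be read as the next number.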

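Closing thoughts

This is the first time I've written this kind of article; consider it a test of my ability to solve problems on my own!

~~I don't know whether anyone will read this, so I'll just treat it as a diary.~~ If you've read this far, thank you for reading!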