Unable to step into C standard library source code

Asked by Matthias Kläy

Hi,

I teach how to write firmware, currently using Simplicity Studio on EFM32WG990 Cortex-M4F based MCUs. For educational reasons, I would like to demonstrate to my audience what exactly happens between the hardware reset and the call to main().

I have already spent approx. 20 hours trying to achieve this with Simplicity Studio on an EFM32 STK3800, with no success so far. The compacted case conversation with Silicon Labs is summarized at https://www.silabs.com/community/mcu/32-bit/forum.topic.html/searching_for_c_libr-Y1V4. Silicon Labs has concluded that this is rather an issue of the GNU Arm toolchain than Simplicity Studio, which made me create a case at https://developer.arm.com/open-source/gnu-toolchain/gnu-rm, where I got redirected here.

All three so far (me, SiLabs, Arm) concluded that it *should* be possible, but none of us is able to make it happen. I assume that the issue is either...
...missing debug information in crt0.o/libc.a/... of the GNU Arm distribution, or...
...a general configuration issue in Simplicity Studio / Eclipse, or...
...a configuration issue in the project specific debug configuration in Simplicity Studio / Eclipse, or...
...not possible at all? Can hardly believe so.
Guidance on how to configure Simplicity Studio / Eclipse to step into crt0.S and .c source files (located in a local workspace folder) of the C standard library objects (located at C:\SiliconLabs\SimplicityStudio\v4\developer\toolchains\gnu_arm\7.2_2017q4\arm-none-eabi\lib) is highly appreciated.

Best regards,
Matthias

Question information

Language:
English Edit question
Status:
Solved
For:
GNU Arm Embedded Toolchain Edit question
Assignee:
No assignee Edit question
Solved by:
Matthias Kläy
Solved:
Last query:
Last reply:
Revision history for this message
john (jkovach) said :
#1

For a long time now I've been providing my own startup code. Something like this:

#include <string.h>

extern char __etext, __data_start__, __data_end__, __bss_start__, __bss_end__;
extern int main();

void __attribute((used, noreturn))
Reset_Handler(void)
{
    // copy-init variables
    memcpy(&__data_start__, &__etext, &__data_end__ - &__data_start__);
    // zero-init variables
    memset(&__bss_start__, 0, &__bss_end__ - &__bss_start__);
    (void)main();
    for (;;) ;
}

Obviously, you can easily step into it. Besides, it demonstrates that there is really nothing special to it. After all, one of the stated design goals behind Cortex-M is that you can run it without writing a single line in assembly.

Revision history for this message
Matthias Kläy (matthias-klaey) said :
#2

Hi John,

While this may be a feasible approach for some projects, this is a no-go for teaching. Why?
- Students would have to modify new projects that were created by the IDE, replacing something vendor provided by something hacked.
- Startup may do more than just data init and bss zero. Take a look at e.g. C:\SiliconLabs\SimplicityStudio\v4\developer\sdks\gecko_sdk_suite\v2.3\platform\Device\SiliconLabs\EFM32WG\Source\GCC\startup_efm32wg.S and you will find a bunch of interrupt handlers. The file is 341 lines long. And for C++, somebody would also have to call the initializers of static objects. And yes, if done with some precaution, this is OK even on embedded systems.
- Replacing something vendor provided by something hacked may be the solution in some situations. But in this case, including from an economic point of view (i.e. cost of verification, maintenance,...) this make-or-buy-decision is a very simple one: Keep what's provided by the vendor as long as it works fine. And, hacking to work around a tool configuration issue isn't a very wise decision either.

Summarizing:
This cannot be the final answer for my question I am attempting to get a solution for.

Best regards,
Matthias

Revision history for this message
Thomas Preud'homme (thomas-preudhomme) said :
#3

Hi Matthias,

What is exactly the problem you are facing? What happen when you put a breakpoint on _mainCRTStartup?

Best regards,

Thomas

Revision history for this message
Thomas Preud'homme (thomas-preudhomme) said :
#4

Hi Matthias,

What is exactly the problem you are facing? What happen when you put a breakpoint on _mainCRTStartup?

Best regards,

Thomas

Revision history for this message
Thomas Preud'homme (thomas-preudhomme) said :
#5

Fix status after hitting the wrong button

Revision history for this message
Matthias Kläy (matthias-klaey) said :
#6

Hi Thomas,

Answer to #1:

I cannot locate _mainCRTStartup in my project, so I cannot simply put a breakpoint there, at least not without some trick which I am not yet aware of.

Answer to #2:

I have a standard Simplicity Studio workspace with a standard Simplicity Studio project.
 - standard = based on the GNU ARM toolchain and the Gecko SDK
 - GNU ARM toolchain installed at C:\SiliconLabs\SimplicityStudio\v4\developer\toolchains\gnu_arm\7.2_2017q4
 - Gecko SDK installed at C:\SiliconLabs\SimplicityStudio\v4\developer\sdks\gecko_sdk_suite\v2.3

The GNU ARM toolchain only contains .a / .o / .h files, e.g. \arm-none-eabi\lib\crt0.o and libc_nano.a, but does not contain source code. Therefore, I have checked out the source code of GNU ARM 7.2_2017q4 to a local working directory, incl. e.g. D:\<Workspace>\newlib_of_gnu-arm-7.2_2017q4\libc\sys\arm\crt0.S.

So all should be set for stepping into crt0.S as well as standard C library source like e.g. memset(). At first, debugging in Simplicity Studio / Eclipse starts fine, execution stops at Reset_Handler: in startup_gcc_efm32wg.s (line #144). Stepping and/or resuming also works fine to bl __START (which is defined to _start) in startup_gcc_efm32wg.s (line #267). But then, stepping leaves the source code and only shows the disassembly. Eclipse shows no "cannot locate source code" dialog, adding the Newlib source to [Debug Configurations > Source] doesn't help, activating [Windows > Preferences > C/C++ > Debug > Show source files in binaries] doesn't help either. *I don't mange telling Eclipse to step through the crt0.S and standard C library source like e.g. memset()*. As a consequence, I never get to _mainCRTStartup either.

Best regards,
Matthias

Revision history for this message
Thomas Preud'homme (thomas-preudhomme) said :
#7

Some string routines and startup code is written in assembly so showing assembly might be perfectly normal. Does it show a filename for that assembly? Perhaps step does not work for assembly code, in which case you can try stepi but of course that won't show you more than the assembly.

Best regards,

Thomas

Revision history for this message
Matthias Kläy (matthias-klaey) said :
#8

Hi Thomas,

I appreciate your support, and at the same time ask you reading my explanations accuratly enough:

"Perhaps step does not work for assembly code"
Of course it does:
"Stepping and/or resuming also works fine to bl __START (which is defined to _start) in startup_gcc_efm32wg.s (line #267)"

"showing assembly might be perfectly normal"
Of course it is, but I stated:
"only shows the disassembly"

"Does it show a filename for that assembly?"
Actually not. So maybe this is the root cause for the whole matter:

When opening C:\SiliconLabs\SimplicityStudio\v4\developer\toolchains\gnu_arm\7.2_2017q4\arm-none-eabi\lib\crt0.o in a text editor, I cannot find anything like "crt0.S". I am not an expert on binary formats, but I'd expect seeing the filename in plaintext somewhere.

However, when opening C:\SiliconLabs\SimplicityStudio\v4\developer\toolchains\gnu_arm\7.2_2017q4\arm-none-eabi\lib\libc_nano.a in a text editor, I do see file names like "aeabi_memset.c". So I'd expect Eclipse being able to locate that file in the configured source paths.

You are more into the GNU ARM toolchain than me. Could you verify, whether objects and libraries are indeed built with debug information included, such that Eclipse has a chance to locate related source files? Or could you tell me, how I can find out this, e.g. by using objdump? Also, could you confirm that this is something defined by the GNU ARM toolchain distribution itself? Or is this something a vendor like Silicon Labs can configure?

Thank you and best regards,
Matthias

Revision history for this message
Leo Havmøller (leh-p) said :
#9

The contents of an elf file can be displayed using readelf e.g.:
  arm-none-eabi-readelf.exe --all -wlLiaprmfFsoRt crtbegin.o > dump.txt
A file compiled with debug information would have sections like these:
  [626] .debug_info PROGBITS 00000000 011f47 03a80d 00 0 0 1
  [627] .debug_abbrev PROGBITS 00000000 04c754 0046eb 00 0 0 1
  [628] .debug_aranges PROGBITS 00000000 050e3f 0009c0 00 0 0 1
  [629] .debug_ranges PROGBITS 00000000 0517ff 000c80 00 0 0 1
  [630] .debug_line PROGBITS 00000000 05247f 007aae 00 0 0 1
  [631] .debug_str PROGBITS 00000000 059f2d 01164e 01 MS 0 0 1
  [632] .debug_frame PROGBITS 00000000 06b57c 0018c8 00 0 0 4
  [633] .debug_loc PROGBITS 00000000 06ce44 002de2 00 0 0 1
Dumping crtbegin.o and libgcc.a shows no debug sections.
The filename you found is just the filename, in the .symtab section.
Libs are generally built without debug information, and this seems to be consistent across different gcc target architectures.
My guess is that it is related to file size and distribution complexity. Different debuggers prefer different DWARF formats, and as such the toolchain would need to deliver libs built for each format.

Revision history for this message
Launchpad Janitor (janitor) said :
#10

This question was expired because it remained in the 'Open' state without activity for the last 15 days.

Revision history for this message
Matthias Kläy (matthias-klaey) said :
#11

Hmm... "without activity for the last 15 days" ?!?

Last activity = 2018-09-21
Today = 2018-09-28
                            -----------------
No activity = 7 days

Looks like one of the underlying algorithms needs an update ;-)

PS: I am about following up Leo's answer this weekend.

Revision history for this message
Matthias Kläy (matthias-klaey) said :
#12

Hi Leo,

Thank you for the hints on reading ELF. Based on my limited knowledge, I think I can confirm the libs in question indeed do not contain any debug information:

_PS C:\SiliconLabs\SimplicityStudio\v4\developer\toolchains\gnu_arm\7.2_2017q4\arm-none-eabi\bin> .\readelf.exe --debug-dump ..\lib\crt0.o > ..\lib\crt0-DumpDebug.txt_

The resulting [crt0-DumpDebug.txt] is an empty file.

_PS C:\SiliconLabs\SimplicityStudio\v4\developer\toolchains\gnu_arm\7.2_2017q4\arm-none-eabi\bin> .\readelf.exe --all ..\
lib\crt0.o > ..\lib\crt0-DumpAll.txt_

The resulting [crt0-DumpAll.txt] contains nothing with "debug".

Googling for "build gnu arm with debug info" led me to https://mcuoneclipse.com/2014/08/23/gnu-libs-with-debug-information-rebuilding-the-gnu-arm-libraries/. Before spending the time working through all this, let me ask whether you or somebody else on launchpad whether you have a more recent and ideally more compact HowTo describing hot to "build gnu arm with debug info" on a Windows machine.

Thanks & Best regards,
Matthias

Revision history for this message
Thomas Preud'homme (thomas-preudhomme) said :
#13

Hi Matthias,

If you want a build as close as possible to the one distributed on this space, I would recommend to use a Linux VM and follow the official build instructions found in the source tarball [1] after surrounding the task III-10 from build-toolchain.sh (/$HOST_NATIVE/strip_target_objects/) between "if false; then" and "fi".

[1] https://developer.arm.com/-/media/Files/downloads/gnu-rm/7-2018q2/gcc-arm-none-eabi-7-2018-q2-update-src.tar.bz2

Best regards.

Revision history for this message
Matthias Kläy (matthias-klaey) said :
#14

Hi Thomas,

Thanks for the HowTo, I'll follow that.

Let me add a final wish for GNU ARMs future:

Using an open source based library should also give all benefits of open source, such as being able to step through the code in the debugger. Even more so, given that commercial tool chains like IAR these days provide full standard library source code and the ability to step into it as part of their installation. So for the future, I wish that GNU ARM evolves into this direction, i.e. being *distributed with and without debug information*.

Best regards,
Matthias