- 
                Notifications
    You must be signed in to change notification settings 
- Fork 13.3k
Closed
Labels
Milestone
Description
I think we may need to add memory barrier to xt_rsil().
Based on the comments I saw here:
From ESP8266_RTOS_SDK/components/esp8266/include/xtensa/xtruntime.h
/*  NOTE: these asm macros don't modify memory, but they are marked
 *  as such to act as memory access barriers to the compiler because
 *  these macros are sometimes used to delineate critical sections;
 *  function calls are natural barriers (the compiler does not know
 *  whether a function modifies memory) unless declared to be inlined.  */
# define XTOS_SET_INTLEVEL(intlevel)            ({ unsigned __tmp; \
                        __asm__ __volatile__(   "rsil   %0, " XTSTR(intlevel) "\n" \
                                                : "=a" (__tmp) : : "memory" ); \
                        __tmp;})After trying to get a better understanding of the "memory" parameter. I think the current xt_rsil() in
Arduino/cores/esp8266/Arduino.h
Line 162 in cd6cf98
| #define xt_rsil(level) (__extension__({uint32_t state; __asm__ __volatile__("rsil %0," __STRINGIFY(level) : "=a" (state)); state;})) | 
needs it.
I wanted to see the results in assembly. So I did a quick compile to .S of this sample function with the old xt_rsil() and one with "memory" specified:
#include <Arduino.h>
#if 0
#undef xt_rsil
#define xt_rsil(level) (__extension__({uint32_t state; __asm__ __volatile__("rsil %0," __STRINGIFY(level) : "=a" (state)::"memory"); state;}))
#endif
int just4Fun(int *a, int *b) {
  uint32_t old_ps;
  int result = *a;
  old_ps = xt_rsil(15);
  *b = *a;
  xt_wsr_ps(old_ps);
  return result;
}Using xt_rsil() without "memory" fence, trimmed .S results:
_Z8just4FunPiS_:
        l32i.n  a2, a2, 0
        rsil a4,15
        s32i.n  a2, a3, 0
        wsr a4,ps; isync
        ret.nUsing xt_rsil() with "memory" fence, trimmed .S results:
_Z8just4FunPiS_:
        l32i.n  a4, a2, 0
        rsil a5,15
        l32i.n  a2, a2, 0
        s32i.n  a2, a3, 0
        wsr a5,ps; isync
        mov.n   a2, a4
        ret.nNotice the change.
- The first example reads *aoutside the critical area. Then saves that value to*bwhile in the critical area.
- The second example reads a copy of *aoutside the critical area. However, while in the critical area it reads*aagain before saving it to*b.
Originally posted by @mhightower83 in #6274 (comment)