Message in log "ma-start: no restart flag found, not restarting MA"
-
Hi,
I have a small problem with Mango: from time to time it stops responding. After correlating the timestamps in the watchlist with ma-script.log, I found that it happens whenever these messages appear in ma-script.log:
śro, 8 lut 2017, 03:18:24 CET ma-start: no restart flag found, not restarting MA
śro, 8 lut 2017, 03:18:24 CET ma-start: MA done
I don't know why this happens. Afterwards I have to restart the Mango application. Please help. Best regards.
-
Hi przemom,
The ma-script.log file should only contain the output of the Mango/bin scripts, which will not tell you why Mango shut down. For that, you should be looking in your ma.log file. A likely candidate is the brewersdashboard module: if it is installed without a license, it will shut Mango down after 8 hours. An unlicensed NoSQL module will do likewise.
-
Hi,
The Mango restarts are caused by running out of memory. After a restart I noticed this in syslog:
Normal free:11516kB min:3836kB low:4792kB high:5752kB active_anon:272132kB inactive_anon:334892kB active_file:158044kB inactive_file:109544kB unevictable:2264kB isolated(anon):128kB isolated(file):0kB present:966656kB managed:942128kB mlocked:2264kB dirty:0kB writeback:296kB mapped:42120kB shmem:4856kB slab_reclaimable:16536kB slab_unreclaimable:17068kB kernel_stack:3088kB pagetables:3896kB unstable:0kB bounce:0kB free_pcp:96kB local_pcp:0kB free_cma:28kB writeback_tmp:0kB pages_scanned:708 all_unreclaimable? no
[1316838.230279] lowmem_reserve[]: 0 0
[1316838.230300] Normal: 1245*4kB (UME) 828*8kB (UME) 0*16kB 0*32kB 0*64kB 0*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 11604kB
[1316838.230377] 88560 total pagecache pages
[1316838.230395] 20168 pages in swap cache
[1316838.230407] Swap cache stats: add 12675710, delete 12655542, find 13533006/17454463
[1316838.230418] Free swap = 1621416kB
[1316838.230427] Total swap = 2097148kB
[1316838.230438] 241664 pages RAM
[1316838.230448] 0 pages HighMem/MovableOnly
[1316838.230457] 6132 pages reserved
[1316838.230467] 2048 pages cma reserved
[1316838.230477] [ pid ] uid tgid total_vm rss nr_ptes nr_pmds swapents oom_score_adj name
[1316838.230539] [ 436] 0 436 1285 599 7 0 37 0 systemd-journal
[1316838.230563] [ 466] 0 466 5172 98 6 0 0 0 lvmetad
[1316838.230587] [ 505] 0 505 2968 314 8 0 96 -1000 systemd-udevd
[1316838.230608] [ 714] 100 714 3064 314 7 0 43 0 systemd-timesyn
[1316838.230628] [ 1020] 0 1020 471 19 5 0 0 0 iscsid
[1316838.230651] [ 1021] 0 1021 586 531 5 0 0 -17 iscsid
[1316838.230673] [ 1027] 104 1027 7794 471 10 0 77 0 rsyslogd
[1316838.230696] [ 1030] 107 1030 1303 526 6 0 91 -900 dbus-daemon
[1316838.230721] [ 1036] 0 1036 9464 607 12 0 94 0 accounts-daemon
[1316838.230742] [ 1041] 0 1041 712 265 6 0 26 0 atd
[1316838.230765] [ 1045] 0 1045 1540 314 7 0 31 0 cron
[1316838.230785] [ 1051] 0 1051 965 417 6 0 174 0 systemd-logind
[1316838.230806] [ 1059] 0 1059 1997 486 8 0 105 -1000 sshd
[1316838.230829] [ 1061] 0 1061 9831 243 12 0 21 0 lxcfs
[1316838.230848] [ 1070] 0 1070 346 208 4 0 0 0 acpid
[1316838.230868] [ 1118] 0 1118 1901 372 9 0 0 0 su
[1316838.230888] [ 1119] 0 1119 1901 377 9 0 6 0 su
[1316838.230909] [ 1136] 0 1136 8710 458 13 0 60 0 polkitd
[1316838.230929] [ 1203] 8 1203 779 323 5 0 7 0 nullmailer-send
[1316838.230951] [ 1242] 0 1242 832 300 6 0 17 0 irqbalance
[1316838.230971] [ 1246] 1001 1246 1614 389 7 0 430 0 systemd
[1316838.230995] [ 1256] 1001 1256 1859 9 7 0 463 0 (sd-pam)
[1316838.231015] [ 1277] 0 1277 1272 252 6 0 0 0 agetty
[1316838.231035] [ 1278] 0 1278 1012 203 7 0 0 0 agetty
[1316838.231055] [ 1286] 0 1286 1509 816 6 0 0 0 systemd
[1316838.231084] [ 1289] 0 1289 1859 67 7 0 405 0 (sd-pam)
[1316838.231105] [ 1301] 1001 1301 1756 678 7 0 29 0 modbuspoll.sh
[1316838.231126] [ 1309] 0 1309 1521 297 7 0 88 0 mysqld_safe
[1316838.231147] [ 1312] 1001 1312 341 254 4 0 8 0 autossh
[1316838.231167] [ 1346] 0 1346 4481 867 8 0 0 0 rpiShutdown.py
[1316838.231190] [ 1630] 0 1630 36544 32367 74 0 1307 0 landscape-clien
[1316838.231210] [ 1635] 112 1635 34461 8799 59 0 12656 0 landscape-broke
[1316838.231231] [ 1636] 112 1636 27416 19685 55 0 512 0 landscape-monit
[1316838.231252] [ 1637] 0 1637 23030 3417 49 0 16374 0 landscape-manag
[1316838.231274] [ 1693] 1001 1693 341 253 4 0 8 0 autossh
[1316838.231294] [ 1878] 1001 1878 341 257 5 0 5 0 autossh
[1316838.231316] [20894] 0 20894 552 265 5 0 16 0 mdadm
[1316838.231337] [18724] 114 18724 151608 4448 145 0 41850 0 mysqld
[1316838.231359] [18725] 0 18725 1705 99 8 0 45 0 logger
[1316838.231381] [17485] 115 17485 1426 280 7 0 93 0 karaf
[1316838.231402] [17640] 115 17640 113220 29810 156 0 24123 0 java
[1316838.231423] [14474] 1001 14474 1499 741 8 0 88 0 ssh
[1316838.231444] [15728] 1001 15728 1468 718 7 0 94 0 ssh
[1316838.231464] [15749] 1001 15749 1468 744 8 0 91 0 ssh
[1316838.231489] [15943] 0 15943 1490 550 7 0 0 0 ma-start.sh
[1316838.231509] [15950] 0 15950 112695 53658 129 0 0 0 java
[1316838.231530] [16202] 1001 16202 4647 3288 12 0 0 0 epmmodbuscontro
[1316838.231553] [16203] 1001 16203 1159 80 7 0 0 0 sleep
[1316838.231574] [16216] 0 16216 2945 1128 9 0 0 0 sshd
[1316838.231594] [16219] 0 16219 630 184 5 0 0 0 systemd-cgroups
[1316838.231612] Out of memory: Kill process 17640 (java) score 71 or sacrifice child
[1316838.247650] Killed process 17640 (java) total-vm:452880kB, anon-rss:114704kB, file-rss:4536kB
My platform: Raspberry Pi 2,
Kernel: Linux rpi2pm 4.4.0-1046-raspi2 #53-Ubuntu SMP Sun Mar 5 13:25:01 UTC 2017 armv7l armv7l armv7l GNU/Linux
The time between restarts is rather random. It can be anything from a few minutes to several hours.
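For anyone reading a dump like the one above, here is a minimal Python sketch that ranks the processes in the OOM-killer table by resident memory. It assumes the rss column is counted in 4 kB pages (which agrees with the "Killed process" line: 29810 pages = 114704 kB + 4536 kB). The names ROW, PAGE_KB and top_rss are made up for this illustration, and SAMPLE holds just four rows copied from the dump.

```python
import re

PAGE_KB = 4  # assumption: 4 kB pages, consistent with the "Killed process" line

# One row of the OOM-killer process table:
# [ pid ] uid tgid total_vm rss nr_ptes nr_pmds swapents oom_score_adj name
# Kernel timestamps such as [1316838.231402] contain a dot, so they never match.
ROW = re.compile(
    r"\[\s*(?P<pid>\d+)\]\s+(?P<uid>\d+)\s+(?P<tgid>\d+)\s+(?P<total_vm>\d+)\s+"
    r"(?P<rss>\d+)\s+(?P<nr_ptes>\d+)\s+(?P<nr_pmds>\d+)\s+(?P<swapents>\d+)\s+"
    r"(?P<adj>-?\d+)\s+(?P<name>\S+)"
)

def top_rss(log_text, n=3):
    """Return the n largest processes as (rss_kB, pid, uid, name) tuples."""
    rows = [
        (int(m["rss"]) * PAGE_KB, int(m["pid"]), int(m["uid"]), m["name"])
        for m in ROW.finditer(log_text)
    ]
    return sorted(rows, reverse=True)[:n]

# Four rows copied from the syslog excerpt in this thread.
SAMPLE = """
[1316838.231190] [ 1630] 0 1630 36544 32367 74 0 1307 0 landscape-clien
[1316838.231337] [18724] 114 18724 151608 4448 145 0 41850 0 mysqld
[1316838.231402] [17640] 115 17640 113220 29810 156 0 24123 0 java
[1316838.231509] [15950] 0 15950 112695 53658 129 0 0 0 java
"""

if __name__ == "__main__":
    for rss_kb, pid, uid, name in top_rss(SAMPLE):
        print(f"{name:<16} pid={pid:<6} uid={uid:<4} rss={rss_kb} kB")
```

Running it over the full syslog text instead of SAMPLE should work the same way, since the pattern does not depend on line breaks.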
Best regards.
-
Hi przemom,
Interesting! Some questions,
- Are you running two instances of Mango? If not, what's the other java process? It looks like it killed PID 17640 but I don't see that as being Mango, since ma-start and 15950 java belong to uid 0.
- Are you using any ext-enabled scripts to set the heap? I think the OS OOM-Killer should only be active if the system as a whole is starved for memory, which means you can probably set -Xmx (max heap memory) to prevent Java from expanding its heap too far and aggravating the OS. It looks like your RPi has about a gig of memory. Depending on what that other Java is doing, I would think you should try a static heap of roughly half the available memory: move memory-small.sh from Mango/bin/ext-available to ext-enabled, set its heap options to -Xmx500m -Xms500m, then restart Mango. With a limited heap, Mango shouldn't use all of the system's memory and thus shouldn't be killed by the OOM-Killer.
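As a rough illustration of the "half the available memory" rule of thumb, here is a small Python sketch that turns a total-memory figure into matching -Xms/-Xmx values. The helper names read_mem_total_kb and heap_flags are hypothetical, the 0.5 fraction is only the suggestion from this post, and the page count is the "241664 pages RAM" line from the dump with an assumed 4 kB page size. How the resulting flags get into memory-small.sh depends on that script's contents, which are not shown in this thread.

```python
def read_mem_total_kb(path="/proc/meminfo"):
    """Return the MemTotal value (in kB) from a Linux meminfo file."""
    with open(path) as fh:
        for line in fh:
            if line.startswith("MemTotal:"):
                return int(line.split()[1])
    raise ValueError("MemTotal not found in " + path)

def heap_flags(total_kb, fraction=0.5):
    """Build equal -Xms/-Xmx flags (a static heap) from total memory in kB."""
    heap_mb = int(total_kb * fraction) // 1024
    return "-Xms{0}m -Xmx{0}m".format(heap_mb)

# Figures taken from the OOM dump: "241664 pages RAM", assumed 4 kB per page.
PAGES_RAM = 241664
PAGE_KB = 4

if __name__ == "__main__":
    print(heap_flags(PAGES_RAM * PAGE_KB))
```

On the machine itself, heap_flags(read_mem_total_kb()) would use the live MemTotal value instead. If a second JVM runs on the same box, a smaller fraction may be the safer choice.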
-
My RPi is a test machine, so I am testing both Mango and openHAB on it. I added memory-small.sh with the changes you suggested. Tomorrow I will let you know if it helps. Thanks.