Skip to content

Instantly share code, notes, and snippets.

View ideepika's full-sized avatar
🎯
Focusing

Deepika Upadhyay ideepika

🎯
Focusing
View GitHub Profile
88 ../sysdeps/unix/sysv/linux/futex-internal.h: No such file or directory.
(gdb) thread apply bt all
Invalid thread ID: bt all
(gdb) thread apply all bt
Thread 17 (Thread 0x7fee5f7fe700 (LWP 14249)):
#0 0x00007fee8f275ad3 in futex_wait_cancelable (private=<optimized out>, expected=0, futex_word=0x7fee64007cc8) at ../sysdeps/unix/sysv/linux/futex-internal.h:88
#1 __pthread_cond_wait_common (abstime=0x0, mutex=0x7fee64007c78, cond=0x7fee64007ca0) at pthread_cond_wait.c:502
#2 __pthread_cond_wait (cond=0x7fee64007ca0, mutex=0x7fee64007c78) at pthread_cond_wait.c:655
#3 0x00007fee8530ae3c in std::condition_variable::wait(std::unique_lock<std::mutex>&) () from /usr/lib/ceph/libceph-common.so.2
prs:
2021-03-12T01:56:27.042 INFO:tasks.ceph.osd.0.smithi160.stderr:/home/jenkins-build/build/workspace/ceph-dev-new-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/15.2.9-136-gdbb79e05/rpm/el8/BUILD/ceph-15.2.9-136-gdbb79e05/src/osd/PG.cc: In function 'virtual void PG::on_active_advmap(const OSDMapRef&)' thread 7f759466d700 time 2021-03-12T01:56:27.030656+0000
2021-03-12T01:56:27.042 INFO:tasks.ceph.osd.0.smithi160.stderr:/home/jenkins-build/build/workspace/ceph-dev-new-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/15.2.9-136-gdbb79e05/rpm/el8/BUILD/ceph-15.2.9-136-gdbb79e05/src/osd/PG.cc: 1689: FAILED ceph_assert(!bad || !cct->_conf->osd_debug_verify_cached_snaps)
2021-03-12T01:56:27.042 INFO:tasks.ceph.osd.0.smithi160.stderr:2021-03-12T01:56:27.029+0000 7f759466d700 -1 osd.0 pg_epoch: 119 pg[57.d( v 118'71 (0'0,118'71] local-lis/les=75/76 n=31 ec=49/49 lis/c=75/75 les/c/f=76/76/0 sis=75)
2021-03-12T01:56:27.042 INFO:tasks.ceph.osd.0.smithi160.stderr:/home/jenkins-build/build/workspace/ceph-dev-new-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/15.2.9-136-gdbb79e05/rpm/el8/BUILD/ceph-15.2.9-136-gdbb79e05/src/osd/PG.cc: In function 'virtual void PG::on_active_advmap(const OSDMapRef&)' thread 7f759466d700 time 2021-03-12T01:56:27.030656+0000
2021-03-12T01:56:27.042 INFO:tasks.ceph.osd.0.smithi160.stderr:/home/jenkins-build/build/workspace/ceph-dev-new-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/15.2.9-136-gdbb79e05/rpm/el8/BUILD/ceph-15.2.9-136-gdbb79e05/src/osd/PG.cc: 1689: FAILED ceph_assert(!bad || !cct->_conf->osd_debug_verify_cached_snaps)
2021-03-12T01:56:27.042 INFO:tasks.ceph.osd.0.smithi160.stderr:2021-03-12T01:56:27.029+0000 7f759466d700 -1 osd.0 pg_epoch: 119 pg[57.d( v 118'71 (0'0,118'71] local-lis/les=75/76 n=31 ec=49/49 lis/c=75/75 les/c/f=76/76/0 sis=75) [0,6]
2021-03-09T18:53:38.960+0000 7fad8d3ba700 10 merge_log log((0'0,189'278], crt=189'278) from osd.0(0) into log((0'0,147'260], crt=147'260)
2021-03-09T18:53:38.960+0000 7fad893b2700 10 bluestore(/var/lib/ceph/osd/ceph-4) _txc_state_proc txc 0x564f80907880 prepare
2021-03-09T18:53:38.960+0000 7fad893b2700 20 bluestore(/var/lib/ceph/osd/ceph-4) _txc_finish_io 0x564f80907880
2021-03-09T18:53:38.960+0000 7fad8d3ba700 10 merge_log extending head to 189'278
2021-03-09T18:53:38.960+0000 7fad893b2700 10 bluestore(/var/lib/ceph/osd/ceph-4) _txc_state_proc txc 0x564f80907880 io_done
2021-03-09T18:53:38.960+0000 7fad9b523700 10 bluefs _flush 0x564f7344ee40 0xea3de~419 to file(ino 88 size 0xea3de mtime 2021-03-09T18:53:38.961357+0000 allocated f0000 extents [1:0x5f0000~10000,1:0x6d0000~10000,1:0x920000~10000,1:0x990000~20000,1:0x9e0000~50000,1:0xc10000~40000,1:0xc80000~10000])
2021-03-09T18:53:38.960+0000 7fad8d3ba700 20 merge_log cut point (usually last shared) is 0'0
2021-03-09T18:53:38.960+0000 7fad8d3ba700 20 merge_log
pick 9e497c26674 qa/suites: do not use notcmalloc flavor
pick 1c767c210d1 qa/valgrind.supp: suppress free[] warnings
pick b1e7de04072 rgw: fix trailing null in object names of multipart reuploads
pick 4a6b2ea86ef mgr/zabbix: format ceph.[{#POOL},percent_used as float
pick 486bdef847f qa: log-whitelist -> log-ignorelist
pick 2588e7264c8 qa/suites/multimds: log-whitelist -> log-ignorelist
pick 0eacd93b746 qa/suites/krbd/thrash: log-whitelist -> log-ignorelist
pick 11769abb8f2 qa/suites/rados/thrash: log-whitelist -> log-ignorelist
pick 79329d04dc9 ceph-volume: add some flexibility to bytes_to_extents
pick 0a2c5eb0420 ceph-volume: mock VGs with string properties in tests
RESEARCH ON IMPROVING CI/CD PIPELINE
====================================
**Low Hanging Fruits:**
* Do not build external libraries everytime, we do that for Jaeger if it's on
and I see the same for others, all submodules could be having maybe
monthly/weekly build(cron it) which could be reused.
* not sure but we already do clang static analysis
* maybe use ninja builds?
test rollforwarding:
```
diff --git a/src/osd/PGLog.h b/src/osd/PGLog.h
index 2588b4d2fa0..98f94c63279 100644
--- a/src/osd/PGLog.h
+++ b/src/osd/PGLog.h
@@ -154,6 +154,7 @@ public:
// recovery pointers
std::list<pg_log_entry_t>::iterator complete_to; // not inclusive of referenced item
1+0)=8}} trimq=[120~1] ps=[dd~1,125~1]] get_object_context: obc NOT found in cache: 3:5a96ee3e:::smithi05121792-2:72
1+0)=8}} trimq=[120~1] ps=[dd~1,125~1]] get_object_context: no obc for soid 3:5a96ee3e:::smithi05121792-2:72 and !can_create
first time object is unfound
misplaced 6
2021-02-05T04:43:30.812+0000 7f05d9a31700 10 osd.11 pg_epoch: 253 pg[3.1as0( v 243'428 (0'0,243'428] local-lis/les=252/253 n=8 ec=248/27 lis/c=252/248 les/c/f=253/249/0 sis=252) [11,3,6,2,0,9]/[11,3,NONE,2,0,NONE]p11(0) async=[6(2),9(5)] r=0 lpr=252 pi=[248,252)/1 crt=243'428 lcod 0'0 mlcod 0'0 active+recovering+undersized+remapped mbc={0={(1+0)=6},1={(0+1)=6},2={(0+1)=6},3={(0+1)=6},4={(0+1)=6},5={(0+1)=6}} trimq=[c9~1] ps=[c5~1,cd~1]] get_object_context: obc NOT found in cache: 3:58cffdd8:::smithi05121792-42:2e
got unfound object
2021-02-05T04:43:30.812+0000 7f05d9a31700 10 osd.11 pg_epoch: 253 pg[3.1as0( v 243'428 (0'0,243'428] local-lis/les=252/253 n=8 ec=248/27 lis/c=252/248 les/c/f=253/249/0 sis=252) [11,3,6,2,0,9]/[11,
  1. first recovery_unfound on pg[3.1as0:
Primary/Active/Recovering>: recovery has unfound, can't continue
2021-02-05T04:47:15.098+0000 7f05d5a29700  5 osd.11 pg_epoch: 470 pg[3.1as0( v 466'633 (0'0,466'633] local-lis/les=468/469 n=12 ec=248/27 lis/c=468/418 les/c/f=469/419/0 sis=468) [11,1,10,5,NONE,9]p11(0) r=0 lpr=468 pi=[418,468)/1 crt=466'633 lcod 453'629 mlcod 0'0 active+recovering+recovery_unfound+undersized+degraded mbc={0={(1+0)=4},1={(0+0)=4},2={(0+0)=4},3={(1+0)=4},4={(0+0)=4},5={(1+0)=4}} trimq=[f6~1,151~1,171~1]] exit Started/Primary/Active/Recovering 1.138904 5 0.000780

after state transition: Recovering >> UnfoundRecovery >> NotRecovering

  1. we go to recover replicas, with still unfound:
2021-02-05T04:47:15.098+0000 7f05d9a31700 10 osd.11 pg_epoch: 470 pg[3.1as0( v 466'633 (0'0,466'633] local-lis/les=468/469 n=12 ec=248/27 lis/c=468/418 les/c/f=469/419/0 sis=468) [11,1,10,5,NONE,9]p11(0) r=0 lpr=468 pi=[418,468)/1 crt=466'633 lcod 453'629 mlcod 0'0 active+recovering+undersized+

ok testing updated