Skip to content

Instantly share code, notes, and snippets.

@mrdrozdov
Created March 29, 2017 13:39
Show Gist options
  • Save mrdrozdov/fe890e7c2241d2e039a95fd203e7b6ca to your computer and use it in GitHub Desktop.
Save mrdrozdov/fe890e7c2241d2e039a95fd203e7b6ca to your computer and use it in GitHub Desktop.
multinli.log
17-03-25 12:22:02 [1] Flag values:
{ '?': None,
'actively_decay_learning_rate': True,
'batch_size': 32,
'branch_name': 'master',
'bucket_eval': True,
'ckpt_interval_steps': 5000,
'ckpt_on_best_dev_error': True,
'ckpt_path': '/home/dexter/logs/spinn',
'ckpt_step': 1000,
'clipping_max_value': 5.0,
'data_type': 'multisnli',
'debug': False,
'deque_length': None,
'embedding_data_path': '/home/dexter/data/glove/glove.840B.300d.txt',
'embedding_keep_rate': 0.9,
'encode_bidirectional': False,
'encode_num_layers': 1,
'encode_reverse': False,
'encode_style': None,
'eval_data_limit': -1,
'eval_data_path': '/home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl',
'eval_interval_steps': 500,
'eval_report_use_preds': True,
'eval_seq_length': None,
'evalb': False,
'expanded_eval_only_mode': False,
'experiment_name': 'spinn-multisnli-eid_01',
'gen_h': True,
'gpu': 0,
'help': None,
'helpshort': None,
'helpxml': None,
'init_range': 0.005,
'l2_lambda': 2.75e-05,
'lateral_tracking': True,
'learning_rate': 0.0003,
'learning_rate_decay_per_10k_steps': 0.75,
'load_best': False,
'log_path': '/home/dexter/logs/spinn',
'lowercase': False,
'metrics_interval_steps': 10,
'metrics_path': '/home/dexter/logs/spinn-runs',
'mlp_bn': True,
'mlp_dim': 1024,
'model_dim': 600,
'model_type': 'SPINN',
'num_mlp_layers': 2,
'num_samples': 0,
'optimizer_type': 'RMSprop',
'predict_leaf': True,
'predict_use_cell': True,
'rl_baseline': 'ema',
'rl_entropy': False,
'rl_entropy_beta': 0.001,
'rl_epsilon': 1.0,
'rl_epsilon_decay': 50000.0,
'rl_mu': 0.1,
'rl_reward': 'standard',
'rl_weight': 1.0,
'rl_whiten': False,
'semantic_classifier_keep_rate': 0.9,
'seq_length': 500,
'sha': '2bf8089be8b4737c6097cba003f0931b0283242c',
'show_progress_bar': True,
'shuffle_eval': False,
'shuffle_eval_seed': 123,
'smart_batching': True,
'statistics_interval_steps': 100,
'tracking_lstm_hidden_dim': 40,
'training_data_path': '/home/dexter/data/multinli_0.1/multinli_0.1_train.jsonl',
'training_steps': 250000,
'transition_weight': 0.6,
'use_difference_feature': True,
'use_encode': False,
'use_internal_parser': True,
'use_l2_cost': True,
'use_lengths': False,
'use_peano': True,
'use_product_feature': True,
'use_tracking_in_composition': True,
'validate_transitions': True,
'word_embedding_dim': 300,
'write_eval_report': False}
17-03-25 12:22:25 [1] In open vocabulary mode. Using loaded embeddings without fine-tuning.
17-03-25 12:22:25 [1] Constructing vocabulary...
17-03-25 12:22:26 [1] Found 82433 word types.
17-03-25 12:22:39 [1] Loading vocabulary with 73546 words from /home/dexter/data/glove/glove.840B.300d.txt
17-03-25 12:23:07 [1] Preprocessing training data.
17-03-25 12:23:49 [1] Preprocessing eval data: /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl
17-03-25 12:23:55 [1] Building model.
17-03-25 12:23:56 [1] Architecture: BaseModel (
(spinn): SPINN (
(reduce): Reduce (
(left): CustomLinear (300 -> 1500)
(right): CustomLinear (300 -> 1500)
(track): CustomLinear (40 -> 1500)
)
(tracker): Tracker (
(buf): CustomLinear (300 -> 160)
(stack1): CustomLinear (300 -> 160)
(stack2): CustomLinear (300 -> 160)
(lateral): CustomLinear (40 -> 160)
)
(transition_net): Linear (80 -> 2)
)
(mlp): MLP (
(bn_inp): BatchNorm1d(1200, eps=1e-05, momentum=0.1, affine=True)
(l0): CustomLinear (1200 -> 1024)
(bn0): BatchNorm1d(1024, eps=1e-05, momentum=0.1, affine=True)
(l1): CustomLinear (1024 -> 1024)
(bn1): BatchNorm1d(1024, eps=1e-05, momentum=0.1, affine=True)
(l2): CustomLinear (1024 -> 3)
)
(embed): Embed (
(projection): Linear (300 -> 600)
)
)
17-03-25 12:23:56 [1] Total params: 3581817.0
17-03-25 12:24:03 [1]
# ----- BEGIN: Log Configuration ----- #
17-03-25 12:24:03 [1] Flag-JSON: {"eval_seq_length": null, "lowercase": false, "clipping_max_value": 5.0, "use_peano": true, "log_path": "/home/dexter/logs/spinn", "embedding_keep_rate": 0.9, "rl_mu": 0.1, "training_data_path": "/home/dexter/data/multinli_0.1/multinli_0.1_train.jsonl", "use_difference_feature": true, "init_range": 0.005, "evalb": false, "rl_entropy": false, "rl_whiten": false, "show_progress_bar": true, "use_l2_cost": true, "actively_decay_learning_rate": true, "use_encode": false, "encode_style": null, "encode_num_layers": 1, "help": null, "use_lengths": false, "rl_entropy_beta": 0.001, "embedding_data_path": "/home/dexter/data/glove/glove.840B.300d.txt", "write_eval_report": false, "model_dim": 600, "ckpt_on_best_dev_error": true, "deque_length": null, "seq_length": 500, "predict_use_cell": true, "eval_data_limit": -1, "word_embedding_dim": 300, "use_internal_parser": true, "ckpt_path": "/home/dexter/logs/spinn", "expanded_eval_only_mode": false, "eval_report_use_preds": true, "?": null, "helpxml": null, "bucket_eval": true, "semantic_classifier_keep_rate": 0.9, "lateral_tracking": true, "eval_interval_steps": 500, "data_type": "multisnli", "metrics_interval_steps": 10, "helpshort": null, "rl_weight": 1.0, "learning_rate": 0.0003, "metrics_path": "/home/dexter/logs/spinn-runs", "gpu": 0, "batch_size": 32, "use_product_feature": true, "smart_batching": true, "branch_name": "master", "encode_bidirectional": false, "validate_transitions": true, "optimizer_type": "RMSprop", "rl_baseline": "ema", "shuffle_eval": false, "shuffle_eval_seed": 123, "l2_lambda": 2.75e-05, "training_steps": 250000, "debug": false, "gen_h": true, "use_tracking_in_composition": true, "tracking_lstm_hidden_dim": 40, "rl_reward": "standard", "rl_epsilon_decay": 50000.0, "mlp_dim": 1024, "statistics_interval_steps": 100, "predict_leaf": true, "encode_reverse": false, "learning_rate_decay_per_10k_steps": 0.75, "num_mlp_layers": 2, "load_best": false, "sha": "2bf8089be8b4737c6097cba003f0931b0283242c", "experiment_name": "spinn-multisnli-eid_01", "num_samples": 0, "model_type": "SPINN", "ckpt_interval_steps": 5000, "mlp_bn": true, "rl_epsilon": 1.0, "transition_weight": 0.6, "ckpt_step": 1000, "eval_data_path": "/home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl"}
17-03-25 12:24:03 [1] Train-Format: Step: {step} Acc: {class_acc:.5f} {transition_acc:.5f} Cost: {total_cost:.5f} {xent_cost:.5f} {transition_cost:.5f} {l2_cost:.5f} Time: {time:.5f}
17-03-25 12:24:03 [1] Train-Extra-Format: Train Extra: lr={learning_rate:.7f} inv={invalid:.7f} sub={struct:.7f}
17-03-25 12:24:03 [1] Eval-Format: Step: {step} Eval acc: {class_acc:.5f} {transition_acc:.5f} {filename} Time: {time:.5f}
17-03-25 12:24:03 [1] Eval-Extra-Format: Eval Extra: inv={inv:.7f}
17-03-25 12:24:03 [1] # ----- END: Log Configuration ----- #
17-03-25 12:24:03 [1] Training.
17-03-25 12:24:03 [1] Step: 0 Acc: 0.50000 0.65000 Cost: 1.95439 1.32103 0.41352 0.21984 Time: 0.00032
17-03-25 12:24:03 [1] Train Extra: lr=0.0003000 inv=1.0000000 sub=0.0000000
17-03-25 12:25:08 [1] Step: 100 Acc: 0.34250 0.76268 Cost: 2.05374 1.50317 0.32934 0.22123 Time: 0.00064
17-03-25 12:25:08 [1] Train Extra: lr=0.0002991 inv=0.8121875 sub=0.0000000
17-03-25 12:26:23 [1] Step: 200 Acc: 0.37938 0.76642 Cost: 1.85071 1.24511 0.38526 0.22034 Time: 0.00071
17-03-25 12:26:23 [1] Train Extra: lr=0.0002983 inv=0.6070313 sub=0.0000000
17-03-25 12:27:48 [1] Step: 300 Acc: 0.39813 0.76079 Cost: 1.66557 1.09645 0.34982 0.21930 Time: 0.00070
17-03-25 12:27:48 [1] Train Extra: lr=0.0002974 inv=0.4343750 sub=0.0000000
17-03-25 12:29:03 [1] Step: 400 Acc: 0.39313 0.76887 Cost: 1.68176 1.10038 0.36312 0.21826 Time: 0.00070
17-03-25 12:29:03 [1] Train Extra: lr=0.0002966 inv=0.3964063 sub=0.0000000
17-03-25 12:30:19 [1] Step: 500 Acc: 0.39844 0.76578 Cost: 1.83541 1.27153 0.34681 0.21706 Time: 0.00069
17-03-25 12:30:19 [1] Train Extra: lr=0.0002957 inv=0.3464063 sub=0.0000000
17-03-25 12:31:11 [1] Step: 500 Eval acc: 0.39852 0.77655 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 12:31:11 [1] Eval Extra: inv=0.3164753
17-03-25 12:32:26 [1] Step: 600 Acc: 0.40969 0.76364 Cost: 1.61752 1.07206 0.32967 0.21579 Time: 0.00069
17-03-25 12:32:26 [1] Train Extra: lr=0.0002949 inv=0.2776562 sub=0.0000000
17-03-25 12:33:42 [1] Step: 700 Acc: 0.44156 0.76262 Cost: 1.76364 1.19679 0.35248 0.21437 Time: 0.00068
17-03-25 12:33:42 [1] Train Extra: lr=0.0002940 inv=0.3323437 sub=0.0000000
17-03-25 12:34:54 [1] Step: 800 Acc: 0.40813 0.75632 Cost: 1.63475 1.13143 0.29044 0.21288 Time: 0.00065
17-03-25 12:34:54 [1] Train Extra: lr=0.0002932 inv=0.4165625 sub=0.0000000
17-03-25 12:36:15 [1] Step: 900 Acc: 0.43062 0.77212 Cost: 1.68800 1.11302 0.36383 0.21115 Time: 0.00070
17-03-25 12:36:15 [1] Train Extra: lr=0.0002923 inv=0.5193750 sub=0.0000000
17-03-25 12:37:34 [1] Step: 1000 Acc: 0.43250 0.77250 Cost: 1.69298 1.19135 0.29237 0.20926 Time: 0.00071
17-03-25 12:37:34 [1] Train Extra: lr=0.0002915 inv=0.5990625 sub=0.0000000
17-03-25 12:38:26 [1] Step: 1000 Eval acc: 0.46190 0.78542 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 12:38:26 [1] Eval Extra: inv=0.5682420
17-03-25 12:39:48 [1] Step: 1100 Acc: 0.44250 0.77437 Cost: 1.74088 1.21294 0.32066 0.20728 Time: 0.00070
17-03-25 12:39:48 [1] Train Extra: lr=0.0002907 inv=0.6142188 sub=0.0000000
17-03-25 12:41:09 [1] Step: 1200 Acc: 0.45188 0.78035 Cost: 1.52425 1.02425 0.29476 0.20525 Time: 0.00071
17-03-25 12:41:09 [1] Train Extra: lr=0.0002898 inv=0.6270313 sub=0.0000000
17-03-25 12:42:23 [1] Step: 1300 Acc: 0.42625 0.79125 Cost: 1.34814 0.86456 0.28059 0.20298 Time: 0.00072
17-03-25 12:42:23 [1] Train Extra: lr=0.0002890 inv=0.6712500 sub=0.0000000
17-03-25 12:43:42 [1] Step: 1400 Acc: 0.43875 0.78914 Cost: 1.55171 0.99565 0.35552 0.20053 Time: 0.00071
17-03-25 12:43:42 [1] Train Extra: lr=0.0002882 inv=0.7209375 sub=0.0000000
17-03-25 12:44:55 [1] Step: 1500 Acc: 0.44656 0.77754 Cost: 1.78472 1.28337 0.30328 0.19808 Time: 0.00070
17-03-25 12:44:55 [1] Train Extra: lr=0.0002873 inv=0.7131250 sub=0.0000000
17-03-25 12:45:47 [1] Step: 1500 Eval acc: 0.47383 0.79337 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 12:45:47 [1] Eval Extra: inv=0.7365835
17-03-25 12:45:47 [1] Checkpointing with new best dev accuracy of 0.473830
17-03-25 12:46:59 [1] Step: 1600 Acc: 0.45594 0.78119 Cost: 1.44303 0.99603 0.25142 0.19558 Time: 0.00068
17-03-25 12:46:59 [1] Train Extra: lr=0.0002865 inv=0.7526563 sub=0.0000000
17-03-25 12:48:19 [1] Step: 1700 Acc: 0.44844 0.79489 Cost: 1.58261 1.08556 0.30403 0.19302 Time: 0.00072
17-03-25 12:48:19 [1] Train Extra: lr=0.0002857 inv=0.7492188 sub=0.0000000
17-03-25 12:49:30 [1] Step: 1800 Acc: 0.45250 0.79429 Cost: 1.59793 1.10512 0.30239 0.19042 Time: 0.00068
17-03-25 12:49:30 [1] Train Extra: lr=0.0002849 inv=0.7306250 sub=0.0000000
17-03-25 12:50:38 [1] Step: 1900 Acc: 0.48156 0.78956 Cost: 1.64687 1.19953 0.25936 0.18798 Time: 0.00069
17-03-25 12:50:38 [1] Train Extra: lr=0.0002840 inv=0.6548437 sub=0.0000000
17-03-25 12:51:54 [1] Step: 2000 Acc: 0.46750 0.79571 Cost: 1.50675 1.08365 0.23772 0.18538 Time: 0.00071
17-03-25 12:51:54 [1] Train Extra: lr=0.0002832 inv=0.6664062 sub=0.0000000
17-03-25 12:52:47 [1] Step: 2000 Eval acc: 0.50398 0.80907 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 12:52:47 [1] Eval Extra: inv=0.6366497
17-03-25 12:52:47 [1] Checkpointing with new best dev accuracy of 0.503975
17-03-25 12:54:04 [1] Step: 2100 Acc: 0.49281 0.80517 Cost: 1.43064 0.95514 0.29265 0.18284 Time: 0.00069
17-03-25 12:54:04 [1] Train Extra: lr=0.0002824 inv=0.6156250 sub=0.0000000
17-03-25 12:55:12 [1] Step: 2200 Acc: 0.48438 0.80812 Cost: 1.45349 0.99554 0.27766 0.18030 Time: 0.00070
17-03-25 12:55:12 [1] Train Extra: lr=0.0002816 inv=0.5689062 sub=0.0000000
17-03-25 12:56:31 [1] Step: 2300 Acc: 0.49375 0.79922 Cost: 1.56481 1.04250 0.34449 0.17782 Time: 0.00072
17-03-25 12:56:31 [1] Train Extra: lr=0.0002808 inv=0.5881250 sub=0.0000000
17-03-25 12:57:48 [1] Step: 2400 Acc: 0.49312 0.80316 Cost: 1.40881 0.97310 0.26049 0.17521 Time: 0.00068
17-03-25 12:57:48 [1] Train Extra: lr=0.0002800 inv=0.6132812 sub=0.0000000
17-03-25 12:59:08 [1] Step: 2500 Acc: 0.49906 0.80039 Cost: 1.46869 1.11782 0.17823 0.17264 Time: 0.00071
17-03-25 12:59:08 [1] Train Extra: lr=0.0002792 inv=0.5679688 sub=0.0000000
17-03-25 13:00:01 [1] Step: 2500 Eval acc: 0.52142 0.80933 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 13:00:01 [1] Eval Extra: inv=0.5692911
17-03-25 13:00:01 [1] Checkpointing with new best dev accuracy of 0.521422
17-03-25 13:01:26 [1] Step: 2600 Acc: 0.50281 0.81646 Cost: 1.63090 1.14184 0.31891 0.17015 Time: 0.00072
17-03-25 13:01:26 [1] Train Extra: lr=0.0002784 inv=0.5957813 sub=0.0000000
17-03-25 13:02:45 [1] Step: 2700 Acc: 0.48000 0.80180 Cost: 1.51136 1.03261 0.31118 0.16757 Time: 0.00070
17-03-25 13:02:45 [1] Train Extra: lr=0.0002776 inv=0.5890625 sub=0.0000000
17-03-25 13:03:59 [1] Step: 2800 Acc: 0.50906 0.80804 Cost: 1.33601 0.90909 0.26182 0.16510 Time: 0.00070
17-03-25 13:03:59 [1] Train Extra: lr=0.0002768 inv=0.5420312 sub=0.0000000
17-03-25 13:05:16 [1] Step: 2900 Acc: 0.50938 0.80636 Cost: 1.33899 0.87865 0.29759 0.16275 Time: 0.00071
17-03-25 13:05:16 [1] Train Extra: lr=0.0002760 inv=0.5462500 sub=0.0000000
17-03-25 13:06:33 [1] Step: 3000 Acc: 0.53094 0.81199 Cost: 1.49967 1.00565 0.33348 0.16054 Time: 0.00072
17-03-25 13:06:33 [1] Train Extra: lr=0.0002752 inv=0.5117188 sub=0.0000000
17-03-25 13:07:27 [1] Step: 3000 Eval acc: 0.54042 0.81654 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 13:07:27 [1] Eval Extra: inv=0.5187721
17-03-25 13:07:27 [1] Checkpointing with new best dev accuracy of 0.540415
17-03-25 13:08:41 [1] Step: 3100 Acc: 0.50125 0.81180 Cost: 1.28894 0.86797 0.26257 0.15840 Time: 0.00070
17-03-25 13:08:41 [1] Train Extra: lr=0.0002744 inv=0.5264062 sub=0.0000000
17-03-25 13:09:59 [1] Step: 3200 Acc: 0.51969 0.81173 Cost: 1.33330 0.91302 0.26406 0.15622 Time: 0.00069
17-03-25 13:09:59 [1] Train Extra: lr=0.0002736 inv=0.5468750 sub=0.0000000
17-03-25 13:11:26 [1] Step: 3300 Acc: 0.53187 0.82178 Cost: 1.42956 0.95605 0.31939 0.15412 Time: 0.00075
17-03-25 13:11:26 [1] Train Extra: lr=0.0002728 inv=0.5354688 sub=0.0000000
17-03-25 13:12:36 [1] Step: 3400 Acc: 0.52719 0.81252 Cost: 1.40823 0.98658 0.26959 0.15206 Time: 0.00068
17-03-25 13:12:36 [1] Train Extra: lr=0.0002720 inv=0.5142187 sub=0.0000000
17-03-25 13:13:58 [1] Step: 3500 Acc: 0.52156 0.81686 Cost: 1.32857 0.96513 0.21345 0.14999 Time: 0.00074
17-03-25 13:13:58 [1] Train Extra: lr=0.0002713 inv=0.4973437 sub=0.0000000
17-03-25 13:14:53 [1] Step: 3500 Eval acc: 0.54770 0.82634 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 13:14:53 [1] Eval Extra: inv=0.5067359
17-03-25 13:14:53 [1] Checkpointing with new best dev accuracy of 0.547703
17-03-25 13:16:06 [1] Step: 3600 Acc: 0.52719 0.81069 Cost: 1.35194 0.97433 0.22955 0.14806 Time: 0.00070
17-03-25 13:16:06 [1] Train Extra: lr=0.0002705 inv=0.5070313 sub=0.0000000
17-03-25 13:17:28 [1] Step: 3700 Acc: 0.52625 0.82616 Cost: 1.39095 1.03218 0.21255 0.14622 Time: 0.00074
17-03-25 13:17:28 [1] Train Extra: lr=0.0002697 inv=0.5184375 sub=0.0000000
17-03-25 13:18:44 [1] Step: 3800 Acc: 0.53031 0.81177 Cost: 1.20582 0.77464 0.28681 0.14436 Time: 0.00068
17-03-25 13:18:44 [1] Train Extra: lr=0.0002689 inv=0.5135937 sub=0.0000000
17-03-25 13:20:06 [1] Step: 3900 Acc: 0.52531 0.81840 Cost: 1.30600 0.86957 0.29397 0.14246 Time: 0.00073
17-03-25 13:20:06 [1] Train Extra: lr=0.0002682 inv=0.5006250 sub=0.0000000
17-03-25 13:21:17 [1] Step: 4000 Acc: 0.53469 0.82776 Cost: 1.31233 0.92686 0.24480 0.14067 Time: 0.00070
17-03-25 13:21:17 [1] Train Extra: lr=0.0002674 inv=0.4832812 sub=0.0000000
17-03-25 13:22:10 [1] Step: 4000 Eval acc: 0.54826 0.82175 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 13:22:10 [1] Eval Extra: inv=0.5276060
17-03-25 13:23:38 [1] Step: 4100 Acc: 0.53219 0.82407 Cost: 1.28495 0.91747 0.22866 0.13882 Time: 0.00074
17-03-25 13:23:38 [1] Train Extra: lr=0.0002666 inv=0.5520312 sub=0.0000000
17-03-25 13:24:59 [1] Step: 4200 Acc: 0.51062 0.82012 Cost: 1.33435 0.93627 0.26084 0.13724 Time: 0.00072
17-03-25 13:24:59 [1] Train Extra: lr=0.0002659 inv=0.5201562 sub=0.0000000
17-03-25 13:26:12 [1] Step: 4300 Acc: 0.54031 0.81816 Cost: 1.37156 0.97221 0.26376 0.13559 Time: 0.00070
17-03-25 13:26:12 [1] Train Extra: lr=0.0002651 inv=0.4826563 sub=0.0000000
17-03-25 13:27:23 [1] Step: 4400 Acc: 0.54375 0.82293 Cost: 1.12873 0.81513 0.17942 0.13418 Time: 0.00068
17-03-25 13:27:23 [1] Train Extra: lr=0.0002643 inv=0.4870313 sub=0.0000000
17-03-25 13:28:43 [1] Step: 4500 Acc: 0.52687 0.82540 Cost: 1.28448 0.85781 0.29397 0.13271 Time: 0.00073
17-03-25 13:28:43 [1] Train Extra: lr=0.0002636 inv=0.4910937 sub=0.0000000
17-03-25 13:29:35 [1] Step: 4500 Eval acc: 0.56383 0.83190 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 13:29:35 [1] Eval Extra: inv=0.4605234
17-03-25 13:29:35 [1] Checkpointing with new best dev accuracy of 0.563825
17-03-25 13:30:55 [1] Step: 4600 Acc: 0.54719 0.83723 Cost: 1.28876 0.86712 0.29043 0.13121 Time: 0.00075
17-03-25 13:30:55 [1] Train Extra: lr=0.0002628 inv=0.5028125 sub=0.0000000
17-03-25 13:32:12 [1] Step: 4700 Acc: 0.53531 0.83128 Cost: 1.40526 0.93857 0.33691 0.12978 Time: 0.00072
17-03-25 13:32:12 [1] Train Extra: lr=0.0002621 inv=0.4978125 sub=0.0000000
17-03-25 13:33:33 [1] Step: 4800 Acc: 0.55188 0.81845 Cost: 1.05536 0.72330 0.20349 0.12857 Time: 0.00071
17-03-25 13:33:33 [1] Train Extra: lr=0.0002613 inv=0.5167188 sub=0.0000000
17-03-25 13:34:54 [1] Step: 4900 Acc: 0.50813 0.82556 Cost: 1.31892 0.93434 0.25745 0.12713 Time: 0.00072
17-03-25 13:34:54 [1] Train Extra: lr=0.0002606 inv=0.5117188 sub=0.0000000
17-03-25 13:36:26 [1] Step: 5000 Acc: 0.52750 0.83769 Cost: 1.41080 0.99121 0.29372 0.12587 Time: 0.00076
17-03-25 13:36:26 [1] Train Extra: lr=0.0002598 inv=0.5098438 sub=0.0000000
17-03-25 13:37:18 [1] Step: 5000 Eval acc: 0.57586 0.82880 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 13:37:18 [1] Eval Extra: inv=0.4653269
17-03-25 13:37:18 [1] Checkpointing with new best dev accuracy of 0.575861
17-03-25 13:37:18 [1] Checkpointing.
17-03-25 13:38:32 [1] Step: 5100 Acc: 0.54125 0.83089 Cost: 1.37911 1.02012 0.23426 0.12473 Time: 0.00070
17-03-25 13:38:32 [1] Train Extra: lr=0.0002591 inv=0.4720313 sub=0.0000000
17-03-25 13:39:41 [1] Step: 5200 Acc: 0.54063 0.81876 Cost: 1.19172 0.84117 0.22704 0.12351 Time: 0.00066
17-03-25 13:39:41 [1] Train Extra: lr=0.0002583 inv=0.4775000 sub=0.0000000
17-03-25 13:41:05 [1] Step: 5300 Acc: 0.53500 0.82345 Cost: 1.41942 1.04003 0.25704 0.12235 Time: 0.00072
17-03-25 13:41:05 [1] Train Extra: lr=0.0002576 inv=0.5364062 sub=0.0000000
17-03-25 13:42:12 [1] Step: 5400 Acc: 0.54844 0.82811 Cost: 1.43800 1.00371 0.31306 0.12123 Time: 0.00068
17-03-25 13:42:12 [1] Train Extra: lr=0.0002568 inv=0.4640625 sub=0.0000000
17-03-25 13:43:29 [1] Step: 5500 Acc: 0.53469 0.82179 Cost: 1.58373 1.19146 0.27217 0.12010 Time: 0.00069
17-03-25 13:43:29 [1] Train Extra: lr=0.0002561 inv=0.4895312 sub=0.0000000
17-03-25 13:44:22 [1] Step: 5500 Eval acc: 0.58138 0.83132 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 13:44:22 [1] Eval Extra: inv=0.4731117
17-03-25 13:44:22 [1] Checkpointing with new best dev accuracy of 0.581383
17-03-25 13:45:47 [1] Step: 5600 Acc: 0.54438 0.82182 Cost: 1.19728 0.80368 0.27465 0.11895 Time: 0.00071
17-03-25 13:45:47 [1] Train Extra: lr=0.0002554 inv=0.5189063 sub=0.0000000
17-03-25 13:47:04 [1] Step: 5700 Acc: 0.54125 0.83132 Cost: 1.29913 0.98867 0.19260 0.11786 Time: 0.00070
17-03-25 13:47:04 [1] Train Extra: lr=0.0002546 inv=0.4851563 sub=0.0000000
17-03-25 13:48:25 [1] Step: 5800 Acc: 0.54281 0.82683 Cost: 1.51340 1.05847 0.33816 0.11677 Time: 0.00071
17-03-25 13:48:25 [1] Train Extra: lr=0.0002539 inv=0.5115625 sub=0.0000000
17-03-25 13:49:39 [1] Step: 5900 Acc: 0.53969 0.82252 Cost: 1.31163 1.07745 0.11846 0.11572 Time: 0.00069
17-03-25 13:49:39 [1] Train Extra: lr=0.0002532 inv=0.4742188 sub=0.0000000
17-03-25 13:50:58 [1] Step: 6000 Acc: 0.55750 0.82029 Cost: 1.26885 0.96800 0.18607 0.11478 Time: 0.00070
17-03-25 13:50:58 [1] Train Extra: lr=0.0002524 inv=0.5196875 sub=0.0000000
17-03-25 13:51:50 [1] Step: 6000 Eval acc: 0.57277 0.82847 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 13:51:50 [1] Eval Extra: inv=0.4921047
17-03-25 13:53:04 [1] Step: 6100 Acc: 0.55625 0.82875 Cost: 1.48307 1.08026 0.28900 0.11381 Time: 0.00069
17-03-25 13:53:04 [1] Train Extra: lr=0.0002517 inv=0.4790625 sub=0.0000000
17-03-25 13:54:12 [1] Step: 6200 Acc: 0.54906 0.82755 Cost: 1.26537 0.91958 0.23293 0.11285 Time: 0.00067
17-03-25 13:54:12 [1] Train Extra: lr=0.0002510 inv=0.4682812 sub=0.0000000
17-03-25 13:55:36 [1] Step: 6300 Acc: 0.55531 0.83015 Cost: 1.23105 0.89721 0.22187 0.11197 Time: 0.00073
17-03-25 13:55:36 [1] Train Extra: lr=0.0002503 inv=0.4721875 sub=0.0000000
17-03-25 13:56:45 [1] Step: 6400 Acc: 0.57500 0.83111 Cost: 1.27325 0.87487 0.28733 0.11106 Time: 0.00068
17-03-25 13:56:45 [1] Train Extra: lr=0.0002496 inv=0.4707812 sub=0.0000000
17-03-25 13:58:12 [1] Step: 6500 Acc: 0.55812 0.82992 Cost: 1.41617 1.08346 0.22257 0.11013 Time: 0.00073
17-03-25 13:58:12 [1] Train Extra: lr=0.0002488 inv=0.5106250 sub=0.0000000
17-03-25 13:59:03 [1] Step: 6500 Eval acc: 0.58392 0.83479 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 13:59:03 [1] Eval Extra: inv=0.4606890
17-03-25 14:00:19 [1] Step: 6600 Acc: 0.53844 0.82657 Cost: 1.33639 0.93708 0.29006 0.10925 Time: 0.00068
17-03-25 14:00:19 [1] Train Extra: lr=0.0002481 inv=0.4895312 sub=0.0000000
17-03-25 14:01:30 [1] Step: 6700 Acc: 0.56219 0.83188 Cost: 1.56817 1.18393 0.27578 0.10845 Time: 0.00070
17-03-25 14:01:30 [1] Train Extra: lr=0.0002474 inv=0.4667188 sub=0.0000000
17-03-25 14:02:46 [1] Step: 6800 Acc: 0.57406 0.82084 Cost: 1.28163 0.93774 0.23623 0.10765 Time: 0.00068
17-03-25 14:02:46 [1] Train Extra: lr=0.0002467 inv=0.5217188 sub=0.0000000
17-03-25 14:04:06 [1] Step: 6900 Acc: 0.55812 0.82445 Cost: 1.11439 0.73499 0.27265 0.10675 Time: 0.00070
17-03-25 14:04:06 [1] Train Extra: lr=0.0002460 inv=0.4726562 sub=0.0000000
17-03-25 14:05:20 [1] Step: 7000 Acc: 0.56875 0.82297 Cost: 1.15830 0.75822 0.29404 0.10604 Time: 0.00070
17-03-25 14:05:20 [1] Train Extra: lr=0.0002453 inv=0.4504687 sub=0.0000000
17-03-25 14:06:12 [1] Step: 7000 Eval acc: 0.59607 0.83462 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 14:06:12 [1] Eval Extra: inv=0.4767005
17-03-25 14:06:12 [1] Checkpointing with new best dev accuracy of 0.596069
17-03-25 14:07:32 [1] Step: 7100 Acc: 0.56688 0.83089 Cost: 0.91995 0.69856 0.11607 0.10532 Time: 0.00071
17-03-25 14:07:32 [1] Train Extra: lr=0.0002446 inv=0.4789062 sub=0.0000000
17-03-25 14:08:57 [1] Step: 7200 Acc: 0.55437 0.82526 Cost: 1.42132 0.95114 0.36559 0.10460 Time: 0.00072
17-03-25 14:08:57 [1] Train Extra: lr=0.0002439 inv=0.4917187 sub=0.0000000
17-03-25 14:10:05 [1] Step: 7300 Acc: 0.56250 0.82608 Cost: 1.40106 1.06529 0.23189 0.10388 Time: 0.00067
17-03-25 14:10:05 [1] Train Extra: lr=0.0002432 inv=0.4296875 sub=0.0000000
17-03-25 14:11:31 [1] Step: 7400 Acc: 0.57656 0.84186 Cost: 1.39486 1.00127 0.29035 0.10325 Time: 0.00074
17-03-25 14:11:31 [1] Train Extra: lr=0.0002425 inv=0.4718750 sub=0.0000000
17-03-25 14:12:41 [1] Step: 7500 Acc: 0.56969 0.82966 Cost: 0.93702 0.70615 0.12813 0.10274 Time: 0.00066
17-03-25 14:12:41 [1] Train Extra: lr=0.0002418 inv=0.4593750 sub=0.0000000
17-03-25 14:13:36 [1] Step: 7500 Eval acc: 0.59000 0.83091 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 14:13:36 [1] Eval Extra: inv=0.5249006
17-03-25 14:14:46 [1] Step: 7600 Acc: 0.56563 0.82514 Cost: 1.07348 0.75685 0.21447 0.10216 Time: 0.00067
17-03-25 14:14:46 [1] Train Extra: lr=0.0002411 inv=0.4562500 sub=0.0000000
17-03-25 14:16:10 [1] Step: 7700 Acc: 0.57531 0.82819 Cost: 1.19868 0.79518 0.30203 0.10147 Time: 0.00074
17-03-25 14:16:10 [1] Train Extra: lr=0.0002404 inv=0.4737500 sub=0.0000000
17-03-25 14:17:32 [1] Step: 7800 Acc: 0.57125 0.82875 Cost: 1.43753 1.06581 0.27078 0.10093 Time: 0.00070
17-03-25 14:17:32 [1] Train Extra: lr=0.0002397 inv=0.4689063 sub=0.0000000
17-03-25 14:18:52 [1] Step: 7900 Acc: 0.56531 0.82769 Cost: 1.20441 0.86838 0.23573 0.10030 Time: 0.00070
17-03-25 14:18:52 [1] Train Extra: lr=0.0002390 inv=0.4743750 sub=0.0000000
17-03-25 14:20:05 [1] Step: 8000 Acc: 0.57250 0.83001 Cost: 1.20348 0.81902 0.28472 0.09974 Time: 0.00068
17-03-25 14:20:05 [1] Train Extra: lr=0.0002383 inv=0.4612500 sub=0.0000000
17-03-25 14:20:56 [1] Step: 8000 Eval acc: 0.60071 0.83716 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 14:20:56 [1] Eval Extra: inv=0.4272306
17-03-25 14:20:56 [1] Checkpointing with new best dev accuracy of 0.600707
17-03-25 14:22:12 [1] Step: 8100 Acc: 0.57125 0.82654 Cost: 1.24109 0.99324 0.14869 0.09916 Time: 0.00067
17-03-25 14:22:12 [1] Train Extra: lr=0.0002376 inv=0.4381250 sub=0.0000000
17-03-25 14:23:30 [1] Step: 8200 Acc: 0.56781 0.82704 Cost: 1.28042 0.89977 0.28199 0.09866 Time: 0.00071
17-03-25 14:23:30 [1] Train Extra: lr=0.0002370 inv=0.4596875 sub=0.0000000
17-03-25 14:24:58 [1] Step: 8300 Acc: 0.57594 0.83302 Cost: 0.98129 0.73582 0.14726 0.09820 Time: 0.00073
17-03-25 14:24:58 [1] Train Extra: lr=0.0002363 inv=0.4573437 sub=0.0000000
17-03-25 14:26:26 [1] Step: 8400 Acc: 0.57531 0.83755 Cost: 0.96195 0.77773 0.08657 0.09765 Time: 0.00076
17-03-25 14:26:26 [1] Train Extra: lr=0.0002356 inv=0.4212500 sub=0.0000000
17-03-25 14:27:42 [1] Step: 8500 Acc: 0.58344 0.83287 Cost: 1.12668 0.84423 0.18532 0.09714 Time: 0.00070
17-03-25 14:27:42 [1] Train Extra: lr=0.0002349 inv=0.4382813 sub=0.0000000
17-03-25 14:28:34 [1] Step: 8500 Eval acc: 0.60049 0.83938 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 14:28:34 [1] Eval Extra: inv=0.4383834
17-03-25 14:30:01 [1] Step: 8600 Acc: 0.56469 0.83629 Cost: 1.20064 0.86644 0.23750 0.09669 Time: 0.00074
17-03-25 14:30:01 [1] Train Extra: lr=0.0002342 inv=0.4257812 sub=0.0000000
17-03-25 14:31:13 [1] Step: 8700 Acc: 0.56375 0.82893 Cost: 1.30507 0.98324 0.22562 0.09621 Time: 0.00068
17-03-25 14:31:13 [1] Train Extra: lr=0.0002336 inv=0.4700000 sub=0.0000000
17-03-25 14:32:34 [1] Step: 8800 Acc: 0.57281 0.83086 Cost: 1.18475 0.84177 0.24725 0.09573 Time: 0.00071
17-03-25 14:32:34 [1] Train Extra: lr=0.0002329 inv=0.4390625 sub=0.0000000
17-03-25 14:33:43 [1] Step: 8900 Acc: 0.58594 0.82617 Cost: 1.32555 0.94779 0.28243 0.09533 Time: 0.00067
17-03-25 14:33:43 [1] Train Extra: lr=0.0002322 inv=0.4560938 sub=0.0000000
17-03-25 14:34:57 [1] Step: 9000 Acc: 0.59750 0.82338 Cost: 1.15815 0.82514 0.23809 0.09492 Time: 0.00069
17-03-25 14:34:57 [1] Train Extra: lr=0.0002316 inv=0.4601562 sub=0.0000000
17-03-25 14:35:48 [1] Step: 9000 Eval acc: 0.60512 0.83511 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 14:35:48 [1] Eval Extra: inv=0.4458370
17-03-25 14:35:48 [1] Checkpointing with new best dev accuracy of 0.605124
17-03-25 14:37:04 [1] Step: 9100 Acc: 0.58531 0.83253 Cost: 1.13969 0.90025 0.14498 0.09446 Time: 0.00069
17-03-25 14:37:04 [1] Train Extra: lr=0.0002309 inv=0.4457813 sub=0.0000000
17-03-25 14:38:28 [1] Step: 9200 Acc: 0.56812 0.84133 Cost: 0.87891 0.68839 0.09655 0.09397 Time: 0.00075
17-03-25 14:38:28 [1] Train Extra: lr=0.0002302 inv=0.4429688 sub=0.0000000
17-03-25 14:39:44 [1] Step: 9300 Acc: 0.59156 0.83117 Cost: 1.21511 0.90635 0.21522 0.09354 Time: 0.00070
17-03-25 14:39:44 [1] Train Extra: lr=0.0002296 inv=0.4328125 sub=0.0000000
17-03-25 14:41:04 [1] Step: 9400 Acc: 0.57906 0.84077 Cost: 1.20663 0.90392 0.20947 0.09324 Time: 0.00072
17-03-25 14:41:04 [1] Train Extra: lr=0.0002289 inv=0.4365625 sub=0.0000000
17-03-25 14:42:24 [1] Step: 9500 Acc: 0.58406 0.83332 Cost: 1.26765 0.99507 0.17976 0.09282 Time: 0.00072
17-03-25 14:42:24 [1] Train Extra: lr=0.0002283 inv=0.4159375 sub=0.0000000
17-03-25 14:43:15 [1] Step: 9500 Eval acc: 0.60645 0.84093 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 14:43:15 [1] Eval Extra: inv=0.4302672
17-03-25 14:44:27 [1] Step: 9600 Acc: 0.58437 0.83004 Cost: 1.25321 0.91010 0.25068 0.09244 Time: 0.00068
17-03-25 14:44:27 [1] Train Extra: lr=0.0002276 inv=0.4234375 sub=0.0000000
17-03-25 14:45:40 [1] Step: 9700 Acc: 0.57531 0.82806 Cost: 1.14660 0.77704 0.27749 0.09208 Time: 0.00068
17-03-25 14:45:40 [1] Train Extra: lr=0.0002270 inv=0.4234375 sub=0.0000000
17-03-25 14:47:00 [1] Step: 9800 Acc: 0.58656 0.83704 Cost: 1.35758 0.99430 0.27152 0.09176 Time: 0.00071
17-03-25 14:47:00 [1] Train Extra: lr=0.0002263 inv=0.4193750 sub=0.0000000
17-03-25 14:48:15 [1] Step: 9900 Acc: 0.58375 0.82916 Cost: 1.17797 0.83512 0.25136 0.09148 Time: 0.00068
17-03-25 14:48:15 [1] Train Extra: lr=0.0002256 inv=0.4395312 sub=0.0000000
17-03-25 14:49:28 [1] Step: 10000 Acc: 0.58031 0.83275 Cost: 1.09271 0.86589 0.13566 0.09116 Time: 0.00069
17-03-25 14:49:28 [1] Train Extra: lr=0.0002250 inv=0.4281250 sub=0.0000000
17-03-25 14:50:20 [1] Step: 10000 Eval acc: 0.60976 0.83074 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 14:50:20 [1] Eval Extra: inv=0.4532906
17-03-25 14:50:20 [1] Checkpointing with new best dev accuracy of 0.609761
17-03-25 14:50:20 [1] Checkpointing.
17-03-25 14:51:48 [1] Step: 10100 Acc: 0.57375 0.83282 Cost: 1.34719 0.95563 0.30079 0.09077 Time: 0.00071
17-03-25 14:51:48 [1] Train Extra: lr=0.0002244 inv=0.4709375 sub=0.0000000
17-03-25 14:53:00 [1] Step: 10200 Acc: 0.59906 0.83190 Cost: 1.41283 1.06191 0.26041 0.09051 Time: 0.00070
17-03-25 14:53:00 [1] Train Extra: lr=0.0002237 inv=0.4237500 sub=0.0000000
17-03-25 14:54:20 [1] Step: 10300 Acc: 0.57594 0.83108 Cost: 1.49033 1.04147 0.35868 0.09018 Time: 0.00069
17-03-25 14:54:20 [1] Train Extra: lr=0.0002231 inv=0.4387500 sub=0.0000000
17-03-25 14:55:35 [1] Step: 10400 Acc: 0.59000 0.82648 Cost: 1.32482 0.94919 0.28588 0.08975 Time: 0.00068
17-03-25 14:55:35 [1] Train Extra: lr=0.0002224 inv=0.4426562 sub=0.0000000
17-03-25 14:56:45 [1] Step: 10500 Acc: 0.58906 0.83865 Cost: 1.18514 0.87316 0.22256 0.08941 Time: 0.00071
17-03-25 14:56:45 [1] Train Extra: lr=0.0002218 inv=0.4037500 sub=0.0000000
17-03-25 14:57:34 [1] Step: 10500 Eval acc: 0.60877 0.84358 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00014
17-03-25 14:57:34 [1] Eval Extra: inv=0.3812390
17-03-25 14:59:03 [1] Step: 10600 Acc: 0.57437 0.84170 Cost: 1.32841 0.95266 0.28675 0.08900 Time: 0.00076
17-03-25 14:59:03 [1] Train Extra: lr=0.0002211 inv=0.4220313 sub=0.0000000
17-03-25 15:00:17 [1] Step: 10700 Acc: 0.57750 0.82476 Cost: 1.16177 0.85485 0.21816 0.08877 Time: 0.00067
17-03-25 15:00:17 [1] Train Extra: lr=0.0002205 inv=0.4712500 sub=0.0000000
17-03-25 15:01:32 [1] Step: 10800 Acc: 0.58156 0.83272 Cost: 1.45574 0.98032 0.38695 0.08846 Time: 0.00068
17-03-25 15:01:32 [1] Train Extra: lr=0.0002199 inv=0.4371875 sub=0.0000000
17-03-25 15:02:37 [1] Step: 10900 Acc: 0.59250 0.83309 Cost: 1.12985 0.85524 0.18652 0.08810 Time: 0.00067
17-03-25 15:02:37 [1] Train Extra: lr=0.0002192 inv=0.3979687 sub=0.0000000
17-03-25 15:03:55 [1] Step: 11000 Acc: 0.58000 0.82674 Cost: 1.21822 0.86807 0.26236 0.08780 Time: 0.00068
17-03-25 15:03:55 [1] Train Extra: lr=0.0002186 inv=0.4179688 sub=0.0000000
17-03-25 15:04:47 [1] Step: 11000 Eval acc: 0.61871 0.83488 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 15:04:47 [1] Eval Extra: inv=0.4191696
17-03-25 15:04:47 [1] Checkpointing with new best dev accuracy of 0.618706
17-03-25 15:06:07 [1] Step: 11100 Acc: 0.59062 0.83795 Cost: 1.09128 0.75738 0.24635 0.08755 Time: 0.00073
17-03-25 15:06:07 [1] Train Extra: lr=0.0002180 inv=0.4214062 sub=0.0000000
17-03-25 15:07:21 [1] Step: 11200 Acc: 0.57250 0.83281 Cost: 1.42421 1.10584 0.23112 0.08725 Time: 0.00068
17-03-25 15:07:21 [1] Train Extra: lr=0.0002174 inv=0.4193750 sub=0.0000000
17-03-25 15:08:29 [1] Step: 11300 Acc: 0.59437 0.83095 Cost: 1.12518 0.76563 0.27251 0.08704 Time: 0.00067
17-03-25 15:08:29 [1] Train Extra: lr=0.0002167 inv=0.4028125 sub=0.0000000
17-03-25 15:09:41 [1] Step: 11400 Acc: 0.59469 0.83378 Cost: 1.29369 0.92014 0.28685 0.08670 Time: 0.00069
17-03-25 15:09:41 [1] Train Extra: lr=0.0002161 inv=0.4193750 sub=0.0000000
17-03-25 15:11:07 [1] Step: 11500 Acc: 0.58562 0.84260 Cost: 1.29932 0.89230 0.32058 0.08644 Time: 0.00073
17-03-25 15:11:07 [1] Train Extra: lr=0.0002155 inv=0.4259375 sub=0.0000000
17-03-25 15:11:58 [1] Step: 11500 Eval acc: 0.61484 0.84091 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 15:11:58 [1] Eval Extra: inv=0.4279483
17-03-25 15:13:06 [1] Step: 11600 Acc: 0.60562 0.83800 Cost: 1.16347 0.81654 0.26073 0.08621 Time: 0.00068
17-03-25 15:13:06 [1] Train Extra: lr=0.0002149 inv=0.3943750 sub=0.0000000
17-03-25 15:14:22 [1] Step: 11700 Acc: 0.59156 0.82888 Cost: 1.24191 0.87557 0.28031 0.08603 Time: 0.00068
17-03-25 15:14:22 [1] Train Extra: lr=0.0002143 inv=0.4315625 sub=0.0000000
17-03-25 15:15:41 [1] Step: 11800 Acc: 0.58813 0.83109 Cost: 1.21826 1.00518 0.12724 0.08584 Time: 0.00070
17-03-25 15:15:41 [1] Train Extra: lr=0.0002136 inv=0.4484375 sub=0.0000000
17-03-25 15:17:07 [1] Step: 11900 Acc: 0.58844 0.84062 Cost: 1.06757 0.83167 0.15037 0.08552 Time: 0.00073
17-03-25 15:17:07 [1] Train Extra: lr=0.0002130 inv=0.4379688 sub=0.0000000
17-03-25 15:18:20 [1] Step: 12000 Acc: 0.57875 0.83226 Cost: 1.14466 0.87095 0.18839 0.08532 Time: 0.00068
17-03-25 15:18:20 [1] Train Extra: lr=0.0002124 inv=0.4118750 sub=0.0000000
17-03-25 15:19:09 [1] Step: 12000 Eval acc: 0.62025 0.84254 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00014
17-03-25 15:19:09 [1] Eval Extra: inv=0.3885822
17-03-25 15:20:21 [1] Step: 12100 Acc: 0.60469 0.82920 Cost: 1.12867 0.90348 0.14005 0.08514 Time: 0.00067
17-03-25 15:20:21 [1] Train Extra: lr=0.0002118 inv=0.4364062 sub=0.0000000
17-03-25 15:21:42 [1] Step: 12200 Acc: 0.59594 0.82837 Cost: 1.06250 0.87622 0.10137 0.08490 Time: 0.00070
17-03-25 15:21:42 [1] Train Extra: lr=0.0002112 inv=0.4628125 sub=0.0000000
17-03-25 15:22:51 [1] Step: 12300 Acc: 0.61594 0.84394 Cost: 1.12950 0.83380 0.21096 0.08474 Time: 0.00069
17-03-25 15:22:51 [1] Train Extra: lr=0.0002106 inv=0.4075000 sub=0.0000000
17-03-25 15:24:10 [1] Step: 12400 Acc: 0.58437 0.83830 Cost: 1.17165 0.87296 0.21412 0.08457 Time: 0.00072
17-03-25 15:24:10 [1] Train Extra: lr=0.0002100 inv=0.4028125 sub=0.0000000
17-03-25 15:25:25 [1] Step: 12500 Acc: 0.57969 0.83277 Cost: 1.06882 0.78233 0.20221 0.08428 Time: 0.00068
17-03-25 15:25:25 [1] Train Extra: lr=0.0002094 inv=0.4315625 sub=0.0000000
17-03-25 15:26:16 [1] Step: 12500 Eval acc: 0.61440 0.84210 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 15:26:16 [1] Eval Extra: inv=0.4304329
17-03-25 15:27:34 [1] Step: 12600 Acc: 0.59437 0.83696 Cost: 1.22966 0.97511 0.17045 0.08410 Time: 0.00072
17-03-25 15:27:34 [1] Train Extra: lr=0.0002088 inv=0.4064063 sub=0.0000000
17-03-25 15:28:47 [1] Step: 12700 Acc: 0.59813 0.83535 Cost: 0.85378 0.61798 0.15197 0.08383 Time: 0.00068
17-03-25 15:28:47 [1] Train Extra: lr=0.0002082 inv=0.4120313 sub=0.0000000
17-03-25 15:30:06 [1] Step: 12800 Acc: 0.59656 0.84152 Cost: 1.32792 0.86621 0.37808 0.08362 Time: 0.00072
17-03-25 15:30:06 [1] Train Extra: lr=0.0002076 inv=0.4170313 sub=0.0000000
17-03-25 15:31:20 [1] Step: 12900 Acc: 0.59906 0.83175 Cost: 1.11603 0.80512 0.22746 0.08345 Time: 0.00068
17-03-25 15:31:20 [1] Train Extra: lr=0.0002070 inv=0.4257812 sub=0.0000000
17-03-25 15:32:38 [1] Step: 13000 Acc: 0.61031 0.84778 Cost: 1.31657 0.98797 0.24532 0.08328 Time: 0.00071
17-03-25 15:32:38 [1] Train Extra: lr=0.0002064 inv=0.3970313 sub=0.0000000
17-03-25 15:33:32 [1] Step: 13000 Eval acc: 0.62114 0.83131 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 15:33:32 [1] Eval Extra: inv=0.4674801
17-03-25 15:34:46 [1] Step: 13100 Acc: 0.59531 0.83543 Cost: 1.03306 0.83837 0.11164 0.08305 Time: 0.00068
17-03-25 15:34:46 [1] Train Extra: lr=0.0002058 inv=0.4292187 sub=0.0000000
17-03-25 15:36:00 [1] Step: 13200 Acc: 0.60469 0.83635 Cost: 1.22594 0.97516 0.16790 0.08288 Time: 0.00069
17-03-25 15:36:00 [1] Train Extra: lr=0.0002052 inv=0.4082812 sub=0.0000000
17-03-25 15:37:27 [1] Step: 13300 Acc: 0.59313 0.83587 Cost: 1.19738 0.83430 0.28042 0.08266 Time: 0.00072
17-03-25 15:37:27 [1] Train Extra: lr=0.0002046 inv=0.4542188 sub=0.0000000
17-03-25 15:38:55 [1] Step: 13400 Acc: 0.59031 0.84141 Cost: 1.06632 0.78383 0.19997 0.08252 Time: 0.00072
17-03-25 15:38:55 [1] Train Extra: lr=0.0002040 inv=0.4350000 sub=0.0000000
17-03-25 15:40:06 [1] Step: 13500 Acc: 0.61313 0.84208 Cost: 1.15747 0.94083 0.13425 0.08240 Time: 0.00071
17-03-25 15:40:06 [1] Train Extra: lr=0.0002034 inv=0.3942188 sub=0.0000000
17-03-25 15:40:58 [1] Step: 13500 Eval acc: 0.62268 0.83859 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 15:40:58 [1] Eval Extra: inv=0.4188936
17-03-25 15:40:58 [1] Checkpointing with new best dev accuracy of 0.622681
17-03-25 15:42:17 [1] Step: 13600 Acc: 0.60094 0.83286 Cost: 0.99493 0.65318 0.25957 0.08218 Time: 0.00069
17-03-25 15:42:17 [1] Train Extra: lr=0.0002029 inv=0.4410938 sub=0.0000000
17-03-25 15:43:30 [1] Step: 13700 Acc: 0.60062 0.82808 Cost: 1.28107 0.96744 0.23168 0.08195 Time: 0.00066
17-03-25 15:43:30 [1] Train Extra: lr=0.0002023 inv=0.4420312 sub=0.0000000
17-03-25 15:44:45 [1] Step: 13800 Acc: 0.59156 0.83504 Cost: 1.28758 0.94652 0.25929 0.08177 Time: 0.00068
17-03-25 15:44:45 [1] Train Extra: lr=0.0002017 inv=0.4056250 sub=0.0000000
17-03-25 15:46:11 [1] Step: 13900 Acc: 0.59469 0.83635 Cost: 1.30355 0.96019 0.26177 0.08159 Time: 0.00071
17-03-25 15:46:11 [1] Train Extra: lr=0.0002011 inv=0.4185937 sub=0.0000000
17-03-25 15:47:17 [1] Step: 14000 Acc: 0.61062 0.83814 Cost: 0.98005 0.72649 0.17207 0.08149 Time: 0.00067
17-03-25 15:47:17 [1] Train Extra: lr=0.0002005 inv=0.3948437 sub=0.0000000
17-03-25 15:48:08 [1] Step: 14000 Eval acc: 0.61760 0.84014 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 15:48:08 [1] Eval Extra: inv=0.4321996
17-03-25 15:49:23 [1] Step: 14100 Acc: 0.60250 0.83605 Cost: 1.27420 0.92950 0.26340 0.08130 Time: 0.00068
17-03-25 15:49:23 [1] Train Extra: lr=0.0002000 inv=0.4103125 sub=0.0000000
17-03-25 15:50:42 [1] Step: 14200 Acc: 0.60531 0.83741 Cost: 1.14838 0.81278 0.25445 0.08115 Time: 0.00071
17-03-25 15:50:42 [1] Train Extra: lr=0.0001994 inv=0.4232812 sub=0.0000000
17-03-25 15:52:00 [1] Step: 14300 Acc: 0.60406 0.83502 Cost: 1.18411 0.81691 0.28629 0.08091 Time: 0.00071
17-03-25 15:52:00 [1] Train Extra: lr=0.0001988 inv=0.4135937 sub=0.0000000
17-03-25 15:53:15 [1] Step: 14400 Acc: 0.60969 0.83606 Cost: 1.07576 0.78732 0.20765 0.08079 Time: 0.00068
17-03-25 15:53:15 [1] Train Extra: lr=0.0001982 inv=0.4226563 sub=0.0000000
17-03-25 15:54:29 [1] Step: 14500 Acc: 0.59844 0.84006 Cost: 1.03082 0.79179 0.15837 0.08065 Time: 0.00069
17-03-25 15:54:29 [1] Train Extra: lr=0.0001977 inv=0.3992188 sub=0.0000000
17-03-25 15:55:20 [1] Step: 14500 Eval acc: 0.62688 0.84472 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 15:55:20 [1] Eval Extra: inv=0.3791409
17-03-25 15:55:20 [1] Checkpointing with new best dev accuracy of 0.626877
17-03-25 15:56:33 [1] Step: 14600 Acc: 0.60500 0.82692 Cost: 1.15464 0.85432 0.21971 0.08061 Time: 0.00067
17-03-25 15:56:33 [1] Train Extra: lr=0.0001971 inv=0.4229688 sub=0.0000000
17-03-25 15:57:55 [1] Step: 14700 Acc: 0.58156 0.83207 Cost: 1.12778 0.83496 0.21234 0.08048 Time: 0.00070
17-03-25 15:57:55 [1] Train Extra: lr=0.0001965 inv=0.4179688 sub=0.0000000
17-03-25 15:59:06 [1] Step: 14800 Acc: 0.60688 0.83631 Cost: 1.14292 0.85910 0.20347 0.08035 Time: 0.00070
17-03-25 15:59:06 [1] Train Extra: lr=0.0001960 inv=0.3890625 sub=0.0000000
17-03-25 16:00:20 [1] Step: 14900 Acc: 0.59062 0.84205 Cost: 1.26698 0.91245 0.27428 0.08025 Time: 0.00068
17-03-25 16:00:20 [1] Train Extra: lr=0.0001954 inv=0.4007812 sub=0.0000000
17-03-25 16:01:45 [1] Step: 15000 Acc: 0.60750 0.83982 Cost: 1.25532 0.96422 0.21103 0.08006 Time: 0.00073
17-03-25 16:01:45 [1] Train Extra: lr=0.0001949 inv=0.4081250 sub=0.0000000
17-03-25 16:02:37 [1] Step: 15000 Eval acc: 0.62500 0.83795 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 16:02:37 [1] Eval Extra: inv=0.4710689
17-03-25 16:02:37 [1] Checkpointing.
17-03-25 16:03:54 [1] Step: 15100 Acc: 0.60281 0.83722 Cost: 1.43477 1.07628 0.27860 0.07989 Time: 0.00068
17-03-25 16:03:54 [1] Train Extra: lr=0.0001943 inv=0.4307813 sub=0.0000000
17-03-25 16:05:12 [1] Step: 15200 Acc: 0.59188 0.83424 Cost: 1.15102 0.89505 0.17630 0.07968 Time: 0.00072
17-03-25 16:05:12 [1] Train Extra: lr=0.0001937 inv=0.4156250 sub=0.0000000
17-03-25 16:06:28 [1] Step: 15300 Acc: 0.58625 0.83922 Cost: 1.43235 1.07495 0.27791 0.07950 Time: 0.00068
17-03-25 16:06:28 [1] Train Extra: lr=0.0001932 inv=0.4457813 sub=0.0000000
17-03-25 16:07:42 [1] Step: 15400 Acc: 0.61719 0.83856 Cost: 1.13355 0.78342 0.27067 0.07946 Time: 0.00068
17-03-25 16:07:42 [1] Train Extra: lr=0.0001926 inv=0.4237500 sub=0.0000000
17-03-25 16:08:59 [1] Step: 15500 Acc: 0.59813 0.84347 Cost: 0.92009 0.70300 0.13776 0.07933 Time: 0.00072
17-03-25 16:08:59 [1] Train Extra: lr=0.0001921 inv=0.4001562 sub=0.0000000
17-03-25 16:09:51 [1] Step: 15500 Eval acc: 0.62544 0.84485 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 16:09:51 [1] Eval Extra: inv=0.3725707
17-03-25 16:11:04 [1] Step: 15600 Acc: 0.60969 0.83254 Cost: 1.18595 0.88727 0.21948 0.07920 Time: 0.00067
17-03-25 16:11:04 [1] Train Extra: lr=0.0001915 inv=0.4392187 sub=0.0000000
17-03-25 16:12:13 [1] Step: 15700 Acc: 0.61531 0.83120 Cost: 1.20270 0.88721 0.23646 0.07903 Time: 0.00065
17-03-25 16:12:13 [1] Train Extra: lr=0.0001910 inv=0.4359375 sub=0.0000000
17-03-25 16:13:39 [1] Step: 15800 Acc: 0.58594 0.84376 Cost: 0.96134 0.77767 0.10477 0.07891 Time: 0.00073
17-03-25 16:13:39 [1] Train Extra: lr=0.0001904 inv=0.4039063 sub=0.0000000
17-03-25 16:14:58 [1] Step: 15900 Acc: 0.61313 0.83731 Cost: 1.26342 0.90049 0.28419 0.07875 Time: 0.00070
17-03-25 16:14:58 [1] Train Extra: lr=0.0001899 inv=0.4140625 sub=0.0000000
17-03-25 16:16:17 [1] Step: 16000 Acc: 0.60156 0.83102 Cost: 1.08575 0.75716 0.24994 0.07865 Time: 0.00068
17-03-25 16:16:17 [1] Train Extra: lr=0.0001893 inv=0.4306250 sub=0.0000000
17-03-25 16:17:09 [1] Step: 16000 Eval acc: 0.62114 0.84406 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 16:17:09 [1] Eval Extra: inv=0.3913980
17-03-25 16:18:28 [1] Step: 16100 Acc: 0.60813 0.83524 Cost: 1.27643 0.90582 0.29206 0.07856 Time: 0.00070
17-03-25 16:18:28 [1] Train Extra: lr=0.0001888 inv=0.4276563 sub=0.0000000
17-03-25 16:19:40 [1] Step: 16200 Acc: 0.60625 0.83632 Cost: 1.24308 0.91166 0.25289 0.07854 Time: 0.00068
17-03-25 16:19:40 [1] Train Extra: lr=0.0001882 inv=0.4193750 sub=0.0000000
17-03-25 16:20:55 [1] Step: 16300 Acc: 0.62469 0.82921 Cost: 0.91452 0.64540 0.19066 0.07846 Time: 0.00067
17-03-25 16:20:55 [1] Train Extra: lr=0.0001877 inv=0.4267187 sub=0.0000000
17-03-25 16:22:15 [1] Step: 16400 Acc: 0.58594 0.84182 Cost: 1.26027 0.93639 0.24548 0.07841 Time: 0.00073
17-03-25 16:22:15 [1] Train Extra: lr=0.0001872 inv=0.3865625 sub=0.0000000
17-03-25 16:23:28 [1] Step: 16500 Acc: 0.60562 0.83616 Cost: 1.13747 0.91647 0.14279 0.07821 Time: 0.00069
17-03-25 16:23:28 [1] Train Extra: lr=0.0001866 inv=0.4026562 sub=0.0000000
17-03-25 16:24:19 [1] Step: 16500 Eval acc: 0.62511 0.83945 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 16:24:19 [1] Eval Extra: inv=0.3864289
17-03-25 16:25:38 [1] Step: 16600 Acc: 0.62844 0.84045 Cost: 1.08830 0.86515 0.14505 0.07810 Time: 0.00071
17-03-25 16:25:38 [1] Train Extra: lr=0.0001861 inv=0.4012500 sub=0.0000000
17-03-25 16:26:58 [1] Step: 16700 Acc: 0.61687 0.84827 Cost: 1.52178 1.09599 0.34778 0.07801 Time: 0.00073
17-03-25 16:26:58 [1] Train Extra: lr=0.0001856 inv=0.4017188 sub=0.0000000
17-03-25 16:28:21 [1] Step: 16800 Acc: 0.61375 0.84152 Cost: 1.08395 0.75745 0.24857 0.07792 Time: 0.00072
17-03-25 16:28:21 [1] Train Extra: lr=0.0001850 inv=0.4618750 sub=0.0000000
17-03-25 16:29:53 [1] Step: 16900 Acc: 0.61562 0.83959 Cost: 0.85851 0.64273 0.13783 0.07795 Time: 0.00074
17-03-25 16:29:53 [1] Train Extra: lr=0.0001845 inv=0.4531250 sub=0.0000000
17-03-25 16:31:07 [1] Step: 17000 Acc: 0.62313 0.84116 Cost: 1.25287 0.85486 0.32009 0.07792 Time: 0.00069
17-03-25 16:31:07 [1] Train Extra: lr=0.0001840 inv=0.4193750 sub=0.0000000
17-03-25 16:31:59 [1] Step: 17000 Eval acc: 0.63571 0.84612 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 16:31:59 [1] Eval Extra: inv=0.4496466
17-03-25 16:31:59 [1] Checkpointing with new best dev accuracy of 0.635711
17-03-25 16:33:19 [1] Step: 17100 Acc: 0.61906 0.84137 Cost: 1.51941 1.08453 0.35699 0.07789 Time: 0.00071
17-03-25 16:33:19 [1] Train Extra: lr=0.0001834 inv=0.4226563 sub=0.0000000
17-03-25 16:34:32 [1] Step: 17200 Acc: 0.61844 0.84038 Cost: 1.00088 0.74989 0.17302 0.07798 Time: 0.00070
17-03-25 16:34:32 [1] Train Extra: lr=0.0001829 inv=0.4068750 sub=0.0000000
17-03-25 16:35:45 [1] Step: 17300 Acc: 0.62969 0.84025 Cost: 1.44709 1.10277 0.26628 0.07804 Time: 0.00068
17-03-25 16:35:45 [1] Train Extra: lr=0.0001824 inv=0.4368750 sub=0.0000000
17-03-25 16:37:00 [1] Step: 17400 Acc: 0.61875 0.83989 Cost: 1.35758 1.00691 0.27265 0.07802 Time: 0.00068
17-03-25 16:37:00 [1] Train Extra: lr=0.0001819 inv=0.3996875 sub=0.0000000
17-03-25 16:38:24 [1] Step: 17500 Acc: 0.62031 0.84282 Cost: 1.49083 1.07693 0.33580 0.07810 Time: 0.00073
17-03-25 16:38:24 [1] Train Extra: lr=0.0001813 inv=0.4292187 sub=0.0000000
17-03-25 16:39:16 [1] Step: 17500 Eval acc: 0.63052 0.84473 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 16:39:16 [1] Eval Extra: inv=0.3111749
17-03-25 16:40:29 [1] Step: 17600 Acc: 0.61250 0.84457 Cost: 1.34880 0.99519 0.27559 0.07801 Time: 0.00071
17-03-25 16:40:29 [1] Train Extra: lr=0.0001808 inv=0.3790625 sub=0.0000000
17-03-25 16:41:45 [1] Step: 17700 Acc: 0.61156 0.83306 Cost: 0.89540 0.66363 0.15388 0.07788 Time: 0.00068
17-03-25 16:41:45 [1] Train Extra: lr=0.0001803 inv=0.4287500 sub=0.0000000
17-03-25 16:43:05 [1] Step: 17800 Acc: 0.62219 0.84685 Cost: 1.41800 1.08747 0.25263 0.07791 Time: 0.00072
17-03-25 16:43:05 [1] Train Extra: lr=0.0001798 inv=0.4132812 sub=0.0000000
17-03-25 16:44:25 [1] Step: 17900 Acc: 0.61250 0.84174 Cost: 0.99743 0.70309 0.21645 0.07790 Time: 0.00071
17-03-25 16:44:25 [1] Train Extra: lr=0.0001793 inv=0.4095313 sub=0.0000000
17-03-25 16:45:52 [1] Step: 18000 Acc: 0.62906 0.83865 Cost: 0.89924 0.60101 0.22032 0.07791 Time: 0.00072
17-03-25 16:45:52 [1] Train Extra: lr=0.0001787 inv=0.4465625 sub=0.0000000
17-03-25 16:46:44 [1] Step: 18000 Eval acc: 0.63748 0.84785 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 16:46:44 [1] Eval Extra: inv=0.4236418
17-03-25 16:47:52 [1] Step: 18100 Acc: 0.63375 0.84133 Cost: 0.95782 0.74121 0.13874 0.07788 Time: 0.00068
17-03-25 16:47:52 [1] Train Extra: lr=0.0001782 inv=0.3753125 sub=0.0000000
17-03-25 16:49:17 [1] Step: 18200 Acc: 0.62562 0.83900 Cost: 1.28377 0.97372 0.23220 0.07784 Time: 0.00071
17-03-25 16:49:17 [1] Train Extra: lr=0.0001777 inv=0.4700000 sub=0.0000000
17-03-25 16:50:44 [1] Step: 18300 Acc: 0.61406 0.84691 Cost: 1.19305 0.92772 0.18758 0.07775 Time: 0.00074
17-03-25 16:50:44 [1] Train Extra: lr=0.0001772 inv=0.4078125 sub=0.0000000
17-03-25 16:52:02 [1] Step: 18400 Acc: 0.62656 0.84233 Cost: 1.25799 0.87329 0.30702 0.07769 Time: 0.00071
17-03-25 16:52:02 [1] Train Extra: lr=0.0001767 inv=0.4060937 sub=0.0000000
17-03-25 16:53:18 [1] Step: 18500 Acc: 0.62250 0.84979 Cost: 0.97314 0.72886 0.16660 0.07768 Time: 0.00070
17-03-25 16:53:18 [1] Train Extra: lr=0.0001762 inv=0.3976562 sub=0.0000000
17-03-25 16:54:09 [1] Step: 18500 Eval acc: 0.63328 0.83935 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 16:54:09 [1] Eval Extra: inv=0.4201634
17-03-25 16:55:27 [1] Step: 18600 Acc: 0.62906 0.84365 Cost: 1.22712 0.93469 0.21470 0.07772 Time: 0.00072
17-03-25 16:55:27 [1] Train Extra: lr=0.0001757 inv=0.4051562 sub=0.0000000
17-03-25 16:56:41 [1] Step: 18700 Acc: 0.61750 0.83664 Cost: 0.95138 0.77616 0.09744 0.07778 Time: 0.00069
17-03-25 16:56:41 [1] Train Extra: lr=0.0001752 inv=0.4120313 sub=0.0000000
17-03-25 16:58:01 [1] Step: 18800 Acc: 0.63313 0.84039 Cost: 1.11721 0.81430 0.22514 0.07777 Time: 0.00070
17-03-25 16:58:01 [1] Train Extra: lr=0.0001747 inv=0.4390625 sub=0.0000000
17-03-25 16:59:13 [1] Step: 18900 Acc: 0.62156 0.84398 Cost: 1.31584 0.90801 0.33009 0.07774 Time: 0.00070
17-03-25 16:59:13 [1] Train Extra: lr=0.0001742 inv=0.4059375 sub=0.0000000
17-03-25 17:00:27 [1] Step: 19000 Acc: 0.62500 0.83984 Cost: 1.28032 0.93890 0.26381 0.07762 Time: 0.00069
17-03-25 17:00:27 [1] Train Extra: lr=0.0001737 inv=0.4018750 sub=0.0000000
17-03-25 17:01:19 [1] Step: 19000 Eval acc: 0.64245 0.84652 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 17:01:19 [1] Eval Extra: inv=0.4048697
17-03-25 17:01:19 [1] Checkpointing with new best dev accuracy of 0.642447
17-03-25 17:02:26 [1] Step: 19100 Acc: 0.62125 0.84227 Cost: 1.03041 0.78368 0.16915 0.07759 Time: 0.00068
17-03-25 17:02:26 [1] Train Extra: lr=0.0001732 inv=0.3920312 sub=0.0000000
17-03-25 17:03:35 [1] Step: 19200 Acc: 0.62156 0.83716 Cost: 1.12416 0.77916 0.26744 0.07757 Time: 0.00066
17-03-25 17:03:35 [1] Train Extra: lr=0.0001727 inv=0.3956250 sub=0.0000000
17-03-25 17:04:56 [1] Step: 19300 Acc: 0.61313 0.83780 Cost: 1.10056 0.83309 0.19001 0.07746 Time: 0.00071
17-03-25 17:04:56 [1] Train Extra: lr=0.0001722 inv=0.3959375 sub=0.0000000
17-03-25 17:06:08 [1] Step: 19400 Acc: 0.63500 0.83552 Cost: 0.94872 0.65852 0.21276 0.07744 Time: 0.00068
17-03-25 17:06:08 [1] Train Extra: lr=0.0001717 inv=0.4196875 sub=0.0000000
17-03-25 17:07:23 [1] Step: 19500 Acc: 0.62562 0.83579 Cost: 1.45837 1.08561 0.29526 0.07750 Time: 0.00068
17-03-25 17:07:23 [1] Train Extra: lr=0.0001712 inv=0.4248438 sub=0.0000000
17-03-25 17:08:15 [1] Step: 19500 Eval acc: 0.63527 0.83890 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 17:08:15 [1] Eval Extra: inv=0.4404814
17-03-25 17:09:28 [1] Step: 19600 Acc: 0.62313 0.83853 Cost: 1.19117 0.90479 0.20888 0.07750 Time: 0.00068
17-03-25 17:09:28 [1] Train Extra: lr=0.0001707 inv=0.4346875 sub=0.0000000
17-03-25 17:10:42 [1] Step: 19700 Acc: 0.61313 0.83753 Cost: 1.13027 0.86574 0.18707 0.07746 Time: 0.00067
17-03-25 17:10:42 [1] Train Extra: lr=0.0001702 inv=0.4267187 sub=0.0000000
17-03-25 17:11:56 [1] Step: 19800 Acc: 0.63687 0.83701 Cost: 1.20297 0.81725 0.30826 0.07746 Time: 0.00068
17-03-25 17:11:56 [1] Train Extra: lr=0.0001697 inv=0.4210937 sub=0.0000000
17-03-25 17:13:14 [1] Step: 19900 Acc: 0.61750 0.84193 Cost: 0.89906 0.66419 0.15741 0.07746 Time: 0.00071
17-03-25 17:13:14 [1] Train Extra: lr=0.0001692 inv=0.4042188 sub=0.0000000
17-03-25 17:14:23 [1] Step: 20000 Acc: 0.62781 0.83931 Cost: 0.99548 0.72637 0.19169 0.07742 Time: 0.00066
17-03-25 17:14:23 [1] Train Extra: lr=0.0001687 inv=0.4218750 sub=0.0000000
17-03-25 17:15:15 [1] Step: 20000 Eval acc: 0.64554 0.84168 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 17:15:15 [1] Eval Extra: inv=0.4159673
17-03-25 17:15:15 [1] Checkpointing.
17-03-25 17:16:30 [1] Step: 20100 Acc: 0.62250 0.84411 Cost: 1.13013 0.79036 0.26236 0.07740 Time: 0.00069
17-03-25 17:16:30 [1] Train Extra: lr=0.0001683 inv=0.4064063 sub=0.0000000
17-03-25 17:17:43 [1] Step: 20200 Acc: 0.62438 0.83789 Cost: 1.02816 0.76562 0.18515 0.07739 Time: 0.00067
17-03-25 17:17:43 [1] Train Extra: lr=0.0001678 inv=0.4385938 sub=0.0000000
17-03-25 17:18:54 [1] Step: 20300 Acc: 0.64062 0.84275 Cost: 0.99488 0.77442 0.14302 0.07744 Time: 0.00070
17-03-25 17:18:54 [1] Train Extra: lr=0.0001673 inv=0.3909375 sub=0.0000000
17-03-25 17:20:14 [1] Step: 20400 Acc: 0.63000 0.83644 Cost: 1.19871 0.85517 0.26610 0.07743 Time: 0.00070
17-03-25 17:20:14 [1] Train Extra: lr=0.0001668 inv=0.4290625 sub=0.0000000
17-03-25 17:21:29 [1] Step: 20500 Acc: 0.61531 0.83784 Cost: 1.20403 0.91096 0.21566 0.07742 Time: 0.00067
17-03-25 17:21:29 [1] Train Extra: lr=0.0001663 inv=0.4485938 sub=0.0000000
17-03-25 17:22:21 [1] Step: 20500 Eval acc: 0.64808 0.84785 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 17:22:21 [1] Eval Extra: inv=0.4005080
17-03-25 17:22:21 [1] Checkpointing with new best dev accuracy of 0.648079
17-03-25 17:23:40 [1] Step: 20600 Acc: 0.62750 0.84456 Cost: 1.04768 0.81549 0.15480 0.07739 Time: 0.00071
17-03-25 17:23:40 [1] Train Extra: lr=0.0001659 inv=0.3995313 sub=0.0000000
17-03-25 17:24:59 [1] Step: 20700 Acc: 0.62938 0.84029 Cost: 1.01444 0.78789 0.14922 0.07733 Time: 0.00070
17-03-25 17:24:59 [1] Train Extra: lr=0.0001654 inv=0.4178125 sub=0.0000000
17-03-25 17:26:06 [1] Step: 20800 Acc: 0.60969 0.83755 Cost: 1.20069 0.87003 0.25335 0.07731 Time: 0.00067
17-03-25 17:26:06 [1] Train Extra: lr=0.0001649 inv=0.3920312 sub=0.0000000
17-03-25 17:27:27 [1] Step: 20900 Acc: 0.61750 0.83894 Cost: 0.94127 0.66732 0.19672 0.07722 Time: 0.00070
17-03-25 17:27:27 [1] Train Extra: lr=0.0001644 inv=0.4064063 sub=0.0000000
17-03-25 17:28:43 [1] Step: 21000 Acc: 0.62594 0.84539 Cost: 1.43994 1.05845 0.30430 0.07719 Time: 0.00069
17-03-25 17:28:43 [1] Train Extra: lr=0.0001640 inv=0.4071875 sub=0.0000000
17-03-25 17:29:38 [1] Step: 21000 Eval acc: 0.65150 0.84864 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 17:29:38 [1] Eval Extra: inv=0.3869810
17-03-25 17:30:53 [1] Step: 21100 Acc: 0.62094 0.84220 Cost: 1.17386 0.85322 0.24352 0.07712 Time: 0.00069
17-03-25 17:30:53 [1] Train Extra: lr=0.0001635 inv=0.4109375 sub=0.0000000
17-03-25 17:32:11 [1] Step: 21200 Acc: 0.62687 0.83702 Cost: 1.07310 0.80753 0.18846 0.07712 Time: 0.00070
17-03-25 17:32:11 [1] Train Extra: lr=0.0001630 inv=0.4573437 sub=0.0000000
17-03-25 17:33:20 [1] Step: 21300 Acc: 0.63187 0.84299 Cost: 1.21056 0.83359 0.29992 0.07704 Time: 0.00067
17-03-25 17:33:20 [1] Train Extra: lr=0.0001626 inv=0.3976562 sub=0.0000000
17-03-25 17:34:33 [1] Step: 21400 Acc: 0.61031 0.84132 Cost: 1.27198 0.90157 0.29343 0.07698 Time: 0.00068
17-03-25 17:34:33 [1] Train Extra: lr=0.0001621 inv=0.4232812 sub=0.0000000
17-03-25 17:36:00 [1] Step: 21500 Acc: 0.63687 0.85201 Cost: 1.24011 0.95858 0.20450 0.07702 Time: 0.00074
17-03-25 17:36:00 [1] Train Extra: lr=0.0001616 inv=0.4351563 sub=0.0000000
17-03-25 17:36:52 [1] Step: 21500 Eval acc: 0.64543 0.84372 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 17:36:52 [1] Eval Extra: inv=0.4082376
17-03-25 17:38:06 [1] Step: 21600 Acc: 0.63906 0.84810 Cost: 0.93692 0.70566 0.15424 0.07701 Time: 0.00071
17-03-25 17:38:06 [1] Train Extra: lr=0.0001612 inv=0.3875000 sub=0.0000000
17-03-25 17:39:25 [1] Step: 21700 Acc: 0.62313 0.83121 Cost: 1.23248 0.85994 0.29548 0.07706 Time: 0.00069
17-03-25 17:39:25 [1] Train Extra: lr=0.0001607 inv=0.4425000 sub=0.0000000
17-03-25 17:40:39 [1] Step: 21800 Acc: 0.62344 0.83512 Cost: 1.04868 0.74290 0.22881 0.07698 Time: 0.00068
17-03-25 17:40:39 [1] Train Extra: lr=0.0001602 inv=0.4362500 sub=0.0000000
17-03-25 17:41:53 [1] Step: 21900 Acc: 0.63344 0.84875 Cost: 1.19551 0.92365 0.19489 0.07698 Time: 0.00070
17-03-25 17:41:53 [1] Train Extra: lr=0.0001598 inv=0.3953125 sub=0.0000000
17-03-25 17:43:18 [1] Step: 22000 Acc: 0.61687 0.84804 Cost: 1.20139 0.95822 0.16615 0.07702 Time: 0.00074
17-03-25 17:43:18 [1] Train Extra: lr=0.0001593 inv=0.4304688 sub=0.0000000
17-03-25 17:44:10 [1] Step: 22000 Eval acc: 0.64587 0.84280 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 17:44:10 [1] Eval Extra: inv=0.4150839
17-03-25 17:45:30 [1] Step: 22100 Acc: 0.63062 0.84224 Cost: 0.97816 0.69312 0.20808 0.07696 Time: 0.00071
17-03-25 17:45:30 [1] Train Extra: lr=0.0001589 inv=0.4217187 sub=0.0000000
17-03-25 17:46:50 [1] Step: 22200 Acc: 0.63062 0.84055 Cost: 0.94920 0.73606 0.13623 0.07691 Time: 0.00070
17-03-25 17:46:50 [1] Train Extra: lr=0.0001584 inv=0.4165625 sub=0.0000000
17-03-25 17:48:04 [1] Step: 22300 Acc: 0.62906 0.83727 Cost: 1.18992 0.79347 0.31956 0.07689 Time: 0.00068
17-03-25 17:48:04 [1] Train Extra: lr=0.0001579 inv=0.4156250 sub=0.0000000
17-03-25 17:49:16 [1] Step: 22400 Acc: 0.61687 0.84068 Cost: 1.06391 0.72815 0.25888 0.07688 Time: 0.00069
17-03-25 17:49:16 [1] Train Extra: lr=0.0001575 inv=0.4198438 sub=0.0000000
17-03-25 17:50:38 [1] Step: 22500 Acc: 0.62438 0.84583 Cost: 1.16055 0.84808 0.23559 0.07687 Time: 0.00071
17-03-25 17:50:38 [1] Train Extra: lr=0.0001570 inv=0.4367187 sub=0.0000000
17-03-25 17:51:30 [1] Step: 22500 Eval acc: 0.64532 0.84713 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 17:51:30 [1] Eval Extra: inv=0.3958149
17-03-25 17:52:36 [1] Step: 22600 Acc: 0.62844 0.84439 Cost: 1.17953 0.85129 0.25144 0.07680 Time: 0.00068
17-03-25 17:52:36 [1] Train Extra: lr=0.0001566 inv=0.4217187 sub=0.0000000
17-03-25 17:53:50 [1] Step: 22700 Acc: 0.62062 0.83586 Cost: 1.02348 0.74078 0.20591 0.07679 Time: 0.00068
17-03-25 17:53:50 [1] Train Extra: lr=0.0001561 inv=0.4456250 sub=0.0000000
17-03-25 17:55:02 [1] Step: 22800 Acc: 0.63625 0.84649 Cost: 1.05743 0.92038 0.06033 0.07672 Time: 0.00070
17-03-25 17:55:02 [1] Train Extra: lr=0.0001557 inv=0.4048438 sub=0.0000000
17-03-25 17:56:13 [1] Step: 22900 Acc: 0.62562 0.84048 Cost: 1.48465 1.13090 0.27696 0.07678 Time: 0.00066
17-03-25 17:56:13 [1] Train Extra: lr=0.0001552 inv=0.4342187 sub=0.0000000
17-03-25 17:57:31 [1] Step: 23000 Acc: 0.61594 0.84609 Cost: 1.16054 0.90146 0.18237 0.07671 Time: 0.00072
17-03-25 17:57:31 [1] Train Extra: lr=0.0001548 inv=0.4267187 sub=0.0000000
17-03-25 17:58:23 [1] Step: 23000 Eval acc: 0.64764 0.84846 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 17:58:23 [1] Eval Extra: inv=0.3538538
17-03-25 17:59:36 [1] Step: 23100 Acc: 0.62906 0.84037 Cost: 1.11208 0.81164 0.22375 0.07668 Time: 0.00069
17-03-25 17:59:36 [1] Train Extra: lr=0.0001544 inv=0.4103125 sub=0.0000000
17-03-25 18:01:02 [1] Step: 23200 Acc: 0.62625 0.84001 Cost: 0.93323 0.84085 0.01572 0.07666 Time: 0.00072
17-03-25 18:01:02 [1] Train Extra: lr=0.0001539 inv=0.4410938 sub=0.0000000
17-03-25 18:02:18 [1] Step: 23300 Acc: 0.63344 0.84602 Cost: 1.12980 0.78374 0.26929 0.07677 Time: 0.00069
17-03-25 18:02:18 [1] Train Extra: lr=0.0001535 inv=0.4187500 sub=0.0000000
17-03-25 18:03:30 [1] Step: 23400 Acc: 0.63750 0.83590 Cost: 1.11474 0.79188 0.24606 0.07681 Time: 0.00068
17-03-25 18:03:30 [1] Train Extra: lr=0.0001530 inv=0.4225000 sub=0.0000000
17-03-25 18:04:50 [1] Step: 23500 Acc: 0.62687 0.83462 Cost: 1.17644 0.87507 0.22461 0.07676 Time: 0.00069
17-03-25 18:04:50 [1] Train Extra: lr=0.0001526 inv=0.4400000 sub=0.0000000
17-03-25 18:05:42 [1] Step: 23500 Eval acc: 0.65139 0.84950 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 18:05:42 [1] Eval Extra: inv=0.3912323
17-03-25 18:06:55 [1] Step: 23600 Acc: 0.63125 0.84034 Cost: 1.12805 0.79036 0.26091 0.07678 Time: 0.00069
17-03-25 18:06:55 [1] Train Extra: lr=0.0001521 inv=0.4090625 sub=0.0000000
17-03-25 18:08:16 [1] Step: 23700 Acc: 0.61344 0.84292 Cost: 1.17234 0.77596 0.31966 0.07672 Time: 0.00069
17-03-25 18:08:16 [1] Train Extra: lr=0.0001517 inv=0.4382813 sub=0.0000000
17-03-25 18:09:30 [1] Step: 23800 Acc: 0.63938 0.84747 Cost: 1.17953 1.08116 0.02164 0.07673 Time: 0.00070
17-03-25 18:09:30 [1] Train Extra: lr=0.0001513 inv=0.3926562 sub=0.0000000
17-03-25 18:10:49 [1] Step: 23900 Acc: 0.62687 0.84396 Cost: 0.89466 0.72708 0.09076 0.07683 Time: 0.00072
17-03-25 18:10:49 [1] Train Extra: lr=0.0001508 inv=0.3970313 sub=0.0000000
17-03-25 18:12:09 [1] Step: 24000 Acc: 0.62000 0.84509 Cost: 1.05497 0.74149 0.23667 0.07681 Time: 0.00071
17-03-25 18:12:09 [1] Train Extra: lr=0.0001504 inv=0.4407813 sub=0.0000000
17-03-25 18:13:01 [1] Step: 24000 Eval acc: 0.65216 0.84882 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 18:13:01 [1] Eval Extra: inv=0.4120473
17-03-25 18:13:01 [1] Checkpointing with new best dev accuracy of 0.652164
17-03-25 18:14:13 [1] Step: 24100 Acc: 0.63094 0.83516 Cost: 1.27826 0.99216 0.20931 0.07679 Time: 0.00068
17-03-25 18:14:13 [1] Train Extra: lr=0.0001500 inv=0.4504687 sub=0.0000000
17-03-25 18:15:33 [1] Step: 24200 Acc: 0.64000 0.84162 Cost: 1.11638 0.85522 0.18443 0.07673 Time: 0.00071
17-03-25 18:15:33 [1] Train Extra: lr=0.0001495 inv=0.4192187 sub=0.0000000
17-03-25 18:16:54 [1] Step: 24300 Acc: 0.61719 0.84188 Cost: 0.98979 0.78182 0.13125 0.07671 Time: 0.00070
17-03-25 18:16:54 [1] Train Extra: lr=0.0001491 inv=0.4293750 sub=0.0000000
17-03-25 18:18:14 [1] Step: 24400 Acc: 0.62500 0.84474 Cost: 1.15831 0.92665 0.15495 0.07671 Time: 0.00071
17-03-25 18:18:14 [1] Train Extra: lr=0.0001487 inv=0.4354688 sub=0.0000000
17-03-25 18:19:28 [1] Step: 24500 Acc: 0.63344 0.83922 Cost: 1.01852 0.73486 0.20692 0.07674 Time: 0.00068
17-03-25 18:19:28 [1] Train Extra: lr=0.0001483 inv=0.4245313 sub=0.0000000
17-03-25 18:20:19 [1] Step: 24500 Eval acc: 0.65051 0.84858 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 18:20:19 [1] Eval Extra: inv=0.4302120
17-03-25 18:21:43 [1] Step: 24600 Acc: 0.62969 0.84190 Cost: 1.01834 0.66280 0.27879 0.07675 Time: 0.00073
17-03-25 18:21:43 [1] Train Extra: lr=0.0001478 inv=0.4329688 sub=0.0000000
17-03-25 18:23:03 [1] Step: 24700 Acc: 0.63906 0.83872 Cost: 0.85525 0.64951 0.12895 0.07680 Time: 0.00070
17-03-25 18:23:03 [1] Train Extra: lr=0.0001474 inv=0.4375000 sub=0.0000000
17-03-25 18:24:23 [1] Step: 24800 Acc: 0.63000 0.83724 Cost: 1.04096 0.80579 0.15844 0.07674 Time: 0.00070
17-03-25 18:24:23 [1] Train Extra: lr=0.0001470 inv=0.4228125 sub=0.0000000
17-03-25 18:25:45 [1] Step: 24900 Acc: 0.62562 0.83437 Cost: 1.00546 0.73295 0.19578 0.07672 Time: 0.00068
17-03-25 18:25:45 [1] Train Extra: lr=0.0001466 inv=0.4520312 sub=0.0000000
17-03-25 18:26:58 [1] Step: 25000 Acc: 0.63031 0.84017 Cost: 1.00294 0.67794 0.24827 0.07673 Time: 0.00070
17-03-25 18:26:58 [1] Train Extra: lr=0.0001461 inv=0.4037500 sub=0.0000000
17-03-25 18:27:50 [1] Step: 25000 Eval acc: 0.64145 0.84235 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 18:27:50 [1] Eval Extra: inv=0.3966983
17-03-25 18:27:50 [1] Checkpointing.
17-03-25 18:29:06 [1] Step: 25100 Acc: 0.63219 0.83413 Cost: 1.05284 0.77112 0.20492 0.07679 Time: 0.00067
17-03-25 18:29:06 [1] Train Extra: lr=0.0001457 inv=0.4328125 sub=0.0000000
17-03-25 18:30:27 [1] Step: 25200 Acc: 0.62906 0.84142 Cost: 1.27031 0.89224 0.30128 0.07679 Time: 0.00072
17-03-25 18:30:27 [1] Train Extra: lr=0.0001453 inv=0.4434375 sub=0.0000000
17-03-25 18:31:47 [1] Step: 25300 Acc: 0.64344 0.84018 Cost: 1.02524 0.76344 0.18494 0.07686 Time: 0.00070
17-03-25 18:31:47 [1] Train Extra: lr=0.0001449 inv=0.4367187 sub=0.0000000
17-03-25 18:33:03 [1] Step: 25400 Acc: 0.64625 0.84179 Cost: 1.01608 0.71848 0.22068 0.07692 Time: 0.00069
17-03-25 18:33:03 [1] Train Extra: lr=0.0001445 inv=0.3984375 sub=0.0000000
17-03-25 18:34:14 [1] Step: 25500 Acc: 0.63187 0.84465 Cost: 0.95462 0.79972 0.07783 0.07706 Time: 0.00069
17-03-25 18:34:14 [1] Train Extra: lr=0.0001441 inv=0.4132812 sub=0.0000000
17-03-25 18:35:06 [1] Step: 25500 Eval acc: 0.64775 0.84752 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 18:35:06 [1] Eval Extra: inv=0.3906250
17-03-25 18:36:20 [1] Step: 25600 Acc: 0.63781 0.84799 Cost: 1.18731 0.89271 0.21758 0.07702 Time: 0.00070
17-03-25 18:36:20 [1] Train Extra: lr=0.0001436 inv=0.4050000 sub=0.0000000
17-03-25 18:37:36 [1] Step: 25700 Acc: 0.64000 0.84160 Cost: 1.17072 0.84958 0.24402 0.07712 Time: 0.00068
17-03-25 18:37:36 [1] Train Extra: lr=0.0001432 inv=0.4446875 sub=0.0000000
17-03-25 18:38:56 [1] Step: 25800 Acc: 0.63875 0.85379 Cost: 1.25123 0.80214 0.37189 0.07720 Time: 0.00074
17-03-25 18:38:56 [1] Train Extra: lr=0.0001428 inv=0.4129687 sub=0.0000000
17-03-25 18:40:20 [1] Step: 25900 Acc: 0.64469 0.84481 Cost: 1.02788 0.72522 0.22549 0.07717 Time: 0.00074
17-03-25 18:40:20 [1] Train Extra: lr=0.0001424 inv=0.4137500 sub=0.0000000
17-03-25 18:41:36 [1] Step: 26000 Acc: 0.63656 0.84643 Cost: 1.11565 0.79361 0.24478 0.07727 Time: 0.00069
17-03-25 18:41:36 [1] Train Extra: lr=0.0001420 inv=0.4310937 sub=0.0000000
17-03-25 18:42:31 [1] Step: 26000 Eval acc: 0.65459 0.84724 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 18:42:31 [1] Eval Extra: inv=0.4517999
17-03-25 18:43:53 [1] Step: 26100 Acc: 0.63219 0.84633 Cost: 1.27611 0.84240 0.35638 0.07733 Time: 0.00070
17-03-25 18:43:53 [1] Train Extra: lr=0.0001416 inv=0.4254688 sub=0.0000000
17-03-25 18:45:08 [1] Step: 26200 Acc: 0.65281 0.83519 Cost: 1.21405 0.93631 0.20029 0.07744 Time: 0.00069
17-03-25 18:45:08 [1] Train Extra: lr=0.0001412 inv=0.4443750 sub=0.0000000
17-03-25 18:46:25 [1] Step: 26300 Acc: 0.63125 0.84275 Cost: 1.39875 1.08091 0.24032 0.07751 Time: 0.00070
17-03-25 18:46:25 [1] Train Extra: lr=0.0001408 inv=0.4270312 sub=0.0000000
17-03-25 18:47:44 [1] Step: 26400 Acc: 0.64344 0.83600 Cost: 1.11993 0.82173 0.22061 0.07759 Time: 0.00071
17-03-25 18:47:44 [1] Train Extra: lr=0.0001404 inv=0.4156250 sub=0.0000000
17-03-25 18:48:53 [1] Step: 26500 Acc: 0.64312 0.84473 Cost: 1.36750 1.04679 0.24308 0.07763 Time: 0.00069
17-03-25 18:48:53 [1] Train Extra: lr=0.0001400 inv=0.3889063 sub=0.0000000
17-03-25 18:49:46 [1] Step: 26500 Eval acc: 0.64852 0.84275 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 18:49:46 [1] Eval Extra: inv=0.4458922
17-03-25 18:51:07 [1] Step: 26600 Acc: 0.64750 0.84265 Cost: 0.97342 0.69221 0.20350 0.07772 Time: 0.00071
17-03-25 18:51:07 [1] Train Extra: lr=0.0001396 inv=0.4051562 sub=0.0000000
17-03-25 18:52:21 [1] Step: 26700 Acc: 0.65531 0.83623 Cost: 1.17159 0.79015 0.30374 0.07770 Time: 0.00068
17-03-25 18:52:21 [1] Train Extra: lr=0.0001392 inv=0.4428125 sub=0.0000000
17-03-25 18:53:42 [1] Step: 26800 Acc: 0.65938 0.84634 Cost: 1.17714 0.89527 0.20414 0.07772 Time: 0.00072
17-03-25 18:53:42 [1] Train Extra: lr=0.0001388 inv=0.4162500 sub=0.0000000
17-03-25 18:54:52 [1] Step: 26900 Acc: 0.63687 0.84468 Cost: 1.00003 0.67707 0.24513 0.07784 Time: 0.00067
17-03-25 18:54:52 [1] Train Extra: lr=0.0001384 inv=0.4137500 sub=0.0000000
17-03-25 18:56:05 [1] Step: 27000 Acc: 0.65844 0.84297 Cost: 1.22254 0.89333 0.25131 0.07790 Time: 0.00070
17-03-25 18:56:05 [1] Train Extra: lr=0.0001380 inv=0.4045313 sub=0.0000000
17-03-25 18:56:57 [1] Step: 27000 Eval acc: 0.65382 0.85047 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 18:56:57 [1] Eval Extra: inv=0.3990724
17-03-25 18:58:16 [1] Step: 27100 Acc: 0.63906 0.84179 Cost: 0.94727 0.67046 0.19886 0.07795 Time: 0.00071
17-03-25 18:58:16 [1] Train Extra: lr=0.0001376 inv=0.4209375 sub=0.0000000
17-03-25 18:59:37 [1] Step: 27200 Acc: 0.63875 0.84145 Cost: 0.93727 0.59629 0.26298 0.07800 Time: 0.00071
17-03-25 18:59:37 [1] Train Extra: lr=0.0001372 inv=0.4175000 sub=0.0000000
17-03-25 19:00:53 [1] Step: 27300 Acc: 0.64938 0.83383 Cost: 1.01862 0.76930 0.17133 0.07799 Time: 0.00066
17-03-25 19:00:53 [1] Train Extra: lr=0.0001368 inv=0.4514063 sub=0.0000000
17-03-25 19:02:05 [1] Step: 27400 Acc: 0.65125 0.83749 Cost: 1.22146 0.87874 0.26470 0.07802 Time: 0.00069
17-03-25 19:02:05 [1] Train Extra: lr=0.0001364 inv=0.4093750 sub=0.0000000
17-03-25 19:03:18 [1] Step: 27500 Acc: 0.65563 0.84285 Cost: 1.06259 0.74824 0.23625 0.07810 Time: 0.00069
17-03-25 19:03:18 [1] Train Extra: lr=0.0001360 inv=0.4004687 sub=0.0000000
17-03-25 19:04:10 [1] Step: 27500 Eval acc: 0.65216 0.85039 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 19:04:10 [1] Eval Extra: inv=0.3968087
17-03-25 19:05:36 [1] Step: 27600 Acc: 0.62031 0.84715 Cost: 1.25004 0.86174 0.31021 0.07810 Time: 0.00073
17-03-25 19:05:36 [1] Train Extra: lr=0.0001356 inv=0.4504687 sub=0.0000000
17-03-25 19:06:51 [1] Step: 27700 Acc: 0.65094 0.83442 Cost: 0.95251 0.71041 0.16402 0.07807 Time: 0.00067
17-03-25 19:06:51 [1] Train Extra: lr=0.0001352 inv=0.4631250 sub=0.0000000
17-03-25 19:08:13 [1] Step: 27800 Acc: 0.65281 0.84991 Cost: 1.15581 0.80206 0.27563 0.07813 Time: 0.00073
17-03-25 19:08:13 [1] Train Extra: lr=0.0001348 inv=0.4153125 sub=0.0000000
17-03-25 19:09:26 [1] Step: 27900 Acc: 0.65844 0.83806 Cost: 1.32426 1.00901 0.23710 0.07815 Time: 0.00068
17-03-25 19:09:26 [1] Train Extra: lr=0.0001344 inv=0.4257812 sub=0.0000000
17-03-25 19:10:36 [1] Step: 28000 Acc: 0.64844 0.83973 Cost: 0.97982 0.63082 0.27075 0.07825 Time: 0.00066
17-03-25 19:10:36 [1] Train Extra: lr=0.0001341 inv=0.3945312 sub=0.0000000
17-03-25 19:11:31 [1] Step: 28000 Eval acc: 0.65283 0.84833 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 19:11:31 [1] Eval Extra: inv=0.4230897
17-03-25 19:12:46 [1] Step: 28100 Acc: 0.66063 0.84082 Cost: 1.13271 0.96509 0.08932 0.07830 Time: 0.00069
17-03-25 19:12:46 [1] Train Extra: lr=0.0001337 inv=0.4356250 sub=0.0000000
17-03-25 19:13:59 [1] Step: 28200 Acc: 0.65687 0.83765 Cost: 1.02366 0.69296 0.25237 0.07833 Time: 0.00068
17-03-25 19:13:59 [1] Train Extra: lr=0.0001333 inv=0.4262500 sub=0.0000000
17-03-25 19:15:19 [1] Step: 28300 Acc: 0.64094 0.84553 Cost: 1.25352 0.89484 0.28018 0.07850 Time: 0.00071
17-03-25 19:15:19 [1] Train Extra: lr=0.0001329 inv=0.4212500 sub=0.0000000
17-03-25 19:16:41 [1] Step: 28400 Acc: 0.66375 0.84311 Cost: 1.40529 1.07115 0.25561 0.07853 Time: 0.00071
17-03-25 19:16:41 [1] Train Extra: lr=0.0001325 inv=0.4250000 sub=0.0000000
17-03-25 19:17:54 [1] Step: 28500 Acc: 0.64750 0.84113 Cost: 1.24279 0.84096 0.32328 0.07855 Time: 0.00069
17-03-25 19:17:54 [1] Train Extra: lr=0.0001321 inv=0.4378125 sub=0.0000000
17-03-25 19:18:46 [1] Step: 28500 Eval acc: 0.65857 0.85059 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 19:18:46 [1] Eval Extra: inv=0.4651612
17-03-25 19:18:46 [1] Checkpointing with new best dev accuracy of 0.658569
17-03-25 19:20:02 [1] Step: 28600 Acc: 0.64344 0.84788 Cost: 0.74023 0.57317 0.08841 0.07865 Time: 0.00070
17-03-25 19:20:02 [1] Train Extra: lr=0.0001318 inv=0.4156250 sub=0.0000000
17-03-25 19:21:24 [1] Step: 28700 Acc: 0.64156 0.84365 Cost: 1.34116 1.00985 0.25272 0.07858 Time: 0.00072
17-03-25 19:21:24 [1] Train Extra: lr=0.0001314 inv=0.4351563 sub=0.0000000
17-03-25 19:22:39 [1] Step: 28800 Acc: 0.64250 0.84261 Cost: 0.97138 0.69812 0.19457 0.07869 Time: 0.00070
17-03-25 19:22:39 [1] Train Extra: lr=0.0001310 inv=0.4420312 sub=0.0000000
17-03-25 19:23:46 [1] Step: 28900 Acc: 0.64156 0.84342 Cost: 1.10987 0.75916 0.27202 0.07868 Time: 0.00067
17-03-25 19:23:46 [1] Train Extra: lr=0.0001306 inv=0.4293750 sub=0.0000000
17-03-25 19:25:14 [1] Step: 29000 Acc: 0.63625 0.84395 Cost: 0.95795 0.75957 0.11963 0.07875 Time: 0.00072
17-03-25 19:25:14 [1] Train Extra: lr=0.0001303 inv=0.4618750 sub=0.0000000
17-03-25 19:26:06 [1] Step: 29000 Eval acc: 0.65813 0.84950 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 19:26:06 [1] Eval Extra: inv=0.3981890
17-03-25 19:27:32 [1] Step: 29100 Acc: 0.65156 0.84172 Cost: 1.13652 0.88011 0.17757 0.07884 Time: 0.00073
17-03-25 19:27:32 [1] Train Extra: lr=0.0001299 inv=0.4542188 sub=0.0000000
17-03-25 19:28:40 [1] Step: 29200 Acc: 0.64750 0.84583 Cost: 1.00737 0.74517 0.18335 0.07885 Time: 0.00068
17-03-25 19:28:40 [1] Train Extra: lr=0.0001295 inv=0.3890625 sub=0.0000000
17-03-25 19:29:55 [1] Step: 29300 Acc: 0.63406 0.84700 Cost: 1.13091 0.85606 0.19609 0.07876 Time: 0.00068
17-03-25 19:29:55 [1] Train Extra: lr=0.0001291 inv=0.4329688 sub=0.0000000
17-03-25 19:31:16 [1] Step: 29400 Acc: 0.65344 0.85300 Cost: 1.28863 0.89417 0.31563 0.07882 Time: 0.00074
17-03-25 19:31:16 [1] Train Extra: lr=0.0001288 inv=0.3954687 sub=0.0000000
17-03-25 19:32:36 [1] Step: 29500 Acc: 0.63750 0.84017 Cost: 1.14379 0.79368 0.27125 0.07886 Time: 0.00070
17-03-25 19:32:36 [1] Train Extra: lr=0.0001284 inv=0.4368750 sub=0.0000000
17-03-25 19:33:29 [1] Step: 29500 Eval acc: 0.65835 0.85266 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 19:33:29 [1] Eval Extra: inv=0.3875331
17-03-25 19:34:52 [1] Step: 29600 Acc: 0.64375 0.85108 Cost: 0.80176 0.56650 0.15630 0.07895 Time: 0.00076
17-03-25 19:34:52 [1] Train Extra: lr=0.0001280 inv=0.4212500 sub=0.0000000
17-03-25 19:35:55 [1] Step: 29700 Acc: 0.65063 0.84486 Cost: 1.11456 0.81374 0.22187 0.07895 Time: 0.00066
17-03-25 19:35:55 [1] Train Extra: lr=0.0001277 inv=0.3748437 sub=0.0000000
17-03-25 19:37:06 [1] Step: 29800 Acc: 0.64719 0.84260 Cost: 1.07834 0.83530 0.16407 0.07897 Time: 0.00068
17-03-25 19:37:06 [1] Train Extra: lr=0.0001273 inv=0.4212500 sub=0.0000000
17-03-25 19:38:12 [1] Step: 29900 Acc: 0.64000 0.84708 Cost: 1.38343 1.12360 0.18087 0.07896 Time: 0.00068
17-03-25 19:38:12 [1] Train Extra: lr=0.0001269 inv=0.3975000 sub=0.0000000
17-03-25 19:39:38 [1] Step: 30000 Acc: 0.62906 0.85481 Cost: 1.07975 0.81149 0.18926 0.07900 Time: 0.00075
17-03-25 19:39:38 [1] Train Extra: lr=0.0001266 inv=0.4303125 sub=0.0000000
17-03-25 19:40:31 [1] Step: 30000 Eval acc: 0.66133 0.84797 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 19:40:31 [1] Eval Extra: inv=0.4238074
17-03-25 19:40:31 [1] Checkpointing.
17-03-25 19:41:52 [1] Step: 30100 Acc: 0.63219 0.82817 Cost: 1.39055 0.98242 0.32922 0.07891 Time: 0.00067
17-03-25 19:41:52 [1] Train Extra: lr=0.0001262 inv=0.4820313 sub=0.0000000
17-03-25 19:43:03 [1] Step: 30200 Acc: 0.64062 0.85120 Cost: 1.11861 0.78726 0.25247 0.07888 Time: 0.00072
17-03-25 19:43:03 [1] Train Extra: lr=0.0001258 inv=0.3787500 sub=0.0000000
17-03-25 19:44:18 [1] Step: 30300 Acc: 0.64687 0.84555 Cost: 0.84067 0.66713 0.09463 0.07891 Time: 0.00069
17-03-25 19:44:18 [1] Train Extra: lr=0.0001255 inv=0.4187500 sub=0.0000000
17-03-25 19:45:33 [1] Step: 30400 Acc: 0.64844 0.84395 Cost: 0.92934 0.62503 0.22532 0.07900 Time: 0.00069
17-03-25 19:45:33 [1] Train Extra: lr=0.0001251 inv=0.4134375 sub=0.0000000
17-03-25 19:46:53 [1] Step: 30500 Acc: 0.64750 0.83164 Cost: 0.86308 0.67688 0.10707 0.07913 Time: 0.00069
17-03-25 19:46:53 [1] Train Extra: lr=0.0001248 inv=0.4431250 sub=0.0000000
17-03-25 19:47:47 [1] Step: 30500 Eval acc: 0.66100 0.85050 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 19:47:47 [1] Eval Extra: inv=0.4095627
17-03-25 19:49:18 [1] Step: 30600 Acc: 0.64938 0.84372 Cost: 1.11013 0.72053 0.31050 0.07910 Time: 0.00077
17-03-25 19:49:18 [1] Train Extra: lr=0.0001244 inv=0.4343750 sub=0.0000000
17-03-25 19:50:47 [1] Step: 30700 Acc: 0.63313 0.84508 Cost: 1.13003 0.69601 0.35495 0.07908 Time: 0.00076
17-03-25 19:50:47 [1] Train Extra: lr=0.0001240 inv=0.4459375 sub=0.0000000
17-03-25 19:52:07 [1] Step: 30800 Acc: 0.64031 0.84155 Cost: 1.08020 0.75964 0.24146 0.07909 Time: 0.00070
17-03-25 19:52:07 [1] Train Extra: lr=0.0001237 inv=0.4648438 sub=0.0000000
17-03-25 19:53:28 [1] Step: 30900 Acc: 0.64031 0.84079 Cost: 1.16345 0.78807 0.29627 0.07912 Time: 0.00072
17-03-25 19:53:28 [1] Train Extra: lr=0.0001233 inv=0.4428125 sub=0.0000000
17-03-25 19:54:47 [1] Step: 31000 Acc: 0.63875 0.84482 Cost: 1.07083 0.72106 0.27059 0.07918 Time: 0.00072
17-03-25 19:54:47 [1] Train Extra: lr=0.0001230 inv=0.4282813 sub=0.0000000
17-03-25 19:55:42 [1] Step: 31000 Eval acc: 0.65448 0.84837 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 19:55:42 [1] Eval Extra: inv=0.3905698
17-03-25 19:57:08 [1] Step: 31100 Acc: 0.64438 0.84575 Cost: 0.98917 0.71730 0.19271 0.07916 Time: 0.00076
17-03-25 19:57:08 [1] Train Extra: lr=0.0001226 inv=0.4420312 sub=0.0000000
17-03-25 19:58:25 [1] Step: 31200 Acc: 0.63625 0.84675 Cost: 0.85762 0.61783 0.16057 0.07922 Time: 0.00071
17-03-25 19:58:25 [1] Train Extra: lr=0.0001223 inv=0.4271875 sub=0.0000000
17-03-25 19:59:34 [1] Step: 31300 Acc: 0.63562 0.84814 Cost: 1.51658 1.14309 0.29431 0.07918 Time: 0.00068
17-03-25 19:59:34 [1] Train Extra: lr=0.0001219 inv=0.3943750 sub=0.0000000
17-03-25 20:00:55 [1] Step: 31400 Acc: 0.65812 0.84515 Cost: 1.05522 0.71703 0.25901 0.07918 Time: 0.00071
17-03-25 20:00:55 [1] Train Extra: lr=0.0001216 inv=0.4504687 sub=0.0000000
17-03-25 20:02:20 [1] Step: 31500 Acc: 0.65781 0.85092 Cost: 1.04849 0.76077 0.20849 0.07922 Time: 0.00075
17-03-25 20:02:20 [1] Train Extra: lr=0.0001212 inv=0.4237500 sub=0.0000000
17-03-25 20:03:09 [1] Step: 31500 Eval acc: 0.66023 0.85353 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00014
17-03-25 20:03:09 [1] Eval Extra: inv=0.4239731
17-03-25 20:04:21 [1] Step: 31600 Acc: 0.65250 0.85160 Cost: 1.09289 0.82024 0.19338 0.07927 Time: 0.00071
17-03-25 20:04:21 [1] Train Extra: lr=0.0001209 inv=0.4046875 sub=0.0000000
17-03-25 20:05:38 [1] Step: 31700 Acc: 0.65000 0.84753 Cost: 1.29562 0.98675 0.22960 0.07928 Time: 0.00070
17-03-25 20:05:38 [1] Train Extra: lr=0.0001205 inv=0.4059375 sub=0.0000000
17-03-25 20:06:56 [1] Step: 31800 Acc: 0.62219 0.83977 Cost: 0.80003 0.61546 0.10528 0.07930 Time: 0.00071
17-03-25 20:06:56 [1] Train Extra: lr=0.0001202 inv=0.4296875 sub=0.0000000
17-03-25 20:08:06 [1] Step: 31900 Acc: 0.63875 0.84515 Cost: 1.17530 0.87715 0.21882 0.07933 Time: 0.00066
17-03-25 20:08:06 [1] Train Extra: lr=0.0001198 inv=0.4192187 sub=0.0000000
17-03-25 20:09:24 [1] Step: 32000 Acc: 0.64094 0.84743 Cost: 0.99858 0.77734 0.14184 0.07940 Time: 0.00072
17-03-25 20:09:24 [1] Train Extra: lr=0.0001195 inv=0.3975000 sub=0.0000000
17-03-25 20:10:18 [1] Step: 32000 Eval acc: 0.66221 0.84614 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 20:10:18 [1] Eval Extra: inv=0.4313715
17-03-25 20:10:18 [1] Checkpointing with new best dev accuracy of 0.662213
17-03-25 20:11:40 [1] Step: 32100 Acc: 0.64094 0.84617 Cost: 0.97457 0.73736 0.15776 0.07945 Time: 0.00071
17-03-25 20:11:40 [1] Train Extra: lr=0.0001191 inv=0.4496875 sub=0.0000000
17-03-25 20:13:07 [1] Step: 32200 Acc: 0.64781 0.85682 Cost: 1.02516 0.73718 0.20848 0.07950 Time: 0.00076
17-03-25 20:13:07 [1] Train Extra: lr=0.0001188 inv=0.4335938 sub=0.0000000
17-03-25 20:14:29 [1] Step: 32300 Acc: 0.63750 0.84334 Cost: 0.89002 0.72701 0.08349 0.07952 Time: 0.00073
17-03-25 20:14:29 [1] Train Extra: lr=0.0001185 inv=0.4254688 sub=0.0000000
17-03-25 20:15:52 [1] Step: 32400 Acc: 0.64750 0.85202 Cost: 0.80493 0.63755 0.08784 0.07954 Time: 0.00076
17-03-25 20:15:52 [1] Train Extra: lr=0.0001181 inv=0.3985938 sub=0.0000000
17-03-25 20:17:29 [1] Step: 32500 Acc: 0.63375 0.84875 Cost: 1.27863 0.83679 0.36235 0.07949 Time: 0.00078
17-03-25 20:17:29 [1] Train Extra: lr=0.0001178 inv=0.4337500 sub=0.0000000
17-03-25 20:18:25 [1] Step: 32500 Eval acc: 0.66299 0.85126 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 20:18:25 [1] Eval Extra: inv=0.4780808
17-03-25 20:19:35 [1] Step: 32600 Acc: 0.64438 0.84526 Cost: 1.12083 0.81122 0.23013 0.07948 Time: 0.00070
17-03-25 20:19:35 [1] Train Extra: lr=0.0001174 inv=0.4121875 sub=0.0000000
17-03-25 20:20:52 [1] Step: 32700 Acc: 0.63531 0.84560 Cost: 1.37892 1.06018 0.23931 0.07943 Time: 0.00073
17-03-25 20:20:52 [1] Train Extra: lr=0.0001171 inv=0.3964063 sub=0.0000000
17-03-25 20:22:16 [1] Step: 32800 Acc: 0.64625 0.83112 Cost: 1.01878 0.78035 0.15904 0.07940 Time: 0.00071
17-03-25 20:22:16 [1] Train Extra: lr=0.0001168 inv=0.4537500 sub=0.0000000
17-03-25 20:23:39 [1] Step: 32900 Acc: 0.64719 0.84522 Cost: 1.27821 0.92517 0.27366 0.07938 Time: 0.00074
17-03-25 20:23:39 [1] Train Extra: lr=0.0001164 inv=0.4525000 sub=0.0000000
17-03-25 20:24:56 [1] Step: 33000 Acc: 0.64031 0.84239 Cost: 0.84971 0.59953 0.17074 0.07944 Time: 0.00070
17-03-25 20:24:56 [1] Train Extra: lr=0.0001161 inv=0.4381250 sub=0.0000000
17-03-25 20:25:52 [1] Step: 33000 Eval acc: 0.65647 0.84582 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 20:25:52 [1] Eval Extra: inv=0.4402606
17-03-25 20:27:16 [1] Step: 33100 Acc: 0.63875 0.84577 Cost: 1.12883 0.80795 0.24147 0.07941 Time: 0.00074
17-03-25 20:27:16 [1] Train Extra: lr=0.0001158 inv=0.4267187 sub=0.0000000
17-03-25 20:28:35 [1] Step: 33200 Acc: 0.64250 0.84544 Cost: 1.24004 0.93633 0.22427 0.07944 Time: 0.00073
17-03-25 20:28:35 [1] Train Extra: lr=0.0001154 inv=0.4206250 sub=0.0000000
17-03-25 20:30:01 [1] Step: 33300 Acc: 0.64687 0.83503 Cost: 1.02061 0.71144 0.22973 0.07944 Time: 0.00072
17-03-25 20:30:01 [1] Train Extra: lr=0.0001151 inv=0.4557813 sub=0.0000000
17-03-25 20:31:19 [1] Step: 33400 Acc: 0.66156 0.85500 Cost: 0.91576 0.67144 0.16483 0.07948 Time: 0.00076
17-03-25 20:31:19 [1] Train Extra: lr=0.0001148 inv=0.4018750 sub=0.0000000
17-03-25 20:32:37 [1] Step: 33500 Acc: 0.64219 0.84288 Cost: 0.99300 0.70732 0.20615 0.07953 Time: 0.00070
17-03-25 20:32:37 [1] Train Extra: lr=0.0001144 inv=0.4539063 sub=0.0000000
17-03-25 20:33:34 [1] Step: 33500 Eval acc: 0.66199 0.84824 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-25 20:33:34 [1] Eval Extra: inv=0.4223719
17-03-25 20:34:53 [1] Step: 33600 Acc: 0.63719 0.84111 Cost: 0.86923 0.67340 0.11619 0.07965 Time: 0.00074
17-03-25 20:34:53 [1] Train Extra: lr=0.0001141 inv=0.4151563 sub=0.0000000
17-03-25 20:36:10 [1] Step: 33700 Acc: 0.66375 0.84604 Cost: 1.07236 0.75208 0.24052 0.07976 Time: 0.00073
17-03-25 20:36:10 [1] Train Extra: lr=0.0001138 inv=0.4167188 sub=0.0000000
17-03-25 20:37:30 [1] Step: 33800 Acc: 0.66312 0.84463 Cost: 1.00107 0.69419 0.22696 0.07992 Time: 0.00073
17-03-25 20:37:30 [1] Train Extra: lr=0.0001135 inv=0.4400000 sub=0.0000000
17-03-25 20:38:51 [1] Step: 33900 Acc: 0.67031 0.84463 Cost: 1.13431 0.83213 0.22224 0.07994 Time: 0.00071
17-03-25 20:38:51 [1] Train Extra: lr=0.0001131 inv=0.4450000 sub=0.0000000
17-03-25 20:40:12 [1] Step: 34000 Acc: 0.65969 0.85767 Cost: 0.94956 0.76972 0.09980 0.08004 Time: 0.00079
17-03-25 20:40:12 [1] Train Extra: lr=0.0001128 inv=0.3754688 sub=0.0000000
17-03-25 20:41:05 [1] Step: 34000 Eval acc: 0.66718 0.84868 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 20:41:05 [1] Eval Extra: inv=0.4184519
17-03-25 20:41:05 [1] Checkpointing with new best dev accuracy of 0.667182
17-03-25 20:42:28 [1] Step: 34100 Acc: 0.66219 0.84053 Cost: 0.99711 0.74793 0.16901 0.08017 Time: 0.00074
17-03-25 20:42:28 [1] Train Extra: lr=0.0001125 inv=0.4067188 sub=0.0000000
17-03-25 20:43:55 [1] Step: 34200 Acc: 0.66719 0.85366 Cost: 1.10482 0.76629 0.25837 0.08016 Time: 0.00078
17-03-25 20:43:55 [1] Train Extra: lr=0.0001122 inv=0.4231250 sub=0.0000000
17-03-25 20:45:12 [1] Step: 34300 Acc: 0.65750 0.84319 Cost: 1.15287 0.82401 0.24855 0.08031 Time: 0.00073
17-03-25 20:45:12 [1] Train Extra: lr=0.0001118 inv=0.4156250 sub=0.0000000
17-03-25 20:46:36 [1] Step: 34400 Acc: 0.66187 0.84766 Cost: 0.99645 0.71143 0.20459 0.08043 Time: 0.00075
17-03-25 20:46:36 [1] Train Extra: lr=0.0001115 inv=0.4156250 sub=0.0000000
17-03-25 20:47:59 [1] Step: 34500 Acc: 0.65344 0.85099 Cost: 1.35161 1.01604 0.25512 0.08045 Time: 0.00076
17-03-25 20:47:59 [1] Train Extra: lr=0.0001112 inv=0.4006250 sub=0.0000000
17-03-25 20:48:51 [1] Step: 34500 Eval acc: 0.66034 0.85125 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00015
17-03-25 20:48:51 [1] Eval Extra: inv=0.4196113
17-03-25 20:50:13 [1] Step: 34600 Acc: 0.65781 0.84777 Cost: 1.07596 0.84887 0.14660 0.08049 Time: 0.00076
17-03-25 20:50:13 [1] Train Extra: lr=0.0001109 inv=0.4131250 sub=0.0000000
17-03-25 20:51:31 [1] Step: 34700 Acc: 0.65781 0.83963 Cost: 0.81659 0.64053 0.09545 0.08061 Time: 0.00071
17-03-25 20:51:31 [1] Train Extra: lr=0.0001106 inv=0.4320312 sub=0.0000000
17-03-25 20:52:44 [1] Step: 34800 Acc: 0.66844 0.84601 Cost: 1.15875 0.80707 0.27100 0.08068 Time: 0.00069
17-03-25 20:52:44 [1] Train Extra: lr=0.0001102 inv=0.4150000 sub=0.0000000
17-03-25 20:54:12 [1] Step: 34900 Acc: 0.66875 0.83943 Cost: 1.08158 0.81033 0.19049 0.08076 Time: 0.00074
17-03-25 20:54:12 [1] Train Extra: lr=0.0001099 inv=0.4542188 sub=0.0000000
17-03-25 20:55:31 [1] Step: 35000 Acc: 0.65812 0.84595 Cost: 1.22298 0.87723 0.26495 0.08079 Time: 0.00073
17-03-25 20:55:31 [1] Train Extra: lr=0.0001096 inv=0.4035937 sub=0.0000000
17-03-25 20:56:28 [1] Step: 35000 Eval acc: 0.66453 0.85044 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-25 20:56:28 [1] Eval Extra: inv=0.4473830
17-03-25 20:56:28 [1] Checkpointing.
17-03-25 20:57:54 [1] Step: 35100 Acc: 0.64844 0.84043 Cost: 0.89845 0.62774 0.18983 0.08087 Time: 0.00075
17-03-25 20:57:54 [1] Train Extra: lr=0.0001093 inv=0.4539063 sub=0.0000000
17-03-25 20:59:17 [1] Step: 35200 Acc: 0.65812 0.83587 Cost: 1.18891 0.88633 0.22159 0.08099 Time: 0.00071
17-03-25 20:59:17 [1] Train Extra: lr=0.0001090 inv=0.4546875 sub=0.0000000
17-03-25 21:00:41 [1] Step: 35300 Acc: 0.64594 0.84056 Cost: 1.11124 0.76778 0.26244 0.08103 Time: 0.00074
17-03-25 21:00:41 [1] Train Extra: lr=0.0001087 inv=0.4257812 sub=0.0000000
17-03-25 21:01:59 [1] Step: 35400 Acc: 0.67406 0.84513 Cost: 0.97400 0.65719 0.23566 0.08115 Time: 0.00072
17-03-25 21:01:59 [1] Train Extra: lr=0.0001084 inv=0.4245313 sub=0.0000000
17-03-25 21:03:18 [1] Step: 35500 Acc: 0.65438 0.84826 Cost: 0.96814 0.73506 0.15187 0.08122 Time: 0.00073
17-03-25 21:03:18 [1] Train Extra: lr=0.0001080 inv=0.4129687 sub=0.0000000
17-03-25 21:04:14 [1] Step: 35500 Eval acc: 0.66508 0.85055 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-25 21:04:14 [1] Eval Extra: inv=0.4180654
17-03-25 21:05:33 [1] Step: 35600 Acc: 0.66094 0.84607 Cost: 1.09889 0.79560 0.22206 0.08123 Time: 0.00072
17-03-25 21:05:33 [1] Train Extra: lr=0.0001077 inv=0.4104687 sub=0.0000000
17-03-25 21:07:03 [1] Step: 35700 Acc: 0.65375 0.84824 Cost: 0.97943 0.78266 0.11550 0.08126 Time: 0.00078
17-03-25 21:07:03 [1] Train Extra: lr=0.0001074 inv=0.4251563 sub=0.0000000
17-03-25 21:08:27 [1] Step: 35800 Acc: 0.65156 0.84055 Cost: 1.34013 0.94096 0.31786 0.08131 Time: 0.00074
17-03-25 21:08:27 [1] Train Extra: lr=0.0001071 inv=0.4268750 sub=0.0000000
17-03-25 21:09:52 [1] Step: 35900 Acc: 0.65969 0.84685 Cost: 0.80901 0.59239 0.13518 0.08145 Time: 0.00073
17-03-25 21:09:52 [1] Train Extra: lr=0.0001068 inv=0.4304688 sub=0.0000000
17-03-25 21:11:16 [1] Step: 36000 Acc: 0.65250 0.83484 Cost: 0.93476 0.68602 0.16720 0.08155 Time: 0.00071
17-03-25 21:11:16 [1] Train Extra: lr=0.0001065 inv=0.4604687 sub=0.0000000
17-03-25 21:12:13 [1] Step: 36000 Eval acc: 0.65802 0.84871 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-25 21:12:13 [1] Eval Extra: inv=0.4073542
17-03-25 21:13:29 [1] Step: 36100 Acc: 0.65938 0.84263 Cost: 1.23279 0.85096 0.30025 0.08158 Time: 0.00073
17-03-25 21:13:29 [1] Train Extra: lr=0.0001062 inv=0.3984375 sub=0.0000000
17-03-25 21:14:59 [1] Step: 36200 Acc: 0.66094 0.85684 Cost: 0.94793 0.64722 0.21903 0.08169 Time: 0.00080
17-03-25 21:14:59 [1] Train Extra: lr=0.0001059 inv=0.4157812 sub=0.0000000
17-03-25 21:16:20 [1] Step: 36300 Acc: 0.66938 0.84581 Cost: 1.10284 0.74258 0.27847 0.08179 Time: 0.00074
17-03-25 21:16:20 [1] Train Extra: lr=0.0001056 inv=0.4179688 sub=0.0000000
17-03-25 21:17:49 [1] Step: 36400 Acc: 0.65094 0.83856 Cost: 1.04696 0.73437 0.23074 0.08185 Time: 0.00075
17-03-25 21:17:49 [1] Train Extra: lr=0.0001053 inv=0.4717188 sub=0.0000000
17-03-25 21:19:21 [1] Step: 36500 Acc: 0.65094 0.84870 Cost: 1.01680 0.67426 0.26063 0.08191 Time: 0.00079
17-03-25 21:19:21 [1] Train Extra: lr=0.0001050 inv=0.4373437 sub=0.0000000
17-03-25 21:20:17 [1] Step: 36500 Eval acc: 0.66542 0.84752 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-25 21:20:17 [1] Eval Extra: inv=0.3668838
17-03-25 21:21:36 [1] Step: 36600 Acc: 0.66531 0.84388 Cost: 1.36098 1.07433 0.20468 0.08197 Time: 0.00071
17-03-25 21:21:36 [1] Train Extra: lr=0.0001047 inv=0.4459375 sub=0.0000000
17-03-25 21:22:59 [1] Step: 36700 Acc: 0.65469 0.85790 Cost: 1.09228 0.83007 0.18025 0.08196 Time: 0.00078
17-03-25 21:22:59 [1] Train Extra: lr=0.0001044 inv=0.3890625 sub=0.0000000
17-03-25 21:24:12 [1] Step: 36800 Acc: 0.65594 0.84634 Cost: 1.03987 0.70731 0.25052 0.08203 Time: 0.00071
17-03-25 21:24:12 [1] Train Extra: lr=0.0001041 inv=0.3992188 sub=0.0000000
17-03-25 21:25:30 [1] Step: 36900 Acc: 0.65875 0.84248 Cost: 1.24312 0.94315 0.21787 0.08210 Time: 0.00072
17-03-25 21:25:30 [1] Train Extra: lr=0.0001038 inv=0.4046875 sub=0.0000000
17-03-25 21:26:51 [1] Step: 37000 Acc: 0.66156 0.84229 Cost: 1.04480 0.72927 0.23331 0.08222 Time: 0.00075
17-03-25 21:26:51 [1] Train Extra: lr=0.0001035 inv=0.4032812 sub=0.0000000
17-03-25 21:27:48 [1] Step: 37000 Eval acc: 0.66166 0.85223 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-25 21:27:48 [1] Eval Extra: inv=0.4062500
17-03-25 21:29:07 [1] Step: 37100 Acc: 0.64500 0.84325 Cost: 0.82058 0.64597 0.09226 0.08234 Time: 0.00073
17-03-25 21:29:07 [1] Train Extra: lr=0.0001032 inv=0.4284375 sub=0.0000000
17-03-25 21:30:33 [1] Step: 37200 Acc: 0.67250 0.85392 Cost: 1.12313 0.76888 0.27187 0.08237 Time: 0.00076
17-03-25 21:30:33 [1] Train Extra: lr=0.0001029 inv=0.4131250 sub=0.0000000
17-03-25 21:31:46 [1] Step: 37300 Acc: 0.64719 0.84832 Cost: 1.00409 0.75617 0.16547 0.08245 Time: 0.00070
17-03-25 21:31:46 [1] Train Extra: lr=0.0001026 inv=0.3998437 sub=0.0000000
17-03-25 21:33:03 [1] Step: 37400 Acc: 0.65781 0.84022 Cost: 1.22312 0.99660 0.14405 0.08247 Time: 0.00070
17-03-25 21:33:03 [1] Train Extra: lr=0.0001023 inv=0.4373437 sub=0.0000000
17-03-25 21:34:18 [1] Step: 37500 Acc: 0.66063 0.84310 Cost: 1.19217 0.81688 0.29281 0.08248 Time: 0.00073
17-03-25 21:34:18 [1] Train Extra: lr=0.0001020 inv=0.4220313 sub=0.0000000
17-03-25 21:35:12 [1] Step: 37500 Eval acc: 0.66464 0.85175 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 21:35:12 [1] Eval Extra: inv=0.4370031
17-03-25 21:36:29 [1] Step: 37600 Acc: 0.64219 0.85176 Cost: 1.21445 0.88167 0.25022 0.08256 Time: 0.00074
17-03-25 21:36:29 [1] Train Extra: lr=0.0001017 inv=0.4031250 sub=0.0000000
17-03-25 21:37:59 [1] Step: 37700 Acc: 0.65531 0.83785 Cost: 1.05781 0.87607 0.09913 0.08261 Time: 0.00074
17-03-25 21:37:59 [1] Train Extra: lr=0.0001014 inv=0.4456250 sub=0.0000000
17-03-25 21:39:17 [1] Step: 37800 Acc: 0.65125 0.84254 Cost: 0.98009 0.72216 0.17536 0.08257 Time: 0.00072
17-03-25 21:39:17 [1] Train Extra: lr=0.0001011 inv=0.4185937 sub=0.0000000
17-03-25 21:40:41 [1] Step: 37900 Acc: 0.65875 0.84657 Cost: 1.31020 0.93953 0.28803 0.08265 Time: 0.00075
17-03-25 21:40:41 [1] Train Extra: lr=0.0001008 inv=0.4404688 sub=0.0000000
17-03-25 21:41:55 [1] Step: 38000 Acc: 0.65906 0.83958 Cost: 1.04253 0.83741 0.12240 0.08273 Time: 0.00068
17-03-25 21:41:55 [1] Train Extra: lr=0.0001005 inv=0.4406250 sub=0.0000000
17-03-25 21:42:51 [1] Step: 38000 Eval acc: 0.67083 0.84607 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 21:42:51 [1] Eval Extra: inv=0.4295495
17-03-25 21:42:51 [1] Checkpointing with new best dev accuracy of 0.670826
17-03-25 21:44:13 [1] Step: 38100 Acc: 0.65844 0.83812 Cost: 0.91760 0.69010 0.14479 0.08270 Time: 0.00072
17-03-25 21:44:13 [1] Train Extra: lr=0.0001003 inv=0.4410938 sub=0.0000000
17-03-25 21:45:35 [1] Step: 38200 Acc: 0.65812 0.83526 Cost: 0.98698 0.69724 0.20698 0.08276 Time: 0.00073
17-03-25 21:45:35 [1] Train Extra: lr=0.0001000 inv=0.4389062 sub=0.0000000
17-03-25 21:47:01 [1] Step: 38300 Acc: 0.64844 0.84643 Cost: 1.02342 0.78532 0.15527 0.08283 Time: 0.00074
17-03-25 21:47:01 [1] Train Extra: lr=0.0000997 inv=0.4403125 sub=0.0000000
17-03-25 21:48:18 [1] Step: 38400 Acc: 0.64375 0.84022 Cost: 1.00401 0.77082 0.15026 0.08293 Time: 0.00071
17-03-25 21:48:18 [1] Train Extra: lr=0.0000994 inv=0.4250000 sub=0.0000000
17-03-25 21:49:37 [1] Step: 38500 Acc: 0.66094 0.84027 Cost: 1.16837 0.89382 0.19158 0.08297 Time: 0.00072
17-03-25 21:49:37 [1] Train Extra: lr=0.0000991 inv=0.4009375 sub=0.0000000
17-03-25 21:50:33 [1] Step: 38500 Eval acc: 0.66453 0.84959 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-25 21:50:33 [1] Eval Extra: inv=0.4054770
17-03-25 21:51:58 [1] Step: 38600 Acc: 0.65563 0.85069 Cost: 0.93811 0.75129 0.10382 0.08300 Time: 0.00075
17-03-25 21:51:58 [1] Train Extra: lr=0.0000988 inv=0.4165625 sub=0.0000000
17-03-25 21:53:18 [1] Step: 38700 Acc: 0.65719 0.84010 Cost: 1.04455 0.74046 0.22106 0.08303 Time: 0.00069
17-03-25 21:53:18 [1] Train Extra: lr=0.0000985 inv=0.4503125 sub=0.0000000
17-03-25 21:54:40 [1] Step: 38800 Acc: 0.64625 0.84623 Cost: 0.90192 0.60505 0.21380 0.08307 Time: 0.00074
17-03-25 21:54:40 [1] Train Extra: lr=0.0000983 inv=0.4314062 sub=0.0000000
17-03-25 21:55:57 [1] Step: 38900 Acc: 0.66750 0.85043 Cost: 1.05040 0.86892 0.09836 0.08312 Time: 0.00074
17-03-25 21:55:57 [1] Train Extra: lr=0.0000980 inv=0.4126563 sub=0.0000000
17-03-25 21:57:17 [1] Step: 39000 Acc: 0.65281 0.84158 Cost: 1.37510 0.99890 0.29304 0.08316 Time: 0.00072
17-03-25 21:57:17 [1] Train Extra: lr=0.0000977 inv=0.4407813 sub=0.0000000
17-03-25 21:58:14 [1] Step: 39000 Eval acc: 0.66696 0.85278 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-25 21:58:14 [1] Eval Extra: inv=0.4177341
17-03-25 21:59:39 [1] Step: 39100 Acc: 0.65687 0.84961 Cost: 1.31214 1.03542 0.19357 0.08316 Time: 0.00076
17-03-25 21:59:39 [1] Train Extra: lr=0.0000974 inv=0.4135937 sub=0.0000000
17-03-25 22:01:04 [1] Step: 39200 Acc: 0.66938 0.84691 Cost: 1.20991 0.91050 0.21623 0.08318 Time: 0.00077
17-03-25 22:01:04 [1] Train Extra: lr=0.0000971 inv=0.4004687 sub=0.0000000
17-03-25 22:02:16 [1] Step: 39300 Acc: 0.65969 0.84883 Cost: 0.90985 0.65364 0.17302 0.08319 Time: 0.00071
17-03-25 22:02:16 [1] Train Extra: lr=0.0000969 inv=0.3918750 sub=0.0000000
17-03-25 22:03:39 [1] Step: 39400 Acc: 0.66531 0.84749 Cost: 1.10703 0.79534 0.22851 0.08318 Time: 0.00074
17-03-25 22:03:39 [1] Train Extra: lr=0.0000966 inv=0.4178125 sub=0.0000000
17-03-25 22:04:48 [1] Step: 39500 Acc: 0.66687 0.84518 Cost: 1.02487 0.81716 0.12441 0.08330 Time: 0.00069
17-03-25 22:04:48 [1] Train Extra: lr=0.0000963 inv=0.4139062 sub=0.0000000
17-03-25 22:05:45 [1] Step: 39500 Eval acc: 0.67016 0.84524 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-25 22:05:45 [1] Eval Extra: inv=0.4541740
17-03-25 22:07:10 [1] Step: 39600 Acc: 0.66031 0.84793 Cost: 1.18693 0.79335 0.31022 0.08337 Time: 0.00078
17-03-25 22:07:10 [1] Train Extra: lr=0.0000960 inv=0.4084375 sub=0.0000000
17-03-25 22:08:24 [1] Step: 39700 Acc: 0.64969 0.85232 Cost: 0.77096 0.58550 0.10210 0.08336 Time: 0.00074
17-03-25 22:08:24 [1] Train Extra: lr=0.0000957 inv=0.3882813 sub=0.0000000
17-03-25 22:09:50 [1] Step: 39800 Acc: 0.64438 0.84393 Cost: 1.17367 0.76362 0.32670 0.08335 Time: 0.00075
17-03-25 22:09:50 [1] Train Extra: lr=0.0000955 inv=0.4156250 sub=0.0000000
17-03-25 22:11:14 [1] Step: 39900 Acc: 0.64906 0.85599 Cost: 1.04839 0.73348 0.23152 0.08339 Time: 0.00077
17-03-25 22:11:14 [1] Train Extra: lr=0.0000952 inv=0.4062500 sub=0.0000000
17-03-25 22:12:38 [1] Step: 40000 Acc: 0.65438 0.85274 Cost: 1.26070 0.81632 0.36098 0.08340 Time: 0.00077
17-03-25 22:12:38 [1] Train Extra: lr=0.0000949 inv=0.4054687 sub=0.0000000
17-03-25 22:13:33 [1] Step: 40000 Eval acc: 0.66376 0.85663 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 22:13:33 [1] Eval Extra: inv=0.3473940
17-03-25 22:13:33 [1] Checkpointing.
17-03-25 22:14:49 [1] Step: 40100 Acc: 0.65375 0.84625 Cost: 1.02092 0.77770 0.15979 0.08344 Time: 0.00074
17-03-25 22:14:49 [1] Train Extra: lr=0.0000946 inv=0.4051562 sub=0.0000000
17-03-25 22:16:08 [1] Step: 40200 Acc: 0.66625 0.84622 Cost: 0.96090 0.71929 0.15813 0.08348 Time: 0.00072
17-03-25 22:16:08 [1] Train Extra: lr=0.0000944 inv=0.4253125 sub=0.0000000
17-03-25 22:17:33 [1] Step: 40300 Acc: 0.67000 0.83460 Cost: 1.07892 0.72947 0.26597 0.08348 Time: 0.00072
17-03-25 22:17:33 [1] Train Extra: lr=0.0000941 inv=0.4570312 sub=0.0000000
17-03-25 22:18:57 [1] Step: 40400 Acc: 0.63938 0.85092 Cost: 1.07970 0.82059 0.17561 0.08351 Time: 0.00075
17-03-25 22:18:57 [1] Train Extra: lr=0.0000938 inv=0.4081250 sub=0.0000000
17-03-25 22:20:19 [1] Step: 40500 Acc: 0.65469 0.84216 Cost: 1.30212 0.91771 0.30086 0.08355 Time: 0.00073
17-03-25 22:20:19 [1] Train Extra: lr=0.0000936 inv=0.4381250 sub=0.0000000
17-03-25 22:21:15 [1] Step: 40500 Eval acc: 0.66928 0.85428 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-25 22:21:15 [1] Eval Extra: inv=0.4057531
17-03-25 22:22:30 [1] Step: 40600 Acc: 0.65312 0.84486 Cost: 0.89827 0.69070 0.12399 0.08358 Time: 0.00071
17-03-25 22:22:30 [1] Train Extra: lr=0.0000933 inv=0.4229688 sub=0.0000000
17-03-25 22:23:54 [1] Step: 40700 Acc: 0.65594 0.85523 Cost: 1.06494 0.82851 0.15282 0.08360 Time: 0.00078
17-03-25 22:23:54 [1] Train Extra: lr=0.0000930 inv=0.4192187 sub=0.0000000
17-03-25 22:25:05 [1] Step: 40800 Acc: 0.65281 0.84835 Cost: 1.17150 0.77777 0.31004 0.08368 Time: 0.00073
17-03-25 22:25:05 [1] Train Extra: lr=0.0000928 inv=0.4023438 sub=0.0000000
17-03-25 22:26:23 [1] Step: 40900 Acc: 0.66844 0.84058 Cost: 1.02832 0.64302 0.30159 0.08372 Time: 0.00073
17-03-25 22:26:23 [1] Train Extra: lr=0.0000925 inv=0.4290625 sub=0.0000000
17-03-25 22:27:45 [1] Step: 41000 Acc: 0.64594 0.84558 Cost: 1.03403 0.70981 0.24048 0.08374 Time: 0.00073
17-03-25 22:27:45 [1] Train Extra: lr=0.0000922 inv=0.4250000 sub=0.0000000
17-03-25 22:28:42 [1] Step: 41000 Eval acc: 0.66409 0.84968 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-25 22:28:42 [1] Eval Extra: inv=0.4292182
17-03-25 22:30:03 [1] Step: 41100 Acc: 0.67281 0.84811 Cost: 1.09074 0.80079 0.20616 0.08379 Time: 0.00072
17-03-25 22:30:03 [1] Train Extra: lr=0.0000920 inv=0.4310937 sub=0.0000000
17-03-25 22:31:25 [1] Step: 41200 Acc: 0.64969 0.84111 Cost: 1.27769 0.92960 0.26425 0.08384 Time: 0.00074
17-03-25 22:31:25 [1] Train Extra: lr=0.0000917 inv=0.4334375 sub=0.0000000
17-03-25 22:32:43 [1] Step: 41300 Acc: 0.64406 0.84752 Cost: 1.19400 0.85752 0.25265 0.08383 Time: 0.00074
17-03-25 22:32:43 [1] Train Extra: lr=0.0000914 inv=0.4118750 sub=0.0000000
17-03-25 22:33:56 [1] Step: 41400 Acc: 0.65844 0.84513 Cost: 0.97492 0.76102 0.13008 0.08381 Time: 0.00071
17-03-25 22:33:56 [1] Train Extra: lr=0.0000912 inv=0.4112500 sub=0.0000000
17-03-25 22:35:20 [1] Step: 41500 Acc: 0.66469 0.85163 Cost: 1.25591 0.94724 0.22485 0.08382 Time: 0.00075
17-03-25 22:35:20 [1] Train Extra: lr=0.0000909 inv=0.4259375 sub=0.0000000
17-03-25 22:36:17 [1] Step: 41500 Eval acc: 0.66829 0.85018 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-25 22:36:17 [1] Eval Extra: inv=0.3815150
17-03-25 22:37:36 [1] Step: 41600 Acc: 0.65625 0.85206 Cost: 1.05874 0.86165 0.11325 0.08383 Time: 0.00075
17-03-25 22:37:36 [1] Train Extra: lr=0.0000907 inv=0.4118750 sub=0.0000000
17-03-25 22:38:59 [1] Step: 41700 Acc: 0.65531 0.84591 Cost: 1.21401 0.83638 0.29385 0.08378 Time: 0.00073
17-03-25 22:38:59 [1] Train Extra: lr=0.0000904 inv=0.4476562 sub=0.0000000
17-03-25 22:40:32 [1] Step: 41800 Acc: 0.65594 0.85524 Cost: 1.11970 0.81990 0.21598 0.08383 Time: 0.00078
17-03-25 22:40:32 [1] Train Extra: lr=0.0000901 inv=0.4353125 sub=0.0000000
17-03-25 22:42:03 [1] Step: 41900 Acc: 0.64062 0.84734 Cost: 1.05028 0.71607 0.25037 0.08385 Time: 0.00075
17-03-25 22:42:03 [1] Train Extra: lr=0.0000899 inv=0.4606250 sub=0.0000000
17-03-25 22:43:34 [1] Step: 42000 Acc: 0.66781 0.84401 Cost: 1.19143 0.78885 0.31863 0.08395 Time: 0.00079
17-03-25 22:43:34 [1] Train Extra: lr=0.0000896 inv=0.4356250 sub=0.0000000
17-03-25 22:44:32 [1] Step: 42000 Eval acc: 0.66508 0.85462 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-25 22:44:32 [1] Eval Extra: inv=0.4085689
17-03-25 22:45:52 [1] Step: 42100 Acc: 0.69281 0.84710 Cost: 0.95708 0.59600 0.27695 0.08414 Time: 0.00074
17-03-25 22:45:52 [1] Train Extra: lr=0.0000894 inv=0.4264062 sub=0.0000000
17-03-25 22:47:12 [1] Step: 42200 Acc: 0.66531 0.84841 Cost: 1.04107 0.69371 0.26315 0.08421 Time: 0.00074
17-03-25 22:47:12 [1] Train Extra: lr=0.0000891 inv=0.4156250 sub=0.0000000
17-03-25 22:48:45 [1] Step: 42300 Acc: 0.66406 0.84575 Cost: 1.31147 0.94006 0.28709 0.08432 Time: 0.00078
17-03-25 22:48:45 [1] Train Extra: lr=0.0000888 inv=0.4290625 sub=0.0000000
17-03-25 22:50:10 [1] Step: 42400 Acc: 0.66312 0.83455 Cost: 1.07765 0.71266 0.28055 0.08444 Time: 0.00072
17-03-25 22:50:10 [1] Train Extra: lr=0.0000886 inv=0.4459375 sub=0.0000000
17-03-25 22:51:29 [1] Step: 42500 Acc: 0.67500 0.84935 Cost: 0.78860 0.55052 0.15355 0.08453 Time: 0.00074
17-03-25 22:51:29 [1] Train Extra: lr=0.0000883 inv=0.4109375 sub=0.0000000
17-03-25 22:52:26 [1] Step: 42500 Eval acc: 0.66630 0.85161 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-25 22:52:26 [1] Eval Extra: inv=0.4141453
17-03-25 22:53:33 [1] Step: 42600 Acc: 0.69344 0.85265 Cost: 1.00485 0.69428 0.22595 0.08462 Time: 0.00069
17-03-25 22:53:33 [1] Train Extra: lr=0.0000881 inv=0.3821875 sub=0.0000000
17-03-25 22:55:02 [1] Step: 42700 Acc: 0.67000 0.85119 Cost: 1.08987 0.75341 0.25173 0.08472 Time: 0.00078
17-03-25 22:55:02 [1] Train Extra: lr=0.0000878 inv=0.4226563 sub=0.0000000
17-03-25 22:56:28 [1] Step: 42800 Acc: 0.67719 0.84260 Cost: 1.28720 0.93869 0.26375 0.08475 Time: 0.00073
17-03-25 22:56:28 [1] Train Extra: lr=0.0000876 inv=0.4417187 sub=0.0000000
17-03-25 22:57:59 [1] Step: 42900 Acc: 0.67375 0.84270 Cost: 1.31367 0.96025 0.26855 0.08487 Time: 0.00075
17-03-25 22:57:59 [1] Train Extra: lr=0.0000873 inv=0.4634375 sub=0.0000000
17-03-25 22:59:17 [1] Step: 43000 Acc: 0.65094 0.85083 Cost: 0.92182 0.67136 0.16558 0.08488 Time: 0.00073
17-03-25 22:59:17 [1] Train Extra: lr=0.0000871 inv=0.4417187 sub=0.0000000
17-03-25 23:00:13 [1] Step: 43000 Eval acc: 0.66718 0.85437 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-25 23:00:13 [1] Eval Extra: inv=0.4044280
17-03-25 23:01:43 [1] Step: 43100 Acc: 0.67281 0.84142 Cost: 1.13831 0.79687 0.25647 0.08497 Time: 0.00075
17-03-25 23:01:43 [1] Train Extra: lr=0.0000868 inv=0.4540625 sub=0.0000000
17-03-25 23:03:01 [1] Step: 43200 Acc: 0.66719 0.84347 Cost: 1.10008 0.76126 0.25377 0.08505 Time: 0.00074
17-03-25 23:03:01 [1] Train Extra: lr=0.0000866 inv=0.4179688 sub=0.0000000
17-03-25 23:04:15 [1] Step: 43300 Acc: 0.67312 0.84579 Cost: 1.02092 0.77520 0.16051 0.08522 Time: 0.00070
17-03-25 23:04:15 [1] Train Extra: lr=0.0000863 inv=0.4132812 sub=0.0000000
17-03-25 23:05:37 [1] Step: 43400 Acc: 0.66156 0.84785 Cost: 1.25393 0.86444 0.30420 0.08529 Time: 0.00075
17-03-25 23:05:37 [1] Train Extra: lr=0.0000861 inv=0.4232812 sub=0.0000000
17-03-25 23:06:53 [1] Step: 43500 Acc: 0.67625 0.85213 Cost: 1.58925 1.30029 0.20358 0.08537 Time: 0.00072
17-03-25 23:06:53 [1] Train Extra: lr=0.0000858 inv=0.4064063 sub=0.0000000
17-03-25 23:07:49 [1] Step: 43500 Eval acc: 0.66884 0.84436 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-25 23:07:49 [1] Eval Extra: inv=0.4849823
17-03-25 23:09:11 [1] Step: 43600 Acc: 0.66531 0.83683 Cost: 1.10227 0.84237 0.17450 0.08539 Time: 0.00073
17-03-25 23:09:11 [1] Train Extra: lr=0.0000856 inv=0.4564063 sub=0.0000000
17-03-25 23:10:37 [1] Step: 43700 Acc: 0.66687 0.85461 Cost: 1.27054 0.88766 0.29737 0.08551 Time: 0.00075
17-03-25 23:10:37 [1] Train Extra: lr=0.0000853 inv=0.4264062 sub=0.0000000
17-03-25 23:11:55 [1] Step: 43800 Acc: 0.66406 0.84694 Cost: 1.07984 0.73833 0.25588 0.08563 Time: 0.00072
17-03-25 23:11:55 [1] Train Extra: lr=0.0000851 inv=0.4373437 sub=0.0000000
17-03-25 23:13:18 [1] Step: 43900 Acc: 0.66438 0.84408 Cost: 0.99206 0.71967 0.18672 0.08567 Time: 0.00074
17-03-25 23:13:18 [1] Train Extra: lr=0.0000848 inv=0.4450000 sub=0.0000000
17-03-25 23:14:42 [1] Step: 44000 Acc: 0.67063 0.84818 Cost: 0.93856 0.64411 0.20873 0.08572 Time: 0.00075
17-03-25 23:14:42 [1] Train Extra: lr=0.0000846 inv=0.4354688 sub=0.0000000
17-03-25 23:15:40 [1] Step: 44000 Eval acc: 0.66807 0.85167 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-25 23:15:40 [1] Eval Extra: inv=0.3971400
17-03-25 23:17:01 [1] Step: 44100 Acc: 0.68500 0.85184 Cost: 1.22284 0.87646 0.26057 0.08581 Time: 0.00074
17-03-25 23:17:01 [1] Train Extra: lr=0.0000844 inv=0.4106250 sub=0.0000000
17-03-25 23:18:28 [1] Step: 44200 Acc: 0.67750 0.84563 Cost: 1.06784 0.75838 0.22350 0.08596 Time: 0.00075
17-03-25 23:18:28 [1] Train Extra: lr=0.0000841 inv=0.4376563 sub=0.0000000
17-03-25 23:19:45 [1] Step: 44300 Acc: 0.67937 0.85072 Cost: 1.03006 0.66118 0.28291 0.08597 Time: 0.00074
17-03-25 23:19:45 [1] Train Extra: lr=0.0000839 inv=0.3915625 sub=0.0000000
17-03-25 23:20:58 [1] Step: 44400 Acc: 0.67375 0.84742 Cost: 1.19213 0.97862 0.12742 0.08609 Time: 0.00070
17-03-25 23:20:58 [1] Train Extra: lr=0.0000836 inv=0.4293750 sub=0.0000000
17-03-25 23:22:17 [1] Step: 44500 Acc: 0.65281 0.85344 Cost: 1.09477 0.83344 0.17522 0.08612 Time: 0.00075
17-03-25 23:22:17 [1] Train Extra: lr=0.0000834 inv=0.3942188 sub=0.0000000
17-03-25 23:23:13 [1] Step: 44500 Eval acc: 0.66718 0.84536 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-25 23:23:13 [1] Eval Extra: inv=0.4659342
17-03-25 23:24:31 [1] Step: 44600 Acc: 0.67875 0.85711 Cost: 1.02312 0.80828 0.12873 0.08611 Time: 0.00077
17-03-25 23:24:31 [1] Train Extra: lr=0.0000832 inv=0.3928125 sub=0.0000000
17-03-25 23:25:38 [1] Step: 44700 Acc: 0.67031 0.84839 Cost: 1.15369 0.89068 0.17689 0.08612 Time: 0.00070
17-03-25 23:25:38 [1] Train Extra: lr=0.0000829 inv=0.3989063 sub=0.0000000
17-03-25 23:27:01 [1] Step: 44800 Acc: 0.67937 0.85025 Cost: 1.26361 0.99270 0.18475 0.08617 Time: 0.00077
17-03-25 23:27:01 [1] Train Extra: lr=0.0000827 inv=0.3990625 sub=0.0000000
17-03-25 23:28:22 [1] Step: 44900 Acc: 0.66750 0.83795 Cost: 0.85198 0.60627 0.15945 0.08627 Time: 0.00070
17-03-25 23:28:22 [1] Train Extra: lr=0.0000824 inv=0.4373437 sub=0.0000000
17-03-25 23:29:39 [1] Step: 45000 Acc: 0.68219 0.85278 Cost: 0.91474 0.65700 0.17141 0.08633 Time: 0.00074
17-03-25 23:29:39 [1] Train Extra: lr=0.0000822 inv=0.3823437 sub=0.0000000
17-03-25 23:30:37 [1] Step: 45000 Eval acc: 0.66354 0.84657 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-25 23:30:37 [1] Eval Extra: inv=0.4165746
17-03-25 23:30:37 [1] Checkpointing.
17-03-25 23:31:55 [1] Step: 45100 Acc: 0.66844 0.84490 Cost: 1.01853 0.73188 0.20026 0.08640 Time: 0.00073
17-03-25 23:31:55 [1] Train Extra: lr=0.0000820 inv=0.4196875 sub=0.0000000
17-03-25 23:33:21 [1] Step: 45200 Acc: 0.65812 0.84894 Cost: 1.30858 0.96512 0.25702 0.08645 Time: 0.00076
17-03-25 23:33:21 [1] Train Extra: lr=0.0000817 inv=0.4351563 sub=0.0000000
17-03-25 23:34:37 [1] Step: 45300 Acc: 0.65656 0.84906 Cost: 1.19067 0.90138 0.20279 0.08649 Time: 0.00073
17-03-25 23:34:37 [1] Train Extra: lr=0.0000815 inv=0.4109375 sub=0.0000000
17-03-25 23:35:59 [1] Step: 45400 Acc: 0.66938 0.84265 Cost: 0.88759 0.58931 0.21166 0.08661 Time: 0.00074
17-03-25 23:35:59 [1] Train Extra: lr=0.0000813 inv=0.4409375 sub=0.0000000
17-03-25 23:37:18 [1] Step: 45500 Acc: 0.67281 0.84835 Cost: 1.32741 0.92532 0.31543 0.08665 Time: 0.00074
17-03-25 23:37:18 [1] Train Extra: lr=0.0000810 inv=0.4201563 sub=0.0000000
17-03-25 23:38:14 [1] Step: 45500 Eval acc: 0.67005 0.85334 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-25 23:38:14 [1] Eval Extra: inv=0.4097836
17-03-25 23:39:39 [1] Step: 45600 Acc: 0.66031 0.83707 Cost: 0.85860 0.61402 0.15780 0.08678 Time: 0.00073
17-03-25 23:39:39 [1] Train Extra: lr=0.0000808 inv=0.4548437 sub=0.0000000
17-03-25 23:41:00 [1] Step: 45700 Acc: 0.67531 0.84552 Cost: 0.94410 0.60809 0.24922 0.08679 Time: 0.00072
17-03-25 23:41:00 [1] Train Extra: lr=0.0000806 inv=0.4495312 sub=0.0000000
17-03-25 23:42:19 [1] Step: 45800 Acc: 0.68750 0.84989 Cost: 1.17974 0.94697 0.14597 0.08681 Time: 0.00074
17-03-25 23:42:19 [1] Train Extra: lr=0.0000803 inv=0.4045313 sub=0.0000000
17-03-25 23:43:44 [1] Step: 45900 Acc: 0.66500 0.84530 Cost: 0.90645 0.65953 0.16000 0.08692 Time: 0.00076
17-03-25 23:43:44 [1] Train Extra: lr=0.0000801 inv=0.4225000 sub=0.0000000
17-03-25 23:45:07 [1] Step: 46000 Acc: 0.66281 0.84661 Cost: 0.90243 0.62066 0.19482 0.08695 Time: 0.00074
17-03-25 23:45:07 [1] Train Extra: lr=0.0000799 inv=0.4407813 sub=0.0000000
17-03-25 23:46:03 [1] Step: 46000 Eval acc: 0.66928 0.85162 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-25 23:46:03 [1] Eval Extra: inv=0.4096731
17-03-25 23:47:19 [1] Step: 46100 Acc: 0.67281 0.85450 Cost: 1.09099 0.85763 0.14635 0.08701 Time: 0.00076
17-03-25 23:47:19 [1] Train Extra: lr=0.0000796 inv=0.3820312 sub=0.0000000
17-03-25 23:48:37 [1] Step: 46200 Acc: 0.67469 0.84571 Cost: 1.04569 0.79676 0.16178 0.08715 Time: 0.00074
17-03-25 23:48:37 [1] Train Extra: lr=0.0000794 inv=0.4115625 sub=0.0000000
17-03-25 23:49:58 [1] Step: 46300 Acc: 0.65594 0.85080 Cost: 0.76377 0.51065 0.16587 0.08725 Time: 0.00074
17-03-25 23:49:58 [1] Train Extra: lr=0.0000792 inv=0.4229688 sub=0.0000000
17-03-25 23:51:29 [1] Step: 46400 Acc: 0.66281 0.84990 Cost: 1.07324 0.73657 0.24941 0.08726 Time: 0.00079
17-03-25 23:51:29 [1] Train Extra: lr=0.0000790 inv=0.4295312 sub=0.0000000
17-03-25 23:52:55 [1] Step: 46500 Acc: 0.67875 0.85438 Cost: 1.09298 0.79033 0.21538 0.08727 Time: 0.00078
17-03-25 23:52:55 [1] Train Extra: lr=0.0000787 inv=0.4156250 sub=0.0000000
17-03-25 23:53:53 [1] Step: 46500 Eval acc: 0.67204 0.85053 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-25 23:53:53 [1] Eval Extra: inv=0.4194457
17-03-25 23:55:06 [1] Step: 46600 Acc: 0.66906 0.85130 Cost: 1.03536 0.81561 0.13240 0.08735 Time: 0.00071
17-03-25 23:55:06 [1] Train Extra: lr=0.0000785 inv=0.3981250 sub=0.0000000
17-03-25 23:56:43 [1] Step: 46700 Acc: 0.67281 0.85631 Cost: 0.76623 0.52755 0.15126 0.08742 Time: 0.00081
17-03-25 23:56:43 [1] Train Extra: lr=0.0000783 inv=0.4204688 sub=0.0000000
17-03-25 23:58:04 [1] Step: 46800 Acc: 0.67031 0.84636 Cost: 1.01066 0.72842 0.19476 0.08749 Time: 0.00073
17-03-25 23:58:04 [1] Train Extra: lr=0.0000781 inv=0.4278125 sub=0.0000000
17-03-25 23:59:22 [1] Step: 46900 Acc: 0.66844 0.84495 Cost: 0.99637 0.68590 0.22293 0.08753 Time: 0.00073
17-03-25 23:59:22 [1] Train Extra: lr=0.0000778 inv=0.4184375 sub=0.0000000
17-03-26 00:00:45 [1] Step: 47000 Acc: 0.65187 0.85023 Cost: 0.98275 0.65273 0.24253 0.08749 Time: 0.00076
17-03-26 00:00:45 [1] Train Extra: lr=0.0000776 inv=0.4215625 sub=0.0000000
17-03-26 00:01:41 [1] Step: 47000 Eval acc: 0.66906 0.85418 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 00:01:41 [1] Eval Extra: inv=0.3984099
17-03-26 00:03:00 [1] Step: 47100 Acc: 0.66875 0.84787 Cost: 1.11944 0.72646 0.30542 0.08755 Time: 0.00074
17-03-26 00:03:00 [1] Train Extra: lr=0.0000774 inv=0.4210937 sub=0.0000000
17-03-26 00:04:28 [1] Step: 47200 Acc: 0.65312 0.84043 Cost: 1.18674 0.87457 0.22461 0.08756 Time: 0.00075
17-03-26 00:04:28 [1] Train Extra: lr=0.0000772 inv=0.4448437 sub=0.0000000
17-03-26 00:05:53 [1] Step: 47300 Acc: 0.66094 0.84559 Cost: 0.96820 0.71162 0.16903 0.08755 Time: 0.00075
17-03-26 00:05:53 [1] Train Extra: lr=0.0000769 inv=0.4154687 sub=0.0000000
17-03-26 00:07:11 [1] Step: 47400 Acc: 0.66719 0.85135 Cost: 1.19342 0.83805 0.26779 0.08758 Time: 0.00072
17-03-26 00:07:11 [1] Train Extra: lr=0.0000767 inv=0.4162500 sub=0.0000000
17-03-26 00:08:23 [1] Step: 47500 Acc: 0.65938 0.85056 Cost: 1.15599 0.76922 0.29909 0.08769 Time: 0.00071
17-03-26 00:08:23 [1] Train Extra: lr=0.0000765 inv=0.3932813 sub=0.0000000
17-03-26 00:09:20 [1] Step: 47500 Eval acc: 0.67281 0.85322 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 00:09:20 [1] Eval Extra: inv=0.3978578
17-03-26 00:10:51 [1] Step: 47600 Acc: 0.66719 0.84539 Cost: 1.04288 0.72779 0.22738 0.08772 Time: 0.00078
17-03-26 00:10:51 [1] Train Extra: lr=0.0000763 inv=0.4357813 sub=0.0000000
17-03-26 00:12:16 [1] Step: 47700 Acc: 0.65563 0.84633 Cost: 1.08927 0.76703 0.23446 0.08778 Time: 0.00076
17-03-26 00:12:16 [1] Train Extra: lr=0.0000761 inv=0.4340625 sub=0.0000000
17-03-26 00:13:37 [1] Step: 47800 Acc: 0.65469 0.84060 Cost: 1.15238 0.73104 0.33361 0.08773 Time: 0.00072
17-03-26 00:13:37 [1] Train Extra: lr=0.0000758 inv=0.4262500 sub=0.0000000
17-03-26 00:15:00 [1] Step: 47900 Acc: 0.66469 0.84260 Cost: 1.21581 0.93840 0.18969 0.08771 Time: 0.00076
17-03-26 00:15:00 [1] Train Extra: lr=0.0000756 inv=0.4312500 sub=0.0000000
17-03-26 00:16:20 [1] Step: 48000 Acc: 0.66344 0.84905 Cost: 0.97196 0.70294 0.18129 0.08774 Time: 0.00075
17-03-26 00:16:20 [1] Train Extra: lr=0.0000754 inv=0.4023438 sub=0.0000000
17-03-26 00:17:17 [1] Step: 48000 Eval acc: 0.67281 0.85555 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 00:17:17 [1] Eval Extra: inv=0.4287213
17-03-26 00:18:43 [1] Step: 48100 Acc: 0.65063 0.83908 Cost: 1.00698 0.71265 0.20644 0.08789 Time: 0.00074
17-03-26 00:18:43 [1] Train Extra: lr=0.0000752 inv=0.4545312 sub=0.0000000
17-03-26 00:20:11 [1] Step: 48200 Acc: 0.66438 0.85035 Cost: 1.10366 0.80457 0.21113 0.08795 Time: 0.00077
17-03-26 00:20:11 [1] Train Extra: lr=0.0000750 inv=0.4231250 sub=0.0000000
17-03-26 00:21:44 [1] Step: 48300 Acc: 0.66344 0.83915 Cost: 1.00218 0.69773 0.21649 0.08796 Time: 0.00075
17-03-26 00:21:44 [1] Train Extra: lr=0.0000748 inv=0.4525000 sub=0.0000000
17-03-26 00:23:02 [1] Step: 48400 Acc: 0.66594 0.85933 Cost: 1.01787 0.83658 0.09332 0.08797 Time: 0.00077
17-03-26 00:23:02 [1] Train Extra: lr=0.0000745 inv=0.3882813 sub=0.0000000
17-03-26 00:24:37 [1] Step: 48500 Acc: 0.65594 0.84822 Cost: 1.10367 0.73796 0.27766 0.08804 Time: 0.00079
17-03-26 00:24:37 [1] Train Extra: lr=0.0000743 inv=0.4537500 sub=0.0000000
17-03-26 00:25:35 [1] Step: 48500 Eval acc: 0.67392 0.85137 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 00:25:35 [1] Eval Extra: inv=0.4437390
17-03-26 00:27:02 [1] Step: 48600 Acc: 0.65375 0.84632 Cost: 1.46308 1.08038 0.29464 0.08807 Time: 0.00074
17-03-26 00:27:02 [1] Train Extra: lr=0.0000741 inv=0.4406250 sub=0.0000000
17-03-26 00:28:17 [1] Step: 48700 Acc: 0.66344 0.84688 Cost: 1.16598 0.87355 0.20434 0.08809 Time: 0.00073
17-03-26 00:28:17 [1] Train Extra: lr=0.0000739 inv=0.3998437 sub=0.0000000
17-03-26 00:29:43 [1] Step: 48800 Acc: 0.66969 0.84740 Cost: 1.14229 0.91262 0.14165 0.08803 Time: 0.00074
17-03-26 00:29:43 [1] Train Extra: lr=0.0000737 inv=0.4289062 sub=0.0000000
17-03-26 00:31:04 [1] Step: 48900 Acc: 0.66750 0.85301 Cost: 0.76669 0.57845 0.10018 0.08806 Time: 0.00076
17-03-26 00:31:04 [1] Train Extra: lr=0.0000735 inv=0.4376563 sub=0.0000000
17-03-26 00:32:31 [1] Step: 49000 Acc: 0.65969 0.84740 Cost: 1.12588 0.82730 0.21036 0.08821 Time: 0.00074
17-03-26 00:32:31 [1] Train Extra: lr=0.0000733 inv=0.4556250 sub=0.0000000
17-03-26 00:33:29 [1] Step: 49000 Eval acc: 0.67259 0.85460 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 00:33:29 [1] Eval Extra: inv=0.3809077
17-03-26 00:34:51 [1] Step: 49100 Acc: 0.67125 0.84064 Cost: 1.04995 0.68599 0.27574 0.08822 Time: 0.00075
17-03-26 00:34:51 [1] Train Extra: lr=0.0000731 inv=0.4142188 sub=0.0000000
17-03-26 00:36:03 [1] Step: 49200 Acc: 0.66687 0.84412 Cost: 1.14852 0.80534 0.25490 0.08828 Time: 0.00070
17-03-26 00:36:03 [1] Train Extra: lr=0.0000728 inv=0.4271875 sub=0.0000000
17-03-26 00:37:22 [1] Step: 49300 Acc: 0.66281 0.84458 Cost: 0.93342 0.75151 0.09366 0.08826 Time: 0.00072
17-03-26 00:37:22 [1] Train Extra: lr=0.0000726 inv=0.4487500 sub=0.0000000
17-03-26 00:38:41 [1] Step: 49400 Acc: 0.65812 0.84729 Cost: 1.10004 0.78811 0.22363 0.08831 Time: 0.00074
17-03-26 00:38:41 [1] Train Extra: lr=0.0000724 inv=0.4085937 sub=0.0000000
17-03-26 00:40:03 [1] Step: 49500 Acc: 0.67188 0.85157 Cost: 1.07904 0.74903 0.24167 0.08834 Time: 0.00076
17-03-26 00:40:03 [1] Train Extra: lr=0.0000722 inv=0.4145313 sub=0.0000000
17-03-26 00:40:58 [1] Step: 49500 Eval acc: 0.67856 0.84797 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-26 00:40:58 [1] Eval Extra: inv=0.4293838
17-03-26 00:40:58 [1] Checkpointing with new best dev accuracy of 0.678556
17-03-26 00:42:19 [1] Step: 49600 Acc: 0.66875 0.85065 Cost: 0.91674 0.70783 0.12046 0.08844 Time: 0.00073
17-03-26 00:42:19 [1] Train Extra: lr=0.0000720 inv=0.4217187 sub=0.0000000
17-03-26 00:43:37 [1] Step: 49700 Acc: 0.67031 0.84497 Cost: 1.20016 0.96946 0.14228 0.08843 Time: 0.00073
17-03-26 00:43:37 [1] Train Extra: lr=0.0000718 inv=0.4306250 sub=0.0000000
17-03-26 00:44:55 [1] Step: 49800 Acc: 0.66969 0.84968 Cost: 0.97069 0.58524 0.29702 0.08842 Time: 0.00073
17-03-26 00:44:55 [1] Train Extra: lr=0.0000716 inv=0.4084375 sub=0.0000000
17-03-26 00:46:24 [1] Step: 49900 Acc: 0.65844 0.85012 Cost: 1.18561 0.82864 0.26852 0.08845 Time: 0.00078
17-03-26 00:46:24 [1] Train Extra: lr=0.0000714 inv=0.4337500 sub=0.0000000
17-03-26 00:47:44 [1] Step: 50000 Acc: 0.68875 0.84404 Cost: 0.91180 0.54227 0.28097 0.08857 Time: 0.00074
17-03-26 00:47:44 [1] Train Extra: lr=0.0000712 inv=0.4215625 sub=0.0000000
17-03-26 00:48:42 [1] Step: 50000 Eval acc: 0.67259 0.85289 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 00:48:42 [1] Eval Extra: inv=0.4052562
17-03-26 00:48:42 [1] Checkpointing.
17-03-26 00:50:05 [1] Step: 50100 Acc: 0.66219 0.84067 Cost: 1.21949 0.83008 0.30083 0.08858 Time: 0.00074
17-03-26 00:50:05 [1] Train Extra: lr=0.0000710 inv=0.4317187 sub=0.0000000
17-03-26 00:51:24 [1] Step: 50200 Acc: 0.64938 0.85076 Cost: 1.14547 0.90150 0.15536 0.08862 Time: 0.00073
17-03-26 00:51:24 [1] Train Extra: lr=0.0000708 inv=0.4046875 sub=0.0000000
17-03-26 00:52:50 [1] Step: 50300 Acc: 0.67406 0.84726 Cost: 1.25737 0.91610 0.25264 0.08862 Time: 0.00075
17-03-26 00:52:50 [1] Train Extra: lr=0.0000706 inv=0.4279688 sub=0.0000000
17-03-26 00:54:05 [1] Step: 50400 Acc: 0.68969 0.84838 Cost: 0.90090 0.57974 0.23250 0.08865 Time: 0.00072
17-03-26 00:54:05 [1] Train Extra: lr=0.0000704 inv=0.3998437 sub=0.0000000
17-03-26 00:55:36 [1] Step: 50500 Acc: 0.68375 0.84073 Cost: 1.05656 0.74468 0.22309 0.08879 Time: 0.00075
17-03-26 00:55:36 [1] Train Extra: lr=0.0000702 inv=0.4343750 sub=0.0000000
17-03-26 00:56:34 [1] Step: 50500 Eval acc: 0.67105 0.85573 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 00:56:34 [1] Eval Extra: inv=0.3761595
17-03-26 00:58:00 [1] Step: 50600 Acc: 0.67250 0.84265 Cost: 0.93287 0.76757 0.07648 0.08882 Time: 0.00074
17-03-26 00:58:00 [1] Train Extra: lr=0.0000700 inv=0.4351563 sub=0.0000000
17-03-26 00:59:12 [1] Step: 50700 Acc: 0.69281 0.84802 Cost: 0.79453 0.53867 0.16691 0.08895 Time: 0.00072
17-03-26 00:59:12 [1] Train Extra: lr=0.0000698 inv=0.4089063 sub=0.0000000
17-03-26 01:00:31 [1] Step: 50800 Acc: 0.67594 0.84181 Cost: 0.80563 0.64195 0.07460 0.08908 Time: 0.00073
17-03-26 01:00:31 [1] Train Extra: lr=0.0000696 inv=0.4267187 sub=0.0000000
17-03-26 01:01:47 [1] Step: 50900 Acc: 0.68812 0.84178 Cost: 0.91608 0.56079 0.26617 0.08912 Time: 0.00071
17-03-26 01:01:47 [1] Train Extra: lr=0.0000694 inv=0.4204688 sub=0.0000000
17-03-26 01:03:16 [1] Step: 51000 Acc: 0.68844 0.85199 Cost: 1.03532 0.79109 0.15499 0.08924 Time: 0.00076
17-03-26 01:03:16 [1] Train Extra: lr=0.0000692 inv=0.4439063 sub=0.0000000
17-03-26 01:04:13 [1] Step: 51000 Eval acc: 0.67303 0.85618 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 01:04:13 [1] Eval Extra: inv=0.4047593
17-03-26 01:05:38 [1] Step: 51100 Acc: 0.68219 0.84197 Cost: 1.07931 0.66050 0.32946 0.08935 Time: 0.00073
17-03-26 01:05:38 [1] Train Extra: lr=0.0000690 inv=0.4437500 sub=0.0000000
17-03-26 01:07:07 [1] Step: 51200 Acc: 0.69125 0.84816 Cost: 0.80127 0.55645 0.15536 0.08946 Time: 0.00077
17-03-26 01:07:07 [1] Train Extra: lr=0.0000688 inv=0.4375000 sub=0.0000000
17-03-26 01:08:22 [1] Step: 51300 Acc: 0.68063 0.85021 Cost: 0.95337 0.70481 0.15903 0.08953 Time: 0.00072
17-03-26 01:08:22 [1] Train Extra: lr=0.0000686 inv=0.4121875 sub=0.0000000
17-03-26 01:09:46 [1] Step: 51400 Acc: 0.68875 0.85267 Cost: 0.96700 0.61635 0.26099 0.08966 Time: 0.00077
17-03-26 01:09:46 [1] Train Extra: lr=0.0000684 inv=0.4054687 sub=0.0000000
17-03-26 01:11:01 [1] Step: 51500 Acc: 0.69312 0.84732 Cost: 0.83603 0.64774 0.09856 0.08974 Time: 0.00072
17-03-26 01:11:01 [1] Train Extra: lr=0.0000682 inv=0.4165625 sub=0.0000000
17-03-26 01:11:58 [1] Step: 51500 Eval acc: 0.67171 0.84732 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 01:11:58 [1] Eval Extra: inv=0.4133171
17-03-26 01:13:16 [1] Step: 51600 Acc: 0.66938 0.84349 Cost: 0.91852 0.57506 0.25360 0.08986 Time: 0.00072
17-03-26 01:13:16 [1] Train Extra: lr=0.0000680 inv=0.4218750 sub=0.0000000
17-03-26 01:14:30 [1] Step: 51700 Acc: 0.68781 0.85743 Cost: 1.32740 0.92858 0.30892 0.08990 Time: 0.00075
17-03-26 01:14:30 [1] Train Extra: lr=0.0000678 inv=0.3976562 sub=0.0000000
17-03-26 01:15:55 [1] Step: 51800 Acc: 0.68437 0.85532 Cost: 1.16278 0.79675 0.27605 0.08998 Time: 0.00077
17-03-26 01:15:55 [1] Train Extra: lr=0.0000676 inv=0.4176563 sub=0.0000000
17-03-26 01:17:23 [1] Step: 51900 Acc: 0.67219 0.85512 Cost: 0.89559 0.50533 0.30022 0.09005 Time: 0.00076
17-03-26 01:17:23 [1] Train Extra: lr=0.0000674 inv=0.4325000 sub=0.0000000
17-03-26 01:18:41 [1] Step: 52000 Acc: 0.68563 0.85063 Cost: 0.90800 0.62130 0.19653 0.09017 Time: 0.00075
17-03-26 01:18:41 [1] Train Extra: lr=0.0000672 inv=0.4054687 sub=0.0000000
17-03-26 01:19:39 [1] Step: 52000 Eval acc: 0.67535 0.85345 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 01:19:39 [1] Eval Extra: inv=0.4365614
17-03-26 01:21:04 [1] Step: 52100 Acc: 0.67750 0.84726 Cost: 1.16951 0.86646 0.21288 0.09016 Time: 0.00074
17-03-26 01:21:04 [1] Train Extra: lr=0.0000670 inv=0.4248438 sub=0.0000000
17-03-26 01:22:23 [1] Step: 52200 Acc: 0.68250 0.84406 Cost: 0.96839 0.68724 0.19095 0.09020 Time: 0.00073
17-03-26 01:22:23 [1] Train Extra: lr=0.0000668 inv=0.4075000 sub=0.0000000
17-03-26 01:23:52 [1] Step: 52300 Acc: 0.67469 0.83904 Cost: 1.02850 0.77153 0.16669 0.09029 Time: 0.00075
17-03-26 01:23:52 [1] Train Extra: lr=0.0000666 inv=0.4595313 sub=0.0000000
17-03-26 01:25:11 [1] Step: 52400 Acc: 0.68094 0.84696 Cost: 0.76880 0.55877 0.11965 0.09038 Time: 0.00072
17-03-26 01:25:11 [1] Train Extra: lr=0.0000664 inv=0.4079687 sub=0.0000000
17-03-26 01:26:30 [1] Step: 52500 Acc: 0.66969 0.84530 Cost: 1.16767 0.84132 0.23586 0.09048 Time: 0.00072
17-03-26 01:26:30 [1] Train Extra: lr=0.0000663 inv=0.4317187 sub=0.0000000
17-03-26 01:27:30 [1] Step: 52500 Eval acc: 0.67878 0.84957 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00018
17-03-26 01:27:30 [1] Eval Extra: inv=0.4770318
17-03-26 01:28:51 [1] Step: 52600 Acc: 0.68219 0.85397 Cost: 1.00886 0.75609 0.16220 0.09057 Time: 0.00075
17-03-26 01:28:51 [1] Train Extra: lr=0.0000661 inv=0.3896875 sub=0.0000000
17-03-26 01:30:16 [1] Step: 52700 Acc: 0.67344 0.84819 Cost: 0.97582 0.75798 0.12722 0.09062 Time: 0.00075
17-03-26 01:30:16 [1] Train Extra: lr=0.0000659 inv=0.4271875 sub=0.0000000
17-03-26 01:31:35 [1] Step: 52800 Acc: 0.67344 0.83955 Cost: 1.24829 0.93018 0.22736 0.09075 Time: 0.00072
17-03-26 01:31:35 [1] Train Extra: lr=0.0000657 inv=0.4315625 sub=0.0000000
17-03-26 01:32:54 [1] Step: 52900 Acc: 0.67594 0.84859 Cost: 1.03128 0.72868 0.21185 0.09075 Time: 0.00073
17-03-26 01:32:54 [1] Train Extra: lr=0.0000655 inv=0.4071875 sub=0.0000000
17-03-26 01:34:23 [1] Step: 53000 Acc: 0.69594 0.84660 Cost: 1.15265 0.89573 0.16607 0.09086 Time: 0.00078
17-03-26 01:34:23 [1] Train Extra: lr=0.0000653 inv=0.4251563 sub=0.0000000
17-03-26 01:35:19 [1] Step: 53000 Eval acc: 0.67458 0.84620 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 01:35:19 [1] Eval Extra: inv=0.4329174
17-03-26 01:36:50 [1] Step: 53100 Acc: 0.68406 0.85156 Cost: 1.27005 0.86911 0.30997 0.09097 Time: 0.00077
17-03-26 01:36:50 [1] Train Extra: lr=0.0000651 inv=0.4339062 sub=0.0000000
17-03-26 01:38:18 [1] Step: 53200 Acc: 0.67750 0.84941 Cost: 1.16967 0.83586 0.24280 0.09101 Time: 0.00075
17-03-26 01:38:18 [1] Train Extra: lr=0.0000649 inv=0.4431250 sub=0.0000000
17-03-26 01:39:47 [1] Step: 53300 Acc: 0.68250 0.85409 Cost: 1.14978 0.85634 0.20234 0.09110 Time: 0.00079
17-03-26 01:39:47 [1] Train Extra: lr=0.0000647 inv=0.4307813 sub=0.0000000
17-03-26 01:41:13 [1] Step: 53400 Acc: 0.67563 0.84408 Cost: 1.27894 0.88721 0.30057 0.09116 Time: 0.00074
17-03-26 01:41:13 [1] Train Extra: lr=0.0000646 inv=0.4453125 sub=0.0000000
17-03-26 01:42:27 [1] Step: 53500 Acc: 0.67281 0.84940 Cost: 1.07179 0.70986 0.27077 0.09116 Time: 0.00069
17-03-26 01:42:27 [1] Train Extra: lr=0.0000644 inv=0.4065625 sub=0.0000000
17-03-26 01:43:23 [1] Step: 53500 Eval acc: 0.67370 0.85229 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 01:43:23 [1] Eval Extra: inv=0.4274514
17-03-26 01:44:47 [1] Step: 53600 Acc: 0.68250 0.84887 Cost: 0.88548 0.67611 0.11813 0.09124 Time: 0.00075
17-03-26 01:44:47 [1] Train Extra: lr=0.0000642 inv=0.4196875 sub=0.0000000
17-03-26 01:46:09 [1] Step: 53700 Acc: 0.68188 0.84050 Cost: 0.98621 0.64091 0.25397 0.09132 Time: 0.00073
17-03-26 01:46:09 [1] Train Extra: lr=0.0000640 inv=0.4300000 sub=0.0000000
17-03-26 01:47:22 [1] Step: 53800 Acc: 0.67625 0.85176 Cost: 0.78343 0.57542 0.11665 0.09136 Time: 0.00071
17-03-26 01:47:22 [1] Train Extra: lr=0.0000638 inv=0.4046875 sub=0.0000000
17-03-26 01:48:47 [1] Step: 53900 Acc: 0.69031 0.85401 Cost: 1.15925 0.91364 0.15421 0.09140 Time: 0.00076
17-03-26 01:48:47 [1] Train Extra: lr=0.0000636 inv=0.4100000 sub=0.0000000
17-03-26 01:50:14 [1] Step: 54000 Acc: 0.68812 0.84179 Cost: 1.22386 0.93255 0.19985 0.09146 Time: 0.00075
17-03-26 01:50:14 [1] Train Extra: lr=0.0000635 inv=0.4287500 sub=0.0000000
17-03-26 01:51:11 [1] Step: 54000 Eval acc: 0.67027 0.84825 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 01:51:11 [1] Eval Extra: inv=0.4435181
17-03-26 01:52:28 [1] Step: 54100 Acc: 0.66656 0.84992 Cost: 0.88118 0.60264 0.18696 0.09158 Time: 0.00075
17-03-26 01:52:28 [1] Train Extra: lr=0.0000633 inv=0.3768750 sub=0.0000000
17-03-26 01:53:55 [1] Step: 54200 Acc: 0.67563 0.85080 Cost: 1.20010 0.85253 0.25587 0.09170 Time: 0.00075
17-03-26 01:53:55 [1] Train Extra: lr=0.0000631 inv=0.4192187 sub=0.0000000
17-03-26 01:55:12 [1] Step: 54300 Acc: 0.68250 0.85122 Cost: 1.14331 0.85276 0.19873 0.09182 Time: 0.00075
17-03-26 01:55:12 [1] Train Extra: lr=0.0000629 inv=0.3773437 sub=0.0000000
17-03-26 01:56:32 [1] Step: 54400 Acc: 0.68406 0.84643 Cost: 0.85351 0.62590 0.13577 0.09183 Time: 0.00073
17-03-26 01:56:32 [1] Train Extra: lr=0.0000627 inv=0.3995313 sub=0.0000000
17-03-26 01:58:06 [1] Step: 54500 Acc: 0.67500 0.85282 Cost: 0.95903 0.77831 0.08885 0.09187 Time: 0.00079
17-03-26 01:58:06 [1] Train Extra: lr=0.0000625 inv=0.4389062 sub=0.0000000
17-03-26 01:59:03 [1] Step: 54500 Eval acc: 0.67613 0.85289 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 01:59:03 [1] Eval Extra: inv=0.4260711
17-03-26 02:00:27 [1] Step: 54600 Acc: 0.66781 0.84741 Cost: 0.98803 0.71926 0.17686 0.09192 Time: 0.00074
17-03-26 02:00:27 [1] Train Extra: lr=0.0000624 inv=0.4287500 sub=0.0000000
17-03-26 02:01:41 [1] Step: 54700 Acc: 0.67563 0.85233 Cost: 1.00173 0.64187 0.26788 0.09197 Time: 0.00072
17-03-26 02:01:41 [1] Train Extra: lr=0.0000622 inv=0.3796875 sub=0.0000000
17-03-26 02:02:58 [1] Step: 54800 Acc: 0.67437 0.85188 Cost: 1.06130 0.78511 0.18426 0.09192 Time: 0.00074
17-03-26 02:02:58 [1] Train Extra: lr=0.0000620 inv=0.4057812 sub=0.0000000
17-03-26 02:04:19 [1] Step: 54900 Acc: 0.67188 0.85288 Cost: 0.99471 0.78768 0.11504 0.09198 Time: 0.00074
17-03-26 02:04:19 [1] Train Extra: lr=0.0000618 inv=0.4228125 sub=0.0000000
17-03-26 02:05:44 [1] Step: 55000 Acc: 0.66906 0.84658 Cost: 0.92844 0.76228 0.07419 0.09197 Time: 0.00073
17-03-26 02:05:44 [1] Train Extra: lr=0.0000617 inv=0.4715625 sub=0.0000000
17-03-26 02:06:44 [1] Step: 55000 Eval acc: 0.67458 0.85031 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00018
17-03-26 02:06:44 [1] Eval Extra: inv=0.4050905
17-03-26 02:06:44 [1] Checkpointing.
17-03-26 02:08:03 [1] Step: 55100 Acc: 0.67156 0.84948 Cost: 1.10114 0.80376 0.20538 0.09200 Time: 0.00072
17-03-26 02:08:03 [1] Train Extra: lr=0.0000615 inv=0.4192187 sub=0.0000000
17-03-26 02:09:17 [1] Step: 55200 Acc: 0.69063 0.84815 Cost: 1.39035 0.94793 0.35035 0.09207 Time: 0.00070
17-03-26 02:09:17 [1] Train Extra: lr=0.0000613 inv=0.4106250 sub=0.0000000
17-03-26 02:10:36 [1] Step: 55300 Acc: 0.65906 0.84947 Cost: 1.21908 0.96873 0.15823 0.09212 Time: 0.00074
17-03-26 02:10:36 [1] Train Extra: lr=0.0000611 inv=0.4042188 sub=0.0000000
17-03-26 02:11:48 [1] Step: 55400 Acc: 0.67281 0.85258 Cost: 1.00989 0.77671 0.14102 0.09215 Time: 0.00072
17-03-26 02:11:48 [1] Train Extra: lr=0.0000609 inv=0.4045313 sub=0.0000000
17-03-26 02:13:18 [1] Step: 55500 Acc: 0.66281 0.84295 Cost: 1.12933 0.81010 0.22703 0.09220 Time: 0.00077
17-03-26 02:13:18 [1] Train Extra: lr=0.0000608 inv=0.4428125 sub=0.0000000
17-03-26 02:14:16 [1] Step: 55500 Eval acc: 0.66928 0.85473 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 02:14:16 [1] Eval Extra: inv=0.3918397
17-03-26 02:15:41 [1] Step: 55600 Acc: 0.66531 0.85328 Cost: 0.84598 0.67003 0.08375 0.09220 Time: 0.00077
17-03-26 02:15:41 [1] Train Extra: lr=0.0000606 inv=0.4376563 sub=0.0000000
17-03-26 02:17:09 [1] Step: 55700 Acc: 0.66563 0.84931 Cost: 1.00064 0.70045 0.20791 0.09228 Time: 0.00075
17-03-26 02:17:09 [1] Train Extra: lr=0.0000604 inv=0.4387500 sub=0.0000000
17-03-26 02:18:34 [1] Step: 55800 Acc: 0.67188 0.84540 Cost: 1.06043 0.77380 0.19429 0.09234 Time: 0.00075
17-03-26 02:18:34 [1] Train Extra: lr=0.0000603 inv=0.4237500 sub=0.0000000
17-03-26 02:19:46 [1] Step: 55900 Acc: 0.67969 0.84753 Cost: 0.83981 0.59869 0.14873 0.09238 Time: 0.00071
17-03-26 02:19:46 [1] Train Extra: lr=0.0000601 inv=0.3998437 sub=0.0000000
17-03-26 02:20:58 [1] Step: 56000 Acc: 0.65844 0.84873 Cost: 1.13888 0.74824 0.29816 0.09248 Time: 0.00070
17-03-26 02:20:58 [1] Train Extra: lr=0.0000599 inv=0.4173438 sub=0.0000000
17-03-26 02:21:56 [1] Step: 56000 Eval acc: 0.67259 0.85413 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 02:21:56 [1] Eval Extra: inv=0.4552231
17-03-26 02:23:07 [1] Step: 56100 Acc: 0.68531 0.84946 Cost: 1.09706 0.76223 0.24229 0.09254 Time: 0.00072
17-03-26 02:23:07 [1] Train Extra: lr=0.0000597 inv=0.4153125 sub=0.0000000
17-03-26 02:24:34 [1] Step: 56200 Acc: 0.67688 0.85428 Cost: 0.83475 0.51885 0.22340 0.09250 Time: 0.00077
17-03-26 02:24:34 [1] Train Extra: lr=0.0000596 inv=0.4406250 sub=0.0000000
17-03-26 02:25:52 [1] Step: 56300 Acc: 0.67094 0.84538 Cost: 1.29064 0.92630 0.27184 0.09250 Time: 0.00073
17-03-26 02:25:52 [1] Train Extra: lr=0.0000594 inv=0.4209375 sub=0.0000000
17-03-26 02:27:10 [1] Step: 56400 Acc: 0.69875 0.85127 Cost: 1.05175 0.73031 0.22883 0.09261 Time: 0.00074
17-03-26 02:27:10 [1] Train Extra: lr=0.0000592 inv=0.4098438 sub=0.0000000
17-03-26 02:28:33 [1] Step: 56500 Acc: 0.67437 0.84306 Cost: 1.07066 0.73167 0.24633 0.09266 Time: 0.00073
17-03-26 02:28:33 [1] Train Extra: lr=0.0000590 inv=0.4320312 sub=0.0000000
17-03-26 02:29:30 [1] Step: 56500 Eval acc: 0.67149 0.85203 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 02:29:30 [1] Eval Extra: inv=0.4132619
17-03-26 02:30:47 [1] Step: 56600 Acc: 0.66250 0.83981 Cost: 1.02367 0.64395 0.28702 0.09269 Time: 0.00071
17-03-26 02:30:47 [1] Train Extra: lr=0.0000589 inv=0.4271875 sub=0.0000000
17-03-26 02:32:10 [1] Step: 56700 Acc: 0.66687 0.85285 Cost: 1.27781 0.86700 0.31805 0.09276 Time: 0.00076
17-03-26 02:32:10 [1] Train Extra: lr=0.0000587 inv=0.4062500 sub=0.0000000
17-03-26 02:33:30 [1] Step: 56800 Acc: 0.68344 0.86047 Cost: 1.07183 0.75672 0.22232 0.09279 Time: 0.00076
17-03-26 02:33:30 [1] Train Extra: lr=0.0000585 inv=0.3934375 sub=0.0000000
17-03-26 02:34:53 [1] Step: 56900 Acc: 0.66219 0.84315 Cost: 1.16570 0.85908 0.21384 0.09278 Time: 0.00073
17-03-26 02:34:53 [1] Train Extra: lr=0.0000584 inv=0.4651562 sub=0.0000000
17-03-26 02:36:13 [1] Step: 57000 Acc: 0.67750 0.85007 Cost: 0.86983 0.53852 0.23846 0.09285 Time: 0.00073
17-03-26 02:36:13 [1] Train Extra: lr=0.0000582 inv=0.4103125 sub=0.0000000
17-03-26 02:37:09 [1] Step: 57000 Eval acc: 0.68231 0.85203 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 02:37:09 [1] Eval Extra: inv=0.3974161
17-03-26 02:37:09 [1] Checkpointing with new best dev accuracy of 0.682310
17-03-26 02:38:33 [1] Step: 57100 Acc: 0.67156 0.84584 Cost: 0.95574 0.70162 0.16121 0.09291 Time: 0.00074
17-03-26 02:38:33 [1] Train Extra: lr=0.0000580 inv=0.4182812 sub=0.0000000
17-03-26 02:39:51 [1] Step: 57200 Acc: 0.67625 0.85387 Cost: 1.27218 0.92666 0.25263 0.09288 Time: 0.00076
17-03-26 02:39:51 [1] Train Extra: lr=0.0000579 inv=0.3921875 sub=0.0000000
17-03-26 02:41:09 [1] Step: 57300 Acc: 0.68469 0.84355 Cost: 0.96720 0.74768 0.12661 0.09291 Time: 0.00072
17-03-26 02:41:09 [1] Train Extra: lr=0.0000577 inv=0.4293750 sub=0.0000000
17-03-26 02:42:34 [1] Step: 57400 Acc: 0.67531 0.85521 Cost: 1.40403 1.02591 0.28516 0.09295 Time: 0.00077
17-03-26 02:42:34 [1] Train Extra: lr=0.0000575 inv=0.4275000 sub=0.0000000
17-03-26 02:43:49 [1] Step: 57500 Acc: 0.68437 0.84638 Cost: 0.68776 0.47456 0.12021 0.09298 Time: 0.00070
17-03-26 02:43:49 [1] Train Extra: lr=0.0000574 inv=0.3987500 sub=0.0000000
17-03-26 02:44:46 [1] Step: 57500 Eval acc: 0.67668 0.84830 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 02:44:46 [1] Eval Extra: inv=0.4351811
17-03-26 02:46:11 [1] Step: 57600 Acc: 0.66781 0.84087 Cost: 1.20139 0.90932 0.19911 0.09296 Time: 0.00075
17-03-26 02:46:11 [1] Train Extra: lr=0.0000572 inv=0.4251563 sub=0.0000000
17-03-26 02:47:36 [1] Step: 57700 Acc: 0.66500 0.84868 Cost: 1.14236 0.78480 0.26460 0.09296 Time: 0.00075
17-03-26 02:47:36 [1] Train Extra: lr=0.0000570 inv=0.4595313 sub=0.0000000
17-03-26 02:49:01 [1] Step: 57800 Acc: 0.67937 0.84582 Cost: 0.64413 0.41948 0.13163 0.09302 Time: 0.00076
17-03-26 02:49:01 [1] Train Extra: lr=0.0000569 inv=0.4206250 sub=0.0000000
17-03-26 02:50:19 [1] Step: 57900 Acc: 0.68000 0.85181 Cost: 1.02106 0.67918 0.24886 0.09302 Time: 0.00073
17-03-26 02:50:19 [1] Train Extra: lr=0.0000567 inv=0.4193750 sub=0.0000000
17-03-26 02:51:38 [1] Step: 58000 Acc: 0.66812 0.84954 Cost: 0.93308 0.70327 0.13675 0.09307 Time: 0.00072
17-03-26 02:51:38 [1] Train Extra: lr=0.0000566 inv=0.4362500 sub=0.0000000
17-03-26 02:52:35 [1] Step: 58000 Eval acc: 0.67557 0.85076 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 02:52:35 [1] Eval Extra: inv=0.4278931
17-03-26 02:53:51 [1] Step: 58100 Acc: 0.68031 0.85081 Cost: 0.81428 0.53323 0.18797 0.09308 Time: 0.00073
17-03-26 02:53:51 [1] Train Extra: lr=0.0000564 inv=0.3948437 sub=0.0000000
17-03-26 02:55:15 [1] Step: 58200 Acc: 0.67469 0.84826 Cost: 1.04935 0.73419 0.22197 0.09319 Time: 0.00074
17-03-26 02:55:15 [1] Train Extra: lr=0.0000562 inv=0.4321875 sub=0.0000000
17-03-26 02:56:29 [1] Step: 58300 Acc: 0.68156 0.84917 Cost: 1.20697 0.87216 0.24164 0.09317 Time: 0.00072
17-03-26 02:56:29 [1] Train Extra: lr=0.0000561 inv=0.4007812 sub=0.0000000
17-03-26 02:57:54 [1] Step: 58400 Acc: 0.67781 0.85289 Cost: 1.01071 0.76919 0.14831 0.09321 Time: 0.00078
17-03-26 02:57:54 [1] Train Extra: lr=0.0000559 inv=0.4079687 sub=0.0000000
17-03-26 02:59:14 [1] Step: 58500 Acc: 0.67719 0.84710 Cost: 0.99357 0.71926 0.18108 0.09324 Time: 0.00071
17-03-26 02:59:14 [1] Train Extra: lr=0.0000557 inv=0.4421875 sub=0.0000000
17-03-26 03:00:14 [1] Step: 58500 Eval acc: 0.68242 0.85178 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00018
17-03-26 03:00:14 [1] Eval Extra: inv=0.4543949
17-03-26 03:01:47 [1] Step: 58600 Acc: 0.66187 0.84846 Cost: 1.01817 0.62202 0.30293 0.09322 Time: 0.00076
17-03-26 03:01:47 [1] Train Extra: lr=0.0000556 inv=0.4595313 sub=0.0000000
17-03-26 03:03:06 [1] Step: 58700 Acc: 0.67781 0.84085 Cost: 1.07322 0.71144 0.26850 0.09328 Time: 0.00071
17-03-26 03:03:06 [1] Train Extra: lr=0.0000554 inv=0.4337500 sub=0.0000000
17-03-26 03:04:33 [1] Step: 58800 Acc: 0.68500 0.84414 Cost: 1.42155 1.04953 0.27863 0.09339 Time: 0.00075
17-03-26 03:04:33 [1] Train Extra: lr=0.0000553 inv=0.4537500 sub=0.0000000
17-03-26 03:05:51 [1] Step: 58900 Acc: 0.70250 0.85441 Cost: 0.93527 0.58983 0.25193 0.09351 Time: 0.00075
17-03-26 03:05:51 [1] Train Extra: lr=0.0000551 inv=0.4075000 sub=0.0000000
17-03-26 03:07:16 [1] Step: 59000 Acc: 0.70656 0.83639 Cost: 1.02009 0.72073 0.20570 0.09366 Time: 0.00073
17-03-26 03:07:16 [1] Train Extra: lr=0.0000550 inv=0.4593750 sub=0.0000000
17-03-26 03:08:14 [1] Step: 59000 Eval acc: 0.67756 0.85481 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 03:08:14 [1] Eval Extra: inv=0.4026060
17-03-26 03:09:39 [1] Step: 59100 Acc: 0.67906 0.85317 Cost: 1.21642 0.88636 0.23622 0.09384 Time: 0.00076
17-03-26 03:09:39 [1] Train Extra: lr=0.0000548 inv=0.4196875 sub=0.0000000
17-03-26 03:11:03 [1] Step: 59200 Acc: 0.70688 0.85104 Cost: 1.06598 0.84262 0.12940 0.09396 Time: 0.00076
17-03-26 03:11:03 [1] Train Extra: lr=0.0000546 inv=0.4267187 sub=0.0000000
17-03-26 03:12:23 [1] Step: 59300 Acc: 0.69688 0.85623 Cost: 0.92578 0.66944 0.16226 0.09408 Time: 0.00076
17-03-26 03:12:23 [1] Train Extra: lr=0.0000545 inv=0.4107812 sub=0.0000000
17-03-26 03:13:52 [1] Step: 59400 Acc: 0.70750 0.84462 Cost: 1.18765 0.82548 0.26797 0.09420 Time: 0.00076
17-03-26 03:13:52 [1] Train Extra: lr=0.0000543 inv=0.4528125 sub=0.0000000
17-03-26 03:15:16 [1] Step: 59500 Acc: 0.69219 0.85398 Cost: 1.22087 0.88786 0.23873 0.09428 Time: 0.00077
17-03-26 03:15:16 [1] Train Extra: lr=0.0000542 inv=0.4203125 sub=0.0000000
17-03-26 03:16:11 [1] Step: 59500 Eval acc: 0.67856 0.85275 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-26 03:16:11 [1] Eval Extra: inv=0.4469413
17-03-26 03:17:24 [1] Step: 59600 Acc: 0.69719 0.84988 Cost: 1.14343 0.82667 0.22230 0.09446 Time: 0.00070
17-03-26 03:17:24 [1] Train Extra: lr=0.0000540 inv=0.4106250 sub=0.0000000
17-03-26 03:18:49 [1] Step: 59700 Acc: 0.69688 0.85261 Cost: 1.23717 0.86326 0.27940 0.09451 Time: 0.00076
17-03-26 03:18:49 [1] Train Extra: lr=0.0000539 inv=0.4239062 sub=0.0000000
17-03-26 03:20:13 [1] Step: 59800 Acc: 0.69563 0.84021 Cost: 0.95351 0.64413 0.21480 0.09458 Time: 0.00075
17-03-26 03:20:13 [1] Train Extra: lr=0.0000537 inv=0.4296875 sub=0.0000000
17-03-26 03:21:34 [1] Step: 59900 Acc: 0.70531 0.85098 Cost: 0.83089 0.60581 0.13036 0.09472 Time: 0.00075
17-03-26 03:21:34 [1] Train Extra: lr=0.0000535 inv=0.3959375 sub=0.0000000
17-03-26 03:23:05 [1] Step: 60000 Acc: 0.68937 0.85321 Cost: 1.09860 0.86259 0.14117 0.09484 Time: 0.00079
17-03-26 03:23:05 [1] Train Extra: lr=0.0000534 inv=0.4167188 sub=0.0000000
17-03-26 03:24:02 [1] Step: 60000 Eval acc: 0.67690 0.85032 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 03:24:02 [1] Eval Extra: inv=0.4142557
17-03-26 03:24:02 [1] Checkpointing.
17-03-26 03:25:22 [1] Step: 60100 Acc: 0.69344 0.84789 Cost: 0.93445 0.70547 0.13398 0.09500 Time: 0.00072
17-03-26 03:25:22 [1] Train Extra: lr=0.0000532 inv=0.4412500 sub=0.0000000
17-03-26 03:26:41 [1] Step: 60200 Acc: 0.69656 0.85010 Cost: 1.13592 0.90314 0.13768 0.09510 Time: 0.00074
17-03-26 03:26:41 [1] Train Extra: lr=0.0000531 inv=0.4040625 sub=0.0000000
17-03-26 03:27:59 [1] Step: 60300 Acc: 0.70906 0.84624 Cost: 1.01380 0.78813 0.13053 0.09515 Time: 0.00072
17-03-26 03:27:59 [1] Train Extra: lr=0.0000529 inv=0.4339062 sub=0.0000000
17-03-26 03:29:24 [1] Step: 60400 Acc: 0.68031 0.85622 Cost: 0.89272 0.60834 0.18915 0.09523 Time: 0.00078
17-03-26 03:29:24 [1] Train Extra: lr=0.0000528 inv=0.4260937 sub=0.0000000
17-03-26 03:30:48 [1] Step: 60500 Acc: 0.69281 0.84462 Cost: 0.72677 0.52424 0.10711 0.09541 Time: 0.00074
17-03-26 03:30:48 [1] Train Extra: lr=0.0000526 inv=0.4420312 sub=0.0000000
17-03-26 03:31:45 [1] Step: 60500 Eval acc: 0.67712 0.84621 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 03:31:45 [1] Eval Extra: inv=0.4531250
17-03-26 03:32:59 [1] Step: 60600 Acc: 0.69719 0.84908 Cost: 0.91875 0.65489 0.16843 0.09544 Time: 0.00072
17-03-26 03:32:59 [1] Train Extra: lr=0.0000525 inv=0.4142188 sub=0.0000000
17-03-26 03:34:32 [1] Step: 60700 Acc: 0.66687 0.85149 Cost: 1.19659 0.85983 0.24120 0.09556 Time: 0.00079
17-03-26 03:34:32 [1] Train Extra: lr=0.0000523 inv=0.4556250 sub=0.0000000
17-03-26 03:35:56 [1] Step: 60800 Acc: 0.69250 0.84412 Cost: 1.08044 0.67376 0.31103 0.09565 Time: 0.00074
17-03-26 03:35:56 [1] Train Extra: lr=0.0000522 inv=0.4520312 sub=0.0000000
17-03-26 03:37:16 [1] Step: 60900 Acc: 0.68937 0.84847 Cost: 1.35683 1.05703 0.20411 0.09569 Time: 0.00072
17-03-26 03:37:16 [1] Train Extra: lr=0.0000520 inv=0.4364062 sub=0.0000000
17-03-26 03:38:42 [1] Step: 61000 Acc: 0.69219 0.84298 Cost: 0.82688 0.54954 0.18164 0.09571 Time: 0.00075
17-03-26 03:38:42 [1] Train Extra: lr=0.0000519 inv=0.4503125 sub=0.0000000
17-03-26 03:39:40 [1] Step: 61000 Eval acc: 0.67337 0.85286 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 03:39:40 [1] Eval Extra: inv=0.4044280
17-03-26 03:41:04 [1] Step: 61100 Acc: 0.68812 0.84764 Cost: 1.18894 0.82227 0.27093 0.09573 Time: 0.00078
17-03-26 03:41:04 [1] Train Extra: lr=0.0000517 inv=0.4178125 sub=0.0000000
17-03-26 03:42:32 [1] Step: 61200 Acc: 0.68094 0.85443 Cost: 0.88011 0.59305 0.19123 0.09583 Time: 0.00076
17-03-26 03:42:32 [1] Train Extra: lr=0.0000516 inv=0.4489063 sub=0.0000000
17-03-26 03:43:47 [1] Step: 61300 Acc: 0.68656 0.84695 Cost: 0.75654 0.48818 0.17245 0.09590 Time: 0.00071
17-03-26 03:43:47 [1] Train Extra: lr=0.0000514 inv=0.3950000 sub=0.0000000
17-03-26 03:45:10 [1] Step: 61400 Acc: 0.69094 0.85012 Cost: 1.45965 1.07637 0.28733 0.09594 Time: 0.00076
17-03-26 03:45:10 [1] Train Extra: lr=0.0000513 inv=0.4290625 sub=0.0000000
17-03-26 03:46:36 [1] Step: 61500 Acc: 0.70312 0.84592 Cost: 1.24322 0.82182 0.32540 0.09600 Time: 0.00074
17-03-26 03:46:36 [1] Train Extra: lr=0.0000511 inv=0.4364062 sub=0.0000000
17-03-26 03:47:33 [1] Step: 61500 Eval acc: 0.67734 0.85594 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 03:47:33 [1] Eval Extra: inv=0.4369479
17-03-26 03:48:46 [1] Step: 61600 Acc: 0.67875 0.84487 Cost: 1.18054 0.84922 0.23523 0.09608 Time: 0.00071
17-03-26 03:48:46 [1] Train Extra: lr=0.0000510 inv=0.4276563 sub=0.0000000
17-03-26 03:50:03 [1] Step: 61700 Acc: 0.68437 0.84537 Cost: 1.18870 0.79669 0.29593 0.09607 Time: 0.00072
17-03-26 03:50:03 [1] Train Extra: lr=0.0000508 inv=0.4182812 sub=0.0000000
17-03-26 03:51:34 [1] Step: 61800 Acc: 0.68125 0.84613 Cost: 0.97513 0.71616 0.16284 0.09613 Time: 0.00076
17-03-26 03:51:34 [1] Train Extra: lr=0.0000507 inv=0.4342187 sub=0.0000000
17-03-26 03:52:41 [1] Step: 61900 Acc: 0.67906 0.85328 Cost: 0.95767 0.68657 0.17483 0.09627 Time: 0.00069
17-03-26 03:52:41 [1] Train Extra: lr=0.0000506 inv=0.3590625 sub=0.0000000
17-03-26 03:54:03 [1] Step: 62000 Acc: 0.68312 0.85599 Cost: 1.05515 0.71397 0.24487 0.09631 Time: 0.00076
17-03-26 03:54:03 [1] Train Extra: lr=0.0000504 inv=0.4339062 sub=0.0000000
17-03-26 03:54:59 [1] Step: 62000 Eval acc: 0.67326 0.85106 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 03:54:59 [1] Eval Extra: inv=0.4327518
17-03-26 03:56:14 [1] Step: 62100 Acc: 0.68750 0.84452 Cost: 0.86969 0.55087 0.22251 0.09631 Time: 0.00071
17-03-26 03:56:14 [1] Train Extra: lr=0.0000503 inv=0.4325000 sub=0.0000000
17-03-26 03:57:31 [1] Step: 62200 Acc: 0.67969 0.84832 Cost: 1.25090 0.90801 0.24653 0.09635 Time: 0.00074
17-03-26 03:57:31 [1] Train Extra: lr=0.0000501 inv=0.3978125 sub=0.0000000
17-03-26 03:58:51 [1] Step: 62300 Acc: 0.70281 0.85271 Cost: 0.81611 0.63086 0.08882 0.09644 Time: 0.00074
17-03-26 03:58:51 [1] Train Extra: lr=0.0000500 inv=0.4062500 sub=0.0000000
17-03-26 04:00:10 [1] Step: 62400 Acc: 0.68563 0.85063 Cost: 1.27897 0.94702 0.23544 0.09650 Time: 0.00074
17-03-26 04:00:10 [1] Train Extra: lr=0.0000498 inv=0.4193750 sub=0.0000000
17-03-26 04:01:27 [1] Step: 62500 Acc: 0.67312 0.84921 Cost: 0.87433 0.65858 0.11920 0.09656 Time: 0.00074
17-03-26 04:01:27 [1] Train Extra: lr=0.0000497 inv=0.4121875 sub=0.0000000
17-03-26 04:02:24 [1] Step: 62500 Eval acc: 0.68143 0.84520 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 04:02:24 [1] Eval Extra: inv=0.4335799
17-03-26 04:03:49 [1] Step: 62600 Acc: 0.67969 0.84806 Cost: 1.09092 0.86108 0.13315 0.09669 Time: 0.00077
17-03-26 04:03:49 [1] Train Extra: lr=0.0000495 inv=0.4121875 sub=0.0000000
17-03-26 04:05:16 [1] Step: 62700 Acc: 0.68031 0.84802 Cost: 0.96640 0.66985 0.19984 0.09671 Time: 0.00075
17-03-26 04:05:16 [1] Train Extra: lr=0.0000494 inv=0.4403125 sub=0.0000000
17-03-26 04:06:42 [1] Step: 62800 Acc: 0.68969 0.84175 Cost: 0.95713 0.55618 0.30422 0.09672 Time: 0.00075
17-03-26 04:06:42 [1] Train Extra: lr=0.0000493 inv=0.4382813 sub=0.0000000
17-03-26 04:08:09 [1] Step: 62900 Acc: 0.68344 0.85359 Cost: 0.97885 0.71313 0.16890 0.09682 Time: 0.00077
17-03-26 04:08:09 [1] Train Extra: lr=0.0000491 inv=0.4351563 sub=0.0000000
17-03-26 04:09:35 [1] Step: 63000 Acc: 0.68594 0.84748 Cost: 0.99881 0.65325 0.24864 0.09692 Time: 0.00075
17-03-26 04:09:35 [1] Train Extra: lr=0.0000490 inv=0.4310937 sub=0.0000000
17-03-26 04:10:32 [1] Step: 63000 Eval acc: 0.68043 0.85240 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 04:10:32 [1] Eval Extra: inv=0.4100596
17-03-26 04:11:51 [1] Step: 63100 Acc: 0.68531 0.84500 Cost: 1.06268 0.71955 0.24622 0.09692 Time: 0.00073
17-03-26 04:11:51 [1] Train Extra: lr=0.0000488 inv=0.4162500 sub=0.0000000
17-03-26 04:13:11 [1] Step: 63200 Acc: 0.67812 0.84976 Cost: 1.04486 0.75652 0.19135 0.09700 Time: 0.00074
17-03-26 04:13:11 [1] Train Extra: lr=0.0000487 inv=0.4162500 sub=0.0000000
17-03-26 04:14:23 [1] Step: 63300 Acc: 0.69656 0.84648 Cost: 0.97656 0.57865 0.30090 0.09701 Time: 0.00072
17-03-26 04:14:23 [1] Train Extra: lr=0.0000486 inv=0.4140625 sub=0.0000000
17-03-26 04:15:42 [1] Step: 63400 Acc: 0.68750 0.84773 Cost: 1.03998 0.74994 0.19292 0.09712 Time: 0.00073
17-03-26 04:15:42 [1] Train Extra: lr=0.0000484 inv=0.4226563 sub=0.0000000
17-03-26 04:16:59 [1] Step: 63500 Acc: 0.69594 0.85376 Cost: 0.98297 0.76794 0.11786 0.09717 Time: 0.00076
17-03-26 04:16:59 [1] Train Extra: lr=0.0000483 inv=0.4045313 sub=0.0000000
17-03-26 04:17:57 [1] Step: 63500 Eval acc: 0.68264 0.85433 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 04:17:57 [1] Eval Extra: inv=0.4058635
17-03-26 04:19:16 [1] Step: 63600 Acc: 0.67500 0.84334 Cost: 1.11870 0.77420 0.24724 0.09726 Time: 0.00071
17-03-26 04:19:16 [1] Train Extra: lr=0.0000481 inv=0.4515625 sub=0.0000000
17-03-26 04:20:29 [1] Step: 63700 Acc: 0.69125 0.85384 Cost: 0.91998 0.71636 0.10639 0.09723 Time: 0.00072
17-03-26 04:20:29 [1] Train Extra: lr=0.0000480 inv=0.4004687 sub=0.0000000
17-03-26 04:21:47 [1] Step: 63800 Acc: 0.68250 0.84579 Cost: 1.06185 0.68298 0.28162 0.09726 Time: 0.00074
17-03-26 04:21:47 [1] Train Extra: lr=0.0000479 inv=0.4095313 sub=0.0000000
17-03-26 04:23:05 [1] Step: 63900 Acc: 0.68063 0.85149 Cost: 1.17434 0.80517 0.27190 0.09726 Time: 0.00073
17-03-26 04:23:05 [1] Train Extra: lr=0.0000477 inv=0.3942188 sub=0.0000000
17-03-26 04:24:25 [1] Step: 64000 Acc: 0.68875 0.84202 Cost: 1.12499 0.80275 0.22496 0.09728 Time: 0.00071
17-03-26 04:24:25 [1] Train Extra: lr=0.0000476 inv=0.4218750 sub=0.0000000
17-03-26 04:25:22 [1] Step: 64000 Eval acc: 0.68087 0.85239 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 04:25:22 [1] Eval Extra: inv=0.4049249
17-03-26 04:26:47 [1] Step: 64100 Acc: 0.67406 0.83962 Cost: 1.00618 0.72499 0.18387 0.09732 Time: 0.00073
17-03-26 04:26:47 [1] Train Extra: lr=0.0000475 inv=0.4595313 sub=0.0000000
17-03-26 04:28:28 [1] Step: 64200 Acc: 0.67906 0.85242 Cost: 1.00531 0.63418 0.27369 0.09744 Time: 0.00082
17-03-26 04:28:28 [1] Train Extra: lr=0.0000473 inv=0.4526562 sub=0.0000000
17-03-26 04:29:42 [1] Step: 64300 Acc: 0.68594 0.85105 Cost: 1.19712 0.89059 0.20906 0.09747 Time: 0.00072
17-03-26 04:29:42 [1] Train Extra: lr=0.0000472 inv=0.4070313 sub=0.0000000
17-03-26 04:31:05 [1] Step: 64400 Acc: 0.69219 0.85039 Cost: 1.39636 1.01518 0.28368 0.09750 Time: 0.00077
17-03-26 04:31:05 [1] Train Extra: lr=0.0000470 inv=0.4096875 sub=0.0000000
17-03-26 04:32:20 [1] Step: 64500 Acc: 0.69688 0.85012 Cost: 1.00986 0.83013 0.08215 0.09758 Time: 0.00072
17-03-26 04:32:20 [1] Train Extra: lr=0.0000469 inv=0.3940625 sub=0.0000000
17-03-26 04:33:17 [1] Step: 64500 Eval acc: 0.68165 0.85002 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 04:33:17 [1] Eval Extra: inv=0.4223719
17-03-26 04:34:38 [1] Step: 64600 Acc: 0.67031 0.84458 Cost: 1.05763 0.69682 0.26317 0.09764 Time: 0.00075
17-03-26 04:34:38 [1] Train Extra: lr=0.0000468 inv=0.4292187 sub=0.0000000
17-03-26 04:36:03 [1] Step: 64700 Acc: 0.67656 0.84243 Cost: 0.94802 0.66359 0.18674 0.09770 Time: 0.00074
17-03-26 04:36:03 [1] Train Extra: lr=0.0000466 inv=0.4318750 sub=0.0000000
17-03-26 04:37:31 [1] Step: 64800 Acc: 0.66531 0.84084 Cost: 0.98410 0.71398 0.17242 0.09770 Time: 0.00074
17-03-26 04:37:31 [1] Train Extra: lr=0.0000465 inv=0.4506250 sub=0.0000000
17-03-26 04:39:00 [1] Step: 64900 Acc: 0.68344 0.85154 Cost: 1.09203 0.80025 0.19405 0.09772 Time: 0.00079
17-03-26 04:39:00 [1] Train Extra: lr=0.0000464 inv=0.4303125 sub=0.0000000
17-03-26 04:40:33 [1] Step: 65000 Acc: 0.66875 0.85609 Cost: 1.05891 0.80943 0.15159 0.09789 Time: 0.00080
17-03-26 04:40:33 [1] Train Extra: lr=0.0000462 inv=0.4168750 sub=0.0000000
17-03-26 04:41:30 [1] Step: 65000 Eval acc: 0.68518 0.85347 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 04:41:30 [1] Eval Extra: inv=0.4139245
17-03-26 04:41:30 [1] Checkpointing.
17-03-26 04:42:55 [1] Step: 65100 Acc: 0.67500 0.84567 Cost: 0.94274 0.68124 0.16361 0.09790 Time: 0.00075
17-03-26 04:42:55 [1] Train Extra: lr=0.0000461 inv=0.4425000 sub=0.0000000
17-03-26 04:44:13 [1] Step: 65200 Acc: 0.68563 0.84764 Cost: 0.97113 0.65440 0.21883 0.09790 Time: 0.00073
17-03-26 04:44:13 [1] Train Extra: lr=0.0000460 inv=0.4279688 sub=0.0000000
17-03-26 04:45:43 [1] Step: 65300 Acc: 0.67969 0.85130 Cost: 1.05259 0.72847 0.22620 0.09793 Time: 0.00078
17-03-26 04:45:43 [1] Train Extra: lr=0.0000458 inv=0.4143750 sub=0.0000000
17-03-26 04:47:15 [1] Step: 65400 Acc: 0.65938 0.84015 Cost: 1.16931 0.92922 0.14216 0.09793 Time: 0.00075
17-03-26 04:47:15 [1] Train Extra: lr=0.0000457 inv=0.4512500 sub=0.0000000
17-03-26 04:48:36 [1] Step: 65500 Acc: 0.67188 0.84746 Cost: 1.20490 0.84601 0.26094 0.09794 Time: 0.00072
17-03-26 04:48:36 [1] Train Extra: lr=0.0000456 inv=0.4232812 sub=0.0000000
17-03-26 04:49:33 [1] Step: 65500 Eval acc: 0.67955 0.85600 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 04:49:33 [1] Eval Extra: inv=0.3786992
17-03-26 04:50:51 [1] Step: 65600 Acc: 0.67406 0.85400 Cost: 1.31820 0.96579 0.25447 0.09793 Time: 0.00074
17-03-26 04:50:51 [1] Train Extra: lr=0.0000454 inv=0.4035937 sub=0.0000000
17-03-26 04:52:10 [1] Step: 65700 Acc: 0.67875 0.85476 Cost: 0.85184 0.66539 0.08846 0.09799 Time: 0.00076
17-03-26 04:52:10 [1] Train Extra: lr=0.0000453 inv=0.3859375 sub=0.0000000
17-03-26 04:53:35 [1] Step: 65800 Acc: 0.70250 0.85485 Cost: 0.95355 0.70626 0.14923 0.09805 Time: 0.00076
17-03-26 04:53:35 [1] Train Extra: lr=0.0000452 inv=0.4270312 sub=0.0000000
17-03-26 04:54:59 [1] Step: 65900 Acc: 0.67125 0.84315 Cost: 1.34555 0.98401 0.26342 0.09812 Time: 0.00074
17-03-26 04:54:59 [1] Train Extra: lr=0.0000451 inv=0.4125000 sub=0.0000000
17-03-26 04:56:24 [1] Step: 66000 Acc: 0.67563 0.85611 Cost: 0.71365 0.51625 0.09924 0.09816 Time: 0.00076
17-03-26 04:56:24 [1] Train Extra: lr=0.0000449 inv=0.4125000 sub=0.0000000
17-03-26 04:57:20 [1] Step: 66000 Eval acc: 0.67646 0.85215 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 04:57:20 [1] Eval Extra: inv=0.4134828
17-03-26 04:58:40 [1] Step: 66100 Acc: 0.69250 0.84846 Cost: 1.21804 0.95732 0.16258 0.09815 Time: 0.00074
17-03-26 04:58:40 [1] Train Extra: lr=0.0000448 inv=0.4001562 sub=0.0000000
17-03-26 05:00:05 [1] Step: 66200 Acc: 0.69719 0.85199 Cost: 0.95178 0.65850 0.19514 0.09814 Time: 0.00075
17-03-26 05:00:05 [1] Train Extra: lr=0.0000447 inv=0.4273438 sub=0.0000000
17-03-26 05:01:29 [1] Step: 66300 Acc: 0.67875 0.84679 Cost: 0.86939 0.64896 0.12219 0.09824 Time: 0.00074
17-03-26 05:01:29 [1] Train Extra: lr=0.0000445 inv=0.4415625 sub=0.0000000
17-03-26 05:02:47 [1] Step: 66400 Acc: 0.67750 0.84888 Cost: 1.16561 0.82614 0.24124 0.09824 Time: 0.00073
17-03-26 05:02:47 [1] Train Extra: lr=0.0000444 inv=0.4021875 sub=0.0000000
17-03-26 05:04:07 [1] Step: 66500 Acc: 0.69000 0.85314 Cost: 1.34012 1.04829 0.19361 0.09822 Time: 0.00074
17-03-26 05:04:07 [1] Train Extra: lr=0.0000443 inv=0.4245313 sub=0.0000000
17-03-26 05:05:03 [1] Step: 66500 Eval acc: 0.67635 0.84986 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 05:05:03 [1] Eval Extra: inv=0.4051458
17-03-26 05:06:15 [1] Step: 66600 Acc: 0.67437 0.85073 Cost: 1.28739 0.96503 0.22412 0.09824 Time: 0.00072
17-03-26 05:06:15 [1] Train Extra: lr=0.0000442 inv=0.4006250 sub=0.0000000
17-03-26 05:07:33 [1] Step: 66700 Acc: 0.67625 0.85091 Cost: 0.82808 0.54083 0.18901 0.09825 Time: 0.00073
17-03-26 05:07:33 [1] Train Extra: lr=0.0000440 inv=0.4100000 sub=0.0000000
17-03-26 05:08:54 [1] Step: 66800 Acc: 0.67937 0.84899 Cost: 0.90732 0.66839 0.14066 0.09828 Time: 0.00075
17-03-26 05:08:54 [1] Train Extra: lr=0.0000439 inv=0.4239062 sub=0.0000000
17-03-26 05:10:20 [1] Step: 66900 Acc: 0.67594 0.84405 Cost: 1.37569 1.03644 0.24095 0.09829 Time: 0.00073
17-03-26 05:10:20 [1] Train Extra: lr=0.0000438 inv=0.4381250 sub=0.0000000
17-03-26 05:11:40 [1] Step: 67000 Acc: 0.67812 0.84780 Cost: 1.02953 0.69865 0.23250 0.09838 Time: 0.00073
17-03-26 05:11:40 [1] Train Extra: lr=0.0000437 inv=0.4210937 sub=0.0000000
17-03-26 05:12:37 [1] Step: 67000 Eval acc: 0.68065 0.85192 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 05:12:37 [1] Eval Extra: inv=0.4140349
17-03-26 05:13:55 [1] Step: 67100 Acc: 0.67312 0.85156 Cost: 1.22445 0.90666 0.21941 0.09838 Time: 0.00074
17-03-26 05:13:55 [1] Train Extra: lr=0.0000435 inv=0.4101562 sub=0.0000000
17-03-26 05:15:27 [1] Step: 67200 Acc: 0.69437 0.85301 Cost: 1.13704 0.81034 0.22826 0.09844 Time: 0.00080
17-03-26 05:15:27 [1] Train Extra: lr=0.0000434 inv=0.4171875 sub=0.0000000
17-03-26 05:16:44 [1] Step: 67300 Acc: 0.70312 0.84881 Cost: 0.84087 0.54284 0.19944 0.09858 Time: 0.00073
17-03-26 05:16:44 [1] Train Extra: lr=0.0000433 inv=0.4168750 sub=0.0000000
17-03-26 05:18:02 [1] Step: 67400 Acc: 0.69281 0.85231 Cost: 0.92341 0.62397 0.20073 0.09871 Time: 0.00074
17-03-26 05:18:02 [1] Train Extra: lr=0.0000432 inv=0.4056250 sub=0.0000000
17-03-26 05:19:21 [1] Step: 67500 Acc: 0.71375 0.84334 Cost: 0.98143 0.76899 0.11369 0.09875 Time: 0.00072
17-03-26 05:19:21 [1] Train Extra: lr=0.0000430 inv=0.4148438 sub=0.0000000
17-03-26 05:20:18 [1] Step: 67500 Eval acc: 0.68695 0.85508 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 05:20:18 [1] Eval Extra: inv=0.3763803
17-03-26 05:20:18 [1] Checkpointing with new best dev accuracy of 0.686948
17-03-26 05:21:43 [1] Step: 67600 Acc: 0.70969 0.84911 Cost: 1.04198 0.64558 0.29755 0.09885 Time: 0.00074
17-03-26 05:21:43 [1] Train Extra: lr=0.0000429 inv=0.4107812 sub=0.0000000
17-03-26 05:23:02 [1] Step: 67700 Acc: 0.70406 0.84913 Cost: 0.74116 0.46248 0.17973 0.09895 Time: 0.00073
17-03-26 05:23:02 [1] Train Extra: lr=0.0000428 inv=0.4142188 sub=0.0000000
17-03-26 05:24:34 [1] Step: 67800 Acc: 0.69812 0.84520 Cost: 0.83960 0.54849 0.19201 0.09909 Time: 0.00076
17-03-26 05:24:34 [1] Train Extra: lr=0.0000427 inv=0.4459375 sub=0.0000000
17-03-26 05:26:02 [1] Step: 67900 Acc: 0.69063 0.84642 Cost: 0.95346 0.68860 0.16567 0.09918 Time: 0.00076
17-03-26 05:26:02 [1] Train Extra: lr=0.0000425 inv=0.4379688 sub=0.0000000
17-03-26 05:27:09 [1] Step: 68000 Acc: 0.71625 0.85204 Cost: 0.86426 0.64770 0.11729 0.09927 Time: 0.00070
17-03-26 05:27:09 [1] Train Extra: lr=0.0000424 inv=0.3837500 sub=0.0000000
17-03-26 05:28:05 [1] Step: 68000 Eval acc: 0.67524 0.85044 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 05:28:05 [1] Eval Extra: inv=0.4600817
17-03-26 05:29:27 [1] Step: 68100 Acc: 0.71188 0.84957 Cost: 0.91541 0.63889 0.17713 0.09939 Time: 0.00075
17-03-26 05:29:27 [1] Train Extra: lr=0.0000423 inv=0.4176563 sub=0.0000000
17-03-26 05:30:54 [1] Step: 68200 Acc: 0.71250 0.85500 Cost: 0.67252 0.39155 0.18144 0.09953 Time: 0.00077
17-03-26 05:30:54 [1] Train Extra: lr=0.0000422 inv=0.4239062 sub=0.0000000
17-03-26 05:32:19 [1] Step: 68300 Acc: 0.70844 0.84611 Cost: 0.88107 0.63399 0.14744 0.09964 Time: 0.00075
17-03-26 05:32:19 [1] Train Extra: lr=0.0000421 inv=0.4262500 sub=0.0000000
17-03-26 05:33:38 [1] Step: 68400 Acc: 0.69406 0.84428 Cost: 0.91453 0.63478 0.17998 0.09977 Time: 0.00074
17-03-26 05:33:38 [1] Train Extra: lr=0.0000419 inv=0.4145313 sub=0.0000000
17-03-26 05:34:52 [1] Step: 68500 Acc: 0.70437 0.84650 Cost: 1.10475 0.80009 0.20479 0.09987 Time: 0.00071
17-03-26 05:34:52 [1] Train Extra: lr=0.0000418 inv=0.3937500 sub=0.0000000
17-03-26 05:35:50 [1] Step: 68500 Eval acc: 0.67314 0.85113 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 05:35:50 [1] Eval Extra: inv=0.4027164
17-03-26 05:37:13 [1] Step: 68600 Acc: 0.70312 0.84366 Cost: 1.21656 0.82883 0.28780 0.09992 Time: 0.00076
17-03-26 05:37:13 [1] Train Extra: lr=0.0000417 inv=0.4187500 sub=0.0000000
17-03-26 05:38:42 [1] Step: 68700 Acc: 0.71281 0.84670 Cost: 1.05886 0.72272 0.23612 0.10001 Time: 0.00074
17-03-26 05:38:42 [1] Train Extra: lr=0.0000416 inv=0.4537500 sub=0.0000000
17-03-26 05:40:13 [1] Step: 68800 Acc: 0.68812 0.85710 Cost: 1.18649 0.88711 0.19928 0.10011 Time: 0.00079
17-03-26 05:40:13 [1] Train Extra: lr=0.0000415 inv=0.4373437 sub=0.0000000
17-03-26 05:41:38 [1] Step: 68900 Acc: 0.68781 0.84467 Cost: 1.07495 0.80125 0.17352 0.10017 Time: 0.00073
17-03-26 05:41:38 [1] Train Extra: lr=0.0000413 inv=0.4456250 sub=0.0000000
17-03-26 05:42:52 [1] Step: 69000 Acc: 0.70469 0.85159 Cost: 1.03499 0.70872 0.22601 0.10026 Time: 0.00073
17-03-26 05:42:52 [1] Train Extra: lr=0.0000412 inv=0.4057812 sub=0.0000000
17-03-26 05:43:50 [1] Step: 69000 Eval acc: 0.68220 0.85108 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 05:43:50 [1] Eval Extra: inv=0.4628423
17-03-26 05:45:15 [1] Step: 69100 Acc: 0.68781 0.84678 Cost: 1.01937 0.76514 0.15390 0.10033 Time: 0.00076
17-03-26 05:45:15 [1] Train Extra: lr=0.0000411 inv=0.4339062 sub=0.0000000
17-03-26 05:46:32 [1] Step: 69200 Acc: 0.69344 0.84908 Cost: 1.07784 0.71309 0.26429 0.10046 Time: 0.00074
17-03-26 05:46:32 [1] Train Extra: lr=0.0000410 inv=0.4268750 sub=0.0000000
17-03-26 05:47:51 [1] Step: 69300 Acc: 0.69688 0.84835 Cost: 1.11431 0.74843 0.26529 0.10059 Time: 0.00074
17-03-26 05:47:51 [1] Train Extra: lr=0.0000409 inv=0.4064063 sub=0.0000000
17-03-26 05:49:13 [1] Step: 69400 Acc: 0.68250 0.85593 Cost: 0.93211 0.53835 0.29312 0.10064 Time: 0.00076
17-03-26 05:49:13 [1] Train Extra: lr=0.0000407 inv=0.4045313 sub=0.0000000
17-03-26 05:50:44 [1] Step: 69500 Acc: 0.68375 0.84872 Cost: 0.65301 0.44378 0.10852 0.10071 Time: 0.00077
17-03-26 05:50:44 [1] Train Extra: lr=0.0000406 inv=0.4615625 sub=0.0000000
17-03-26 05:51:41 [1] Step: 69500 Eval acc: 0.67734 0.85264 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 05:51:41 [1] Eval Extra: inv=0.4001215
17-03-26 05:52:56 [1] Step: 69600 Acc: 0.69719 0.84993 Cost: 1.01124 0.67721 0.23328 0.10076 Time: 0.00073
17-03-26 05:52:56 [1] Train Extra: lr=0.0000405 inv=0.4154687 sub=0.0000000
17-03-26 05:54:15 [1] Step: 69700 Acc: 0.70813 0.85536 Cost: 1.13679 0.77678 0.25920 0.10081 Time: 0.00076
17-03-26 05:54:15 [1] Train Extra: lr=0.0000404 inv=0.3943750 sub=0.0000000
17-03-26 05:55:47 [1] Step: 69800 Acc: 0.70688 0.85011 Cost: 0.73084 0.44426 0.18564 0.10094 Time: 0.00078
17-03-26 05:55:47 [1] Train Extra: lr=0.0000403 inv=0.4418750 sub=0.0000000
17-03-26 05:57:19 [1] Step: 69900 Acc: 0.69563 0.84702 Cost: 1.15908 0.79935 0.25865 0.10108 Time: 0.00076
17-03-26 05:57:19 [1] Train Extra: lr=0.0000402 inv=0.4521875 sub=0.0000000
17-03-26 05:58:37 [1] Step: 70000 Acc: 0.69906 0.84820 Cost: 0.95549 0.64549 0.20886 0.10115 Time: 0.00072
17-03-26 05:58:37 [1] Train Extra: lr=0.0000400 inv=0.4259375 sub=0.0000000
17-03-26 05:59:34 [1] Step: 70000 Eval acc: 0.67568 0.84897 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 05:59:34 [1] Eval Extra: inv=0.4221511
17-03-26 05:59:34 [1] Checkpointing.
17-03-26 06:00:47 [1] Step: 70100 Acc: 0.69125 0.84482 Cost: 1.18875 0.84384 0.24364 0.10126 Time: 0.00072
17-03-26 06:00:47 [1] Train Extra: lr=0.0000399 inv=0.4423437 sub=0.0000000
17-03-26 06:02:13 [1] Step: 70200 Acc: 0.68563 0.85399 Cost: 1.28517 0.87639 0.30745 0.10133 Time: 0.00077
17-03-26 06:02:13 [1] Train Extra: lr=0.0000398 inv=0.4359375 sub=0.0000000
17-03-26 06:03:30 [1] Step: 70300 Acc: 0.69563 0.85167 Cost: 1.14515 0.85280 0.19092 0.10143 Time: 0.00074
17-03-26 06:03:30 [1] Train Extra: lr=0.0000397 inv=0.4170313 sub=0.0000000
17-03-26 06:04:45 [1] Step: 70400 Acc: 0.70625 0.85197 Cost: 1.00943 0.67833 0.22952 0.10158 Time: 0.00071
17-03-26 06:04:45 [1] Train Extra: lr=0.0000396 inv=0.4214062 sub=0.0000000
17-03-26 06:06:10 [1] Step: 70500 Acc: 0.68469 0.85367 Cost: 1.24770 0.90611 0.24000 0.10160 Time: 0.00078
17-03-26 06:06:10 [1] Train Extra: lr=0.0000395 inv=0.4195313 sub=0.0000000
17-03-26 06:07:09 [1] Step: 70500 Eval acc: 0.67900 0.85448 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 06:07:09 [1] Eval Extra: inv=0.4217646
17-03-26 06:08:29 [1] Step: 70600 Acc: 0.69563 0.84556 Cost: 1.23225 0.91716 0.21341 0.10167 Time: 0.00073
17-03-26 06:08:29 [1] Train Extra: lr=0.0000394 inv=0.4323438 sub=0.0000000
17-03-26 06:09:47 [1] Step: 70700 Acc: 0.69719 0.85032 Cost: 1.00845 0.69782 0.20893 0.10171 Time: 0.00075
17-03-26 06:09:47 [1] Train Extra: lr=0.0000392 inv=0.4028125 sub=0.0000000
17-03-26 06:11:01 [1] Step: 70800 Acc: 0.69500 0.85145 Cost: 0.82622 0.60098 0.12342 0.10182 Time: 0.00071
17-03-26 06:11:01 [1] Train Extra: lr=0.0000391 inv=0.4003125 sub=0.0000000
17-03-26 06:12:26 [1] Step: 70900 Acc: 0.66969 0.84472 Cost: 1.14827 0.75464 0.29174 0.10189 Time: 0.00075
17-03-26 06:12:26 [1] Train Extra: lr=0.0000390 inv=0.4248438 sub=0.0000000
17-03-26 06:13:39 [1] Step: 71000 Acc: 0.70688 0.85583 Cost: 0.86689 0.55753 0.20740 0.10197 Time: 0.00072
17-03-26 06:13:39 [1] Train Extra: lr=0.0000389 inv=0.4148438 sub=0.0000000
17-03-26 06:14:35 [1] Step: 71000 Eval acc: 0.67469 0.84375 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 06:14:35 [1] Eval Extra: inv=0.4492602
17-03-26 06:15:53 [1] Step: 71100 Acc: 0.69594 0.84424 Cost: 1.05822 0.74920 0.20705 0.10197 Time: 0.00074
17-03-26 06:15:53 [1] Train Extra: lr=0.0000388 inv=0.4087500 sub=0.0000000
17-03-26 06:17:20 [1] Step: 71200 Acc: 0.69281 0.84738 Cost: 1.13173 0.78810 0.24161 0.10202 Time: 0.00076
17-03-26 06:17:20 [1] Train Extra: lr=0.0000387 inv=0.4370312 sub=0.0000000
17-03-26 06:18:47 [1] Step: 71300 Acc: 0.67375 0.85573 Cost: 1.08779 0.72079 0.26494 0.10207 Time: 0.00078
17-03-26 06:18:47 [1] Train Extra: lr=0.0000386 inv=0.4004687 sub=0.0000000
17-03-26 06:20:18 [1] Step: 71400 Acc: 0.67094 0.84501 Cost: 0.90941 0.63958 0.16772 0.10212 Time: 0.00076
17-03-26 06:20:18 [1] Train Extra: lr=0.0000385 inv=0.4534375 sub=0.0000000
17-03-26 06:21:37 [1] Step: 71500 Acc: 0.70344 0.84783 Cost: 0.81444 0.59665 0.11566 0.10213 Time: 0.00075
17-03-26 06:21:37 [1] Train Extra: lr=0.0000384 inv=0.4242187 sub=0.0000000
17-03-26 06:22:35 [1] Step: 71500 Eval acc: 0.67999 0.84656 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 06:22:35 [1] Eval Extra: inv=0.4278379
17-03-26 06:23:48 [1] Step: 71600 Acc: 0.69375 0.85116 Cost: 0.87603 0.55882 0.21501 0.10220 Time: 0.00071
17-03-26 06:23:48 [1] Train Extra: lr=0.0000382 inv=0.4114062 sub=0.0000000
17-03-26 06:25:13 [1] Step: 71700 Acc: 0.68875 0.84751 Cost: 1.21981 0.86619 0.25129 0.10233 Time: 0.00076
17-03-26 06:25:13 [1] Train Extra: lr=0.0000381 inv=0.4134375 sub=0.0000000
17-03-26 06:26:33 [1] Step: 71800 Acc: 0.70594 0.85057 Cost: 0.95546 0.63893 0.21412 0.10241 Time: 0.00073
17-03-26 06:26:33 [1] Train Extra: lr=0.0000380 inv=0.4195313 sub=0.0000000
17-03-26 06:27:56 [1] Step: 71900 Acc: 0.69375 0.85161 Cost: 0.84686 0.63876 0.10563 0.10246 Time: 0.00075
17-03-26 06:27:56 [1] Train Extra: lr=0.0000379 inv=0.4176563 sub=0.0000000
17-03-26 06:29:18 [1] Step: 72000 Acc: 0.70219 0.84859 Cost: 1.05856 0.79266 0.16344 0.10246 Time: 0.00073
17-03-26 06:29:18 [1] Train Extra: lr=0.0000378 inv=0.4021875 sub=0.0000000
17-03-26 06:30:16 [1] Step: 72000 Eval acc: 0.68110 0.85074 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 06:30:16 [1] Eval Extra: inv=0.3975817
17-03-26 06:31:40 [1] Step: 72100 Acc: 0.68125 0.84587 Cost: 1.26155 0.95429 0.20479 0.10247 Time: 0.00075
17-03-26 06:31:40 [1] Train Extra: lr=0.0000377 inv=0.4173438 sub=0.0000000
17-03-26 06:32:55 [1] Step: 72200 Acc: 0.69937 0.85467 Cost: 0.89122 0.58148 0.20722 0.10252 Time: 0.00074
17-03-26 06:32:55 [1] Train Extra: lr=0.0000376 inv=0.3848437 sub=0.0000000
17-03-26 06:34:25 [1] Step: 72300 Acc: 0.68781 0.84450 Cost: 0.79912 0.50788 0.18866 0.10258 Time: 0.00076
17-03-26 06:34:25 [1] Train Extra: lr=0.0000375 inv=0.4417187 sub=0.0000000
17-03-26 06:36:04 [1] Step: 72400 Acc: 0.67937 0.85022 Cost: 0.86400 0.51125 0.25015 0.10261 Time: 0.00080
17-03-26 06:36:04 [1] Train Extra: lr=0.0000374 inv=0.4275000 sub=0.0000000
17-03-26 06:37:25 [1] Step: 72500 Acc: 0.69812 0.85642 Cost: 1.10511 0.88322 0.11929 0.10260 Time: 0.00076
17-03-26 06:37:25 [1] Train Extra: lr=0.0000373 inv=0.3848437 sub=0.0000000
17-03-26 06:38:22 [1] Step: 72500 Eval acc: 0.68319 0.84982 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 06:38:22 [1] Eval Extra: inv=0.4211020
17-03-26 06:39:47 [1] Step: 72600 Acc: 0.69000 0.84865 Cost: 1.00339 0.68142 0.21938 0.10259 Time: 0.00076
17-03-26 06:39:47 [1] Train Extra: lr=0.0000372 inv=0.4151563 sub=0.0000000
17-03-26 06:41:07 [1] Step: 72700 Acc: 0.68156 0.85221 Cost: 0.89719 0.59446 0.20004 0.10270 Time: 0.00076
17-03-26 06:41:07 [1] Train Extra: lr=0.0000371 inv=0.3917188 sub=0.0000000
17-03-26 06:42:39 [1] Step: 72800 Acc: 0.69125 0.84239 Cost: 1.12727 0.81118 0.21343 0.10266 Time: 0.00077
17-03-26 06:42:39 [1] Train Extra: lr=0.0000369 inv=0.4540625 sub=0.0000000
17-03-26 06:43:58 [1] Step: 72900 Acc: 0.68719 0.84707 Cost: 1.10761 0.74983 0.25512 0.10267 Time: 0.00073
17-03-26 06:43:58 [1] Train Extra: lr=0.0000368 inv=0.4448437 sub=0.0000000
17-03-26 06:45:17 [1] Step: 73000 Acc: 0.69688 0.84748 Cost: 1.02615 0.68214 0.24136 0.10265 Time: 0.00072
17-03-26 06:45:17 [1] Train Extra: lr=0.0000367 inv=0.4562500 sub=0.0000000
17-03-26 06:46:14 [1] Step: 73000 Eval acc: 0.68087 0.85101 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 06:46:14 [1] Eval Extra: inv=0.4192800
17-03-26 06:47:36 [1] Step: 73100 Acc: 0.69063 0.84432 Cost: 1.20340 0.89697 0.20379 0.10265 Time: 0.00073
17-03-26 06:47:36 [1] Train Extra: lr=0.0000366 inv=0.4470312 sub=0.0000000
17-03-26 06:48:59 [1] Step: 73200 Acc: 0.68625 0.85171 Cost: 0.96586 0.68919 0.17392 0.10274 Time: 0.00076
17-03-26 06:48:59 [1] Train Extra: lr=0.0000365 inv=0.4298438 sub=0.0000000
17-03-26 06:50:18 [1] Step: 73300 Acc: 0.70781 0.84492 Cost: 0.97673 0.65182 0.22213 0.10278 Time: 0.00073
17-03-26 06:50:18 [1] Train Extra: lr=0.0000364 inv=0.4092188 sub=0.0000000
17-03-26 06:51:44 [1] Step: 73400 Acc: 0.68250 0.85331 Cost: 1.17196 0.80753 0.26161 0.10282 Time: 0.00077
17-03-26 06:51:44 [1] Train Extra: lr=0.0000363 inv=0.4160937 sub=0.0000000
17-03-26 06:53:05 [1] Step: 73500 Acc: 0.68688 0.84496 Cost: 1.18856 0.84121 0.24448 0.10287 Time: 0.00072
17-03-26 06:53:05 [1] Train Extra: lr=0.0000362 inv=0.4307813 sub=0.0000000
17-03-26 06:54:03 [1] Step: 73500 Eval acc: 0.67856 0.85343 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 06:54:03 [1] Eval Extra: inv=0.3680985
17-03-26 06:55:29 [1] Step: 73600 Acc: 0.69875 0.84930 Cost: 0.97184 0.75682 0.11208 0.10294 Time: 0.00076
17-03-26 06:55:29 [1] Train Extra: lr=0.0000361 inv=0.4003125 sub=0.0000000
17-03-26 06:56:55 [1] Step: 73700 Acc: 0.67625 0.85556 Cost: 0.92691 0.61836 0.20550 0.10305 Time: 0.00076
17-03-26 06:56:55 [1] Train Extra: lr=0.0000360 inv=0.4275000 sub=0.0000000
17-03-26 06:58:12 [1] Step: 73800 Acc: 0.68844 0.85340 Cost: 1.02405 0.66101 0.25998 0.10306 Time: 0.00074
17-03-26 06:58:12 [1] Train Extra: lr=0.0000359 inv=0.3918750 sub=0.0000000
17-03-26 06:59:42 [1] Step: 73900 Acc: 0.68875 0.85126 Cost: 1.31282 0.93945 0.27033 0.10305 Time: 0.00078
17-03-26 06:59:42 [1] Train Extra: lr=0.0000358 inv=0.4434375 sub=0.0000000
17-03-26 07:01:03 [1] Step: 74000 Acc: 0.69969 0.84901 Cost: 0.95442 0.76995 0.08141 0.10305 Time: 0.00074
17-03-26 07:01:03 [1] Train Extra: lr=0.0000357 inv=0.4118750 sub=0.0000000
17-03-26 07:02:01 [1] Step: 74000 Eval acc: 0.68275 0.84878 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 07:02:01 [1] Eval Extra: inv=0.4107222
17-03-26 07:03:19 [1] Step: 74100 Acc: 0.68719 0.84664 Cost: 1.06635 0.66807 0.29516 0.10313 Time: 0.00073
17-03-26 07:03:19 [1] Train Extra: lr=0.0000356 inv=0.4151563 sub=0.0000000
17-03-26 07:04:44 [1] Step: 74200 Acc: 0.70125 0.85403 Cost: 1.09624 0.72345 0.26966 0.10313 Time: 0.00078
17-03-26 07:04:44 [1] Train Extra: lr=0.0000355 inv=0.4146875 sub=0.0000000
17-03-26 07:06:03 [1] Step: 74300 Acc: 0.68656 0.85450 Cost: 1.02527 0.72351 0.19862 0.10314 Time: 0.00077
17-03-26 07:06:03 [1] Train Extra: lr=0.0000354 inv=0.4012500 sub=0.0000000
17-03-26 07:07:24 [1] Step: 74400 Acc: 0.68188 0.84506 Cost: 1.21540 0.97259 0.13955 0.10325 Time: 0.00072
17-03-26 07:07:24 [1] Train Extra: lr=0.0000353 inv=0.4667188 sub=0.0000000
17-03-26 07:08:37 [1] Step: 74500 Acc: 0.69375 0.85057 Cost: 0.88728 0.60322 0.18074 0.10331 Time: 0.00073
17-03-26 07:08:37 [1] Train Extra: lr=0.0000352 inv=0.4056250 sub=0.0000000
17-03-26 07:09:34 [1] Step: 74500 Eval acc: 0.68595 0.84970 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 07:09:34 [1] Eval Extra: inv=0.4315923
17-03-26 07:11:09 [1] Step: 74600 Acc: 0.67781 0.85792 Cost: 0.97111 0.65780 0.20994 0.10337 Time: 0.00082
17-03-26 07:11:09 [1] Train Extra: lr=0.0000351 inv=0.4289062 sub=0.0000000
17-03-26 07:12:24 [1] Step: 74700 Acc: 0.70656 0.84436 Cost: 1.01894 0.73619 0.17940 0.10335 Time: 0.00069
17-03-26 07:12:24 [1] Train Extra: lr=0.0000350 inv=0.4306250 sub=0.0000000
17-03-26 07:13:48 [1] Step: 74800 Acc: 0.67344 0.84644 Cost: 1.11817 0.77587 0.23884 0.10346 Time: 0.00074
17-03-26 07:13:48 [1] Train Extra: lr=0.0000349 inv=0.4357813 sub=0.0000000
17-03-26 07:15:11 [1] Step: 74900 Acc: 0.69469 0.84780 Cost: 0.85767 0.55169 0.20251 0.10346 Time: 0.00075
17-03-26 07:15:11 [1] Train Extra: lr=0.0000348 inv=0.4487500 sub=0.0000000
17-03-26 07:16:39 [1] Step: 75000 Acc: 0.69781 0.85035 Cost: 1.00767 0.61314 0.29108 0.10346 Time: 0.00075
17-03-26 07:16:39 [1] Train Extra: lr=0.0000347 inv=0.4473437 sub=0.0000000
17-03-26 07:17:37 [1] Step: 75000 Eval acc: 0.68441 0.85708 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 07:17:37 [1] Eval Extra: inv=0.3902385
17-03-26 07:17:37 [1] Checkpointing.
17-03-26 07:18:57 [1] Step: 75100 Acc: 0.69312 0.84524 Cost: 0.85026 0.59672 0.14997 0.10357 Time: 0.00071
17-03-26 07:18:57 [1] Train Extra: lr=0.0000346 inv=0.4598437 sub=0.0000000
17-03-26 07:20:19 [1] Step: 75200 Acc: 0.70281 0.84278 Cost: 0.99925 0.71408 0.18162 0.10355 Time: 0.00074
17-03-26 07:20:19 [1] Train Extra: lr=0.0000345 inv=0.4560938 sub=0.0000000
17-03-26 07:21:43 [1] Step: 75300 Acc: 0.68719 0.84595 Cost: 1.02414 0.68261 0.23798 0.10355 Time: 0.00073
17-03-26 07:21:43 [1] Train Extra: lr=0.0000344 inv=0.4406250 sub=0.0000000
17-03-26 07:22:58 [1] Step: 75400 Acc: 0.68531 0.85247 Cost: 1.00351 0.72137 0.17854 0.10360 Time: 0.00072
17-03-26 07:22:58 [1] Train Extra: lr=0.0000343 inv=0.4026562 sub=0.0000000
17-03-26 07:24:15 [1] Step: 75500 Acc: 0.68625 0.84898 Cost: 1.14310 0.74642 0.29307 0.10361 Time: 0.00074
17-03-26 07:24:15 [1] Train Extra: lr=0.0000342 inv=0.4237500 sub=0.0000000
17-03-26 07:25:14 [1] Step: 75500 Eval acc: 0.68386 0.85355 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 07:25:14 [1] Eval Extra: inv=0.4375000
17-03-26 07:26:40 [1] Step: 75600 Acc: 0.70531 0.84850 Cost: 1.07906 0.73865 0.23672 0.10369 Time: 0.00076
17-03-26 07:26:40 [1] Train Extra: lr=0.0000341 inv=0.4490625 sub=0.0000000
17-03-26 07:28:01 [1] Step: 75700 Acc: 0.69281 0.84885 Cost: 1.05946 0.70105 0.25460 0.10381 Time: 0.00072
17-03-26 07:28:01 [1] Train Extra: lr=0.0000340 inv=0.4390625 sub=0.0000000
17-03-26 07:29:26 [1] Step: 75800 Acc: 0.71188 0.85214 Cost: 0.94843 0.68666 0.15790 0.10388 Time: 0.00076
17-03-26 07:29:26 [1] Train Extra: lr=0.0000339 inv=0.4098438 sub=0.0000000
17-03-26 07:30:45 [1] Step: 75900 Acc: 0.71062 0.85184 Cost: 0.88214 0.49744 0.28061 0.10409 Time: 0.00074
17-03-26 07:30:45 [1] Train Extra: lr=0.0000338 inv=0.4032812 sub=0.0000000
17-03-26 07:32:11 [1] Step: 76000 Acc: 0.71531 0.85040 Cost: 0.92764 0.69072 0.13272 0.10420 Time: 0.00077
17-03-26 07:32:11 [1] Train Extra: lr=0.0000337 inv=0.4140625 sub=0.0000000
17-03-26 07:33:08 [1] Step: 76000 Eval acc: 0.68595 0.85172 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 07:33:08 [1] Eval Extra: inv=0.4135932
17-03-26 07:34:29 [1] Step: 76100 Acc: 0.71375 0.85301 Cost: 1.07867 0.68043 0.29392 0.10432 Time: 0.00074
17-03-26 07:34:29 [1] Train Extra: lr=0.0000336 inv=0.4275000 sub=0.0000000
17-03-26 07:35:48 [1] Step: 76200 Acc: 0.71219 0.85001 Cost: 0.71725 0.47010 0.14270 0.10444 Time: 0.00075
17-03-26 07:35:48 [1] Train Extra: lr=0.0000335 inv=0.4096875 sub=0.0000000
17-03-26 07:37:04 [1] Step: 76300 Acc: 0.71875 0.84987 Cost: 0.88312 0.67377 0.10479 0.10456 Time: 0.00073
17-03-26 07:37:04 [1] Train Extra: lr=0.0000334 inv=0.3942188 sub=0.0000000
17-03-26 07:38:29 [1] Step: 76400 Acc: 0.70188 0.84781 Cost: 0.88498 0.63177 0.14860 0.10461 Time: 0.00078
17-03-26 07:38:29 [1] Train Extra: lr=0.0000333 inv=0.4287500 sub=0.0000000
17-03-26 07:39:53 [1] Step: 76500 Acc: 0.71969 0.85052 Cost: 0.80456 0.51525 0.18457 0.10474 Time: 0.00075
17-03-26 07:39:53 [1] Train Extra: lr=0.0000332 inv=0.3978125 sub=0.0000000
17-03-26 07:40:50 [1] Step: 76500 Eval acc: 0.68375 0.85451 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 07:40:50 [1] Eval Extra: inv=0.4133723
17-03-26 07:42:21 [1] Step: 76600 Acc: 0.69063 0.85002 Cost: 0.78625 0.49204 0.18935 0.10487 Time: 0.00078
17-03-26 07:42:21 [1] Train Extra: lr=0.0000331 inv=0.4662500 sub=0.0000000
17-03-26 07:43:41 [1] Step: 76700 Acc: 0.70875 0.84991 Cost: 1.04260 0.76917 0.16842 0.10500 Time: 0.00073
17-03-26 07:43:41 [1] Train Extra: lr=0.0000330 inv=0.4321875 sub=0.0000000
17-03-26 07:45:07 [1] Step: 76800 Acc: 0.71469 0.84821 Cost: 0.97134 0.65865 0.20764 0.10505 Time: 0.00076
17-03-26 07:45:07 [1] Train Extra: lr=0.0000329 inv=0.4421875 sub=0.0000000
17-03-26 07:46:41 [1] Step: 76900 Acc: 0.68781 0.85276 Cost: 1.14812 0.82347 0.21946 0.10519 Time: 0.00082
17-03-26 07:46:41 [1] Train Extra: lr=0.0000328 inv=0.4346875 sub=0.0000000
17-03-26 07:48:04 [1] Step: 77000 Acc: 0.71250 0.84489 Cost: 1.09739 0.73112 0.26099 0.10528 Time: 0.00073
17-03-26 07:48:04 [1] Train Extra: lr=0.0000327 inv=0.4590625 sub=0.0000000
17-03-26 07:49:03 [1] Step: 77000 Eval acc: 0.67878 0.85340 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 07:49:03 [1] Eval Extra: inv=0.4080720
17-03-26 07:50:21 [1] Step: 77100 Acc: 0.70906 0.85016 Cost: 0.92373 0.70608 0.11227 0.10538 Time: 0.00075
17-03-26 07:50:21 [1] Train Extra: lr=0.0000326 inv=0.4137500 sub=0.0000000
17-03-26 07:51:37 [1] Step: 77200 Acc: 0.71281 0.84828 Cost: 0.66890 0.45447 0.10897 0.10546 Time: 0.00073
17-03-26 07:51:37 [1] Train Extra: lr=0.0000326 inv=0.4004687 sub=0.0000000
17-03-26 07:52:55 [1] Step: 77300 Acc: 0.70625 0.84252 Cost: 1.24621 0.84827 0.29241 0.10553 Time: 0.00072
17-03-26 07:52:55 [1] Train Extra: lr=0.0000325 inv=0.4259375 sub=0.0000000
17-03-26 07:54:19 [1] Step: 77400 Acc: 0.70562 0.84682 Cost: 0.74487 0.52659 0.11265 0.10563 Time: 0.00075
17-03-26 07:54:19 [1] Train Extra: lr=0.0000324 inv=0.4526562 sub=0.0000000
17-03-26 07:55:35 [1] Step: 77500 Acc: 0.72000 0.85341 Cost: 0.93089 0.63404 0.19112 0.10573 Time: 0.00073
17-03-26 07:55:35 [1] Train Extra: lr=0.0000323 inv=0.3903125 sub=0.0000000
17-03-26 07:56:32 [1] Step: 77500 Eval acc: 0.67867 0.84771 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 07:56:32 [1] Eval Extra: inv=0.4296047
17-03-26 07:58:03 [1] Step: 77600 Acc: 0.70125 0.84290 Cost: 1.04719 0.76670 0.17471 0.10579 Time: 0.00078
17-03-26 07:58:03 [1] Train Extra: lr=0.0000322 inv=0.4662500 sub=0.0000000
17-03-26 07:59:24 [1] Step: 77700 Acc: 0.69812 0.84633 Cost: 0.82699 0.60585 0.11532 0.10582 Time: 0.00074
17-03-26 07:59:24 [1] Train Extra: lr=0.0000321 inv=0.4351563 sub=0.0000000
17-03-26 08:00:45 [1] Step: 77800 Acc: 0.70188 0.84472 Cost: 1.06336 0.73248 0.22495 0.10593 Time: 0.00074
17-03-26 08:00:45 [1] Train Extra: lr=0.0000320 inv=0.4193750 sub=0.0000000
17-03-26 08:02:10 [1] Step: 77900 Acc: 0.71250 0.84951 Cost: 1.18152 0.82282 0.25266 0.10605 Time: 0.00078
17-03-26 08:02:10 [1] Train Extra: lr=0.0000319 inv=0.4315625 sub=0.0000000
17-03-26 08:03:31 [1] Step: 78000 Acc: 0.69750 0.85057 Cost: 1.19247 0.83345 0.25291 0.10612 Time: 0.00074
17-03-26 08:03:31 [1] Train Extra: lr=0.0000318 inv=0.4401563 sub=0.0000000
17-03-26 08:04:29 [1] Step: 78000 Eval acc: 0.67922 0.84847 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 08:04:29 [1] Eval Extra: inv=0.4351811
17-03-26 08:05:50 [1] Step: 78100 Acc: 0.71375 0.84686 Cost: 1.21986 0.85973 0.25390 0.10622 Time: 0.00075
17-03-26 08:05:50 [1] Train Extra: lr=0.0000317 inv=0.4260937 sub=0.0000000
17-03-26 08:07:15 [1] Step: 78200 Acc: 0.70469 0.84987 Cost: 1.12132 0.72693 0.28810 0.10629 Time: 0.00077
17-03-26 08:07:15 [1] Train Extra: lr=0.0000316 inv=0.4312500 sub=0.0000000
17-03-26 08:08:38 [1] Step: 78300 Acc: 0.70281 0.84476 Cost: 1.02861 0.64331 0.27898 0.10632 Time: 0.00074
17-03-26 08:08:38 [1] Train Extra: lr=0.0000315 inv=0.4295312 sub=0.0000000
17-03-26 08:09:59 [1] Step: 78400 Acc: 0.70156 0.85152 Cost: 0.85984 0.61291 0.14052 0.10640 Time: 0.00075
17-03-26 08:09:59 [1] Train Extra: lr=0.0000314 inv=0.4246875 sub=0.0000000
17-03-26 08:11:13 [1] Step: 78500 Acc: 0.72031 0.85249 Cost: 1.15504 0.80675 0.24185 0.10645 Time: 0.00073
17-03-26 08:11:13 [1] Train Extra: lr=0.0000314 inv=0.4167188 sub=0.0000000
17-03-26 08:12:11 [1] Step: 78500 Eval acc: 0.68386 0.85263 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 08:12:11 [1] Eval Extra: inv=0.4079064
17-03-26 08:13:25 [1] Step: 78600 Acc: 0.68312 0.84833 Cost: 0.89365 0.61571 0.17137 0.10657 Time: 0.00072
17-03-26 08:13:25 [1] Train Extra: lr=0.0000313 inv=0.3925000 sub=0.0000000
17-03-26 08:14:43 [1] Step: 78700 Acc: 0.68906 0.84869 Cost: 0.94156 0.73029 0.10465 0.10663 Time: 0.00075
17-03-26 08:14:43 [1] Train Extra: lr=0.0000312 inv=0.3964063 sub=0.0000000
17-03-26 08:16:04 [1] Step: 78800 Acc: 0.69188 0.84684 Cost: 1.00252 0.71426 0.18149 0.10677 Time: 0.00073
17-03-26 08:16:04 [1] Train Extra: lr=0.0000311 inv=0.4181250 sub=0.0000000
17-03-26 08:17:31 [1] Step: 78900 Acc: 0.69125 0.84389 Cost: 1.12837 0.76517 0.25632 0.10689 Time: 0.00077
17-03-26 08:17:31 [1] Train Extra: lr=0.0000310 inv=0.4287500 sub=0.0000000
17-03-26 08:18:56 [1] Step: 79000 Acc: 0.69688 0.84523 Cost: 1.01800 0.72679 0.18427 0.10694 Time: 0.00076
17-03-26 08:18:56 [1] Train Extra: lr=0.0000309 inv=0.4301563 sub=0.0000000
17-03-26 08:19:54 [1] Step: 79000 Eval acc: 0.68419 0.85264 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 08:19:54 [1] Eval Extra: inv=0.4399293
17-03-26 08:21:19 [1] Step: 79100 Acc: 0.69500 0.85865 Cost: 1.14328 0.85570 0.18065 0.10693 Time: 0.00080
17-03-26 08:21:19 [1] Train Extra: lr=0.0000308 inv=0.3909375 sub=0.0000000
17-03-26 08:22:43 [1] Step: 79200 Acc: 0.69500 0.83753 Cost: 1.16222 0.76730 0.28795 0.10697 Time: 0.00072
17-03-26 08:22:43 [1] Train Extra: lr=0.0000307 inv=0.4493750 sub=0.0000000
17-03-26 08:24:08 [1] Step: 79300 Acc: 0.69812 0.84680 Cost: 1.09809 0.78533 0.20573 0.10703 Time: 0.00077
17-03-26 08:24:08 [1] Train Extra: lr=0.0000306 inv=0.4254688 sub=0.0000000
17-03-26 08:25:30 [1] Step: 79400 Acc: 0.69219 0.84647 Cost: 0.91785 0.65313 0.15768 0.10704 Time: 0.00075
17-03-26 08:25:30 [1] Train Extra: lr=0.0000306 inv=0.4282813 sub=0.0000000
17-03-26 08:27:09 [1] Step: 79500 Acc: 0.70000 0.85637 Cost: 1.58322 1.13854 0.33749 0.10719 Time: 0.00082
17-03-26 08:27:09 [1] Train Extra: lr=0.0000305 inv=0.4353125 sub=0.0000000
17-03-26 08:28:05 [1] Step: 79500 Eval acc: 0.67845 0.85661 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 08:28:05 [1] Eval Extra: inv=0.4662102
17-03-26 08:29:25 [1] Step: 79600 Acc: 0.71219 0.85093 Cost: 1.15055 0.77819 0.26508 0.10727 Time: 0.00074
17-03-26 08:29:25 [1] Train Extra: lr=0.0000304 inv=0.4300000 sub=0.0000000
17-03-26 08:30:51 [1] Step: 79700 Acc: 0.68781 0.84637 Cost: 1.03832 0.75876 0.17225 0.10731 Time: 0.00076
17-03-26 08:30:51 [1] Train Extra: lr=0.0000303 inv=0.4567188 sub=0.0000000
17-03-26 08:32:19 [1] Step: 79800 Acc: 0.68312 0.84992 Cost: 1.02028 0.71342 0.19947 0.10738 Time: 0.00075
17-03-26 08:32:19 [1] Train Extra: lr=0.0000302 inv=0.4503125 sub=0.0000000
17-03-26 08:33:45 [1] Step: 79900 Acc: 0.70375 0.85137 Cost: 0.94263 0.61790 0.21729 0.10744 Time: 0.00078
17-03-26 08:33:45 [1] Train Extra: lr=0.0000301 inv=0.4357813 sub=0.0000000
17-03-26 08:35:07 [1] Step: 80000 Acc: 0.69094 0.85294 Cost: 0.89968 0.65742 0.13469 0.10757 Time: 0.00075
17-03-26 08:35:07 [1] Train Extra: lr=0.0000300 inv=0.4209375 sub=0.0000000
17-03-26 08:36:08 [1] Step: 80000 Eval acc: 0.68419 0.85779 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00018
17-03-26 08:36:08 [1] Eval Extra: inv=0.3768220
17-03-26 08:36:08 [1] Checkpointing.
17-03-26 08:37:31 [1] Step: 80100 Acc: 0.70906 0.84762 Cost: 1.18177 0.77960 0.29458 0.10759 Time: 0.00074
17-03-26 08:37:31 [1] Train Extra: lr=0.0000299 inv=0.4146875 sub=0.0000000
17-03-26 08:38:54 [1] Step: 80200 Acc: 0.70719 0.84433 Cost: 0.95950 0.67661 0.17535 0.10754 Time: 0.00074
17-03-26 08:38:54 [1] Train Extra: lr=0.0000299 inv=0.4303125 sub=0.0000000
17-03-26 08:40:16 [1] Step: 80300 Acc: 0.69500 0.84871 Cost: 1.13712 0.76466 0.26485 0.10761 Time: 0.00074
17-03-26 08:40:16 [1] Train Extra: lr=0.0000298 inv=0.4298438 sub=0.0000000
17-03-26 08:41:41 [1] Step: 80400 Acc: 0.70719 0.84981 Cost: 0.83878 0.56861 0.16253 0.10764 Time: 0.00077
17-03-26 08:41:41 [1] Train Extra: lr=0.0000297 inv=0.4081250 sub=0.0000000
17-03-26 08:43:01 [1] Step: 80500 Acc: 0.69406 0.85537 Cost: 0.86848 0.54172 0.21909 0.10767 Time: 0.00076
17-03-26 08:43:01 [1] Train Extra: lr=0.0000296 inv=0.4051562 sub=0.0000000
17-03-26 08:43:58 [1] Step: 80500 Eval acc: 0.68352 0.85290 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 08:43:58 [1] Eval Extra: inv=0.4273962
17-03-26 08:45:23 [1] Step: 80600 Acc: 0.69781 0.84912 Cost: 0.97807 0.66427 0.20609 0.10770 Time: 0.00075
17-03-26 08:45:23 [1] Train Extra: lr=0.0000295 inv=0.4404688 sub=0.0000000
17-03-26 08:46:44 [1] Step: 80700 Acc: 0.69219 0.85822 Cost: 0.80889 0.51119 0.18994 0.10776 Time: 0.00077
17-03-26 08:46:44 [1] Train Extra: lr=0.0000294 inv=0.4189062 sub=0.0000000
17-03-26 08:48:05 [1] Step: 80800 Acc: 0.70031 0.85091 Cost: 0.90031 0.63551 0.15700 0.10781 Time: 0.00073
17-03-26 08:48:05 [1] Train Extra: lr=0.0000294 inv=0.4075000 sub=0.0000000
17-03-26 08:49:31 [1] Step: 80900 Acc: 0.67688 0.84614 Cost: 0.92408 0.59420 0.22196 0.10792 Time: 0.00074
17-03-26 08:49:31 [1] Train Extra: lr=0.0000293 inv=0.4448437 sub=0.0000000
17-03-26 08:50:56 [1] Step: 81000 Acc: 0.69688 0.85402 Cost: 0.99381 0.62347 0.26236 0.10799 Time: 0.00079
17-03-26 08:50:56 [1] Train Extra: lr=0.0000292 inv=0.3920312 sub=0.0000000
17-03-26 08:51:54 [1] Step: 81000 Eval acc: 0.68640 0.85457 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 08:51:54 [1] Eval Extra: inv=0.4232553
17-03-26 08:53:19 [1] Step: 81100 Acc: 0.69500 0.84824 Cost: 1.26756 0.89053 0.26895 0.10808 Time: 0.00078
17-03-26 08:53:19 [1] Train Extra: lr=0.0000291 inv=0.4260937 sub=0.0000000
17-03-26 08:54:28 [1] Step: 81200 Acc: 0.69656 0.85047 Cost: 0.83496 0.55407 0.17279 0.10809 Time: 0.00070
17-03-26 08:54:28 [1] Train Extra: lr=0.0000290 inv=0.4103125 sub=0.0000000
17-03-26 08:56:01 [1] Step: 81300 Acc: 0.70688 0.85886 Cost: 1.09301 0.74748 0.23743 0.10810 Time: 0.00082
17-03-26 08:56:01 [1] Train Extra: lr=0.0000289 inv=0.4271875 sub=0.0000000
17-03-26 08:57:34 [1] Step: 81400 Acc: 0.67188 0.85458 Cost: 1.19882 0.77827 0.31235 0.10820 Time: 0.00080
17-03-26 08:57:34 [1] Train Extra: lr=0.0000288 inv=0.4279688 sub=0.0000000
17-03-26 08:58:57 [1] Step: 81500 Acc: 0.68469 0.85027 Cost: 1.21737 0.80326 0.30587 0.10823 Time: 0.00076
17-03-26 08:58:57 [1] Train Extra: lr=0.0000288 inv=0.4306250 sub=0.0000000
17-03-26 08:59:54 [1] Step: 81500 Eval acc: 0.68617 0.85530 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 08:59:54 [1] Eval Extra: inv=0.4455610
17-03-26 09:01:14 [1] Step: 81600 Acc: 0.71313 0.84916 Cost: 0.86964 0.59316 0.16822 0.10825 Time: 0.00072
17-03-26 09:01:14 [1] Train Extra: lr=0.0000287 inv=0.4164062 sub=0.0000000
17-03-26 09:02:31 [1] Step: 81700 Acc: 0.70063 0.84955 Cost: 0.98840 0.62763 0.25249 0.10829 Time: 0.00073
17-03-26 09:02:31 [1] Train Extra: lr=0.0000286 inv=0.4339062 sub=0.0000000
17-03-26 09:03:56 [1] Step: 81800 Acc: 0.69656 0.84384 Cost: 1.03177 0.65980 0.26363 0.10834 Time: 0.00075
17-03-26 09:03:56 [1] Train Extra: lr=0.0000285 inv=0.4293750 sub=0.0000000
17-03-26 09:05:15 [1] Step: 81900 Acc: 0.70813 0.85402 Cost: 0.75533 0.54907 0.09795 0.10831 Time: 0.00076
17-03-26 09:05:15 [1] Train Extra: lr=0.0000284 inv=0.3987500 sub=0.0000000
17-03-26 09:06:48 [1] Step: 82000 Acc: 0.68719 0.85284 Cost: 0.83740 0.56955 0.15954 0.10832 Time: 0.00079
17-03-26 09:06:48 [1] Train Extra: lr=0.0000284 inv=0.4314062 sub=0.0000000
17-03-26 09:07:46 [1] Step: 82000 Eval acc: 0.68684 0.85145 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 09:07:46 [1] Eval Extra: inv=0.4394876
17-03-26 09:09:12 [1] Step: 82100 Acc: 0.69469 0.85311 Cost: 1.15141 0.89871 0.14432 0.10838 Time: 0.00076
17-03-26 09:09:12 [1] Train Extra: lr=0.0000283 inv=0.4268750 sub=0.0000000
17-03-26 09:10:26 [1] Step: 82200 Acc: 0.70562 0.84962 Cost: 0.94508 0.76031 0.07637 0.10840 Time: 0.00071
17-03-26 09:10:26 [1] Train Extra: lr=0.0000282 inv=0.4087500 sub=0.0000000
17-03-26 09:11:49 [1] Step: 82300 Acc: 0.70562 0.84478 Cost: 1.11603 0.71307 0.29455 0.10841 Time: 0.00075
17-03-26 09:11:49 [1] Train Extra: lr=0.0000281 inv=0.4300000 sub=0.0000000
17-03-26 09:13:03 [1] Step: 82400 Acc: 0.70406 0.84690 Cost: 1.27428 0.91873 0.24705 0.10850 Time: 0.00071
17-03-26 09:13:03 [1] Train Extra: lr=0.0000280 inv=0.3928125 sub=0.0000000
17-03-26 09:14:25 [1] Step: 82500 Acc: 0.70219 0.84778 Cost: 0.88592 0.63877 0.13855 0.10861 Time: 0.00075
17-03-26 09:14:25 [1] Train Extra: lr=0.0000279 inv=0.4326563 sub=0.0000000
17-03-26 09:15:23 [1] Step: 82500 Eval acc: 0.68463 0.85327 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 09:15:23 [1] Eval Extra: inv=0.3927231
17-03-26 09:16:45 [1] Step: 82600 Acc: 0.68719 0.85108 Cost: 0.97927 0.75975 0.11090 0.10861 Time: 0.00075
17-03-26 09:16:45 [1] Train Extra: lr=0.0000279 inv=0.4214062 sub=0.0000000
17-03-26 09:18:15 [1] Step: 82700 Acc: 0.69156 0.85093 Cost: 1.23625 0.92369 0.20398 0.10857 Time: 0.00079
17-03-26 09:18:15 [1] Train Extra: lr=0.0000278 inv=0.4296875 sub=0.0000000
17-03-26 09:19:34 [1] Step: 82800 Acc: 0.69031 0.84383 Cost: 1.04503 0.74337 0.19304 0.10862 Time: 0.00073
17-03-26 09:19:34 [1] Train Extra: lr=0.0000277 inv=0.4240625 sub=0.0000000
17-03-26 09:20:50 [1] Step: 82900 Acc: 0.69469 0.84422 Cost: 0.87122 0.54986 0.21271 0.10865 Time: 0.00069
17-03-26 09:20:50 [1] Train Extra: lr=0.0000276 inv=0.4428125 sub=0.0000000
17-03-26 09:22:13 [1] Step: 83000 Acc: 0.69875 0.85159 Cost: 0.75505 0.56636 0.08004 0.10864 Time: 0.00079
17-03-26 09:22:13 [1] Train Extra: lr=0.0000276 inv=0.4026562 sub=0.0000000
17-03-26 09:23:11 [1] Step: 83000 Eval acc: 0.68606 0.85419 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 09:23:11 [1] Eval Extra: inv=0.4192800
17-03-26 09:24:25 [1] Step: 83100 Acc: 0.68688 0.84559 Cost: 1.11934 0.78530 0.22537 0.10867 Time: 0.00071
17-03-26 09:24:25 [1] Train Extra: lr=0.0000275 inv=0.4076562 sub=0.0000000
17-03-26 09:25:45 [1] Step: 83200 Acc: 0.71062 0.84138 Cost: 1.21262 0.91973 0.18428 0.10861 Time: 0.00071
17-03-26 09:25:45 [1] Train Extra: lr=0.0000274 inv=0.4468750 sub=0.0000000
17-03-26 09:27:18 [1] Step: 83300 Acc: 0.69875 0.85483 Cost: 0.74705 0.52094 0.11749 0.10862 Time: 0.00081
17-03-26 09:27:18 [1] Train Extra: lr=0.0000273 inv=0.4118750 sub=0.0000000
17-03-26 09:28:56 [1] Step: 83400 Acc: 0.69594 0.85186 Cost: 1.18912 0.77363 0.30686 0.10863 Time: 0.00080
17-03-26 09:28:56 [1] Train Extra: lr=0.0000272 inv=0.4300000 sub=0.0000000
17-03-26 09:30:21 [1] Step: 83500 Acc: 0.69000 0.84910 Cost: 1.14854 0.82052 0.21932 0.10870 Time: 0.00075
17-03-26 09:30:21 [1] Train Extra: lr=0.0000272 inv=0.4310937 sub=0.0000000
17-03-26 09:31:18 [1] Step: 83500 Eval acc: 0.68154 0.84777 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 09:31:18 [1] Eval Extra: inv=0.4320892
17-03-26 09:32:39 [1] Step: 83600 Acc: 0.70813 0.84694 Cost: 1.05502 0.67197 0.27428 0.10878 Time: 0.00074
17-03-26 09:32:39 [1] Train Extra: lr=0.0000271 inv=0.4375000 sub=0.0000000
17-03-26 09:34:00 [1] Step: 83700 Acc: 0.71594 0.85159 Cost: 0.84557 0.59592 0.14080 0.10884 Time: 0.00076
17-03-26 09:34:00 [1] Train Extra: lr=0.0000270 inv=0.3873437 sub=0.0000000
17-03-26 09:35:26 [1] Step: 83800 Acc: 0.68469 0.85953 Cost: 1.01866 0.71020 0.19953 0.10893 Time: 0.00077
17-03-26 09:35:26 [1] Train Extra: lr=0.0000269 inv=0.4007812 sub=0.0000000
17-03-26 09:36:47 [1] Step: 83900 Acc: 0.69656 0.85442 Cost: 0.96274 0.64822 0.20551 0.10902 Time: 0.00076
17-03-26 09:36:47 [1] Train Extra: lr=0.0000268 inv=0.3964063 sub=0.0000000
17-03-26 09:38:14 [1] Step: 84000 Acc: 0.72250 0.84483 Cost: 1.01148 0.62963 0.27274 0.10911 Time: 0.00076
17-03-26 09:38:14 [1] Train Extra: lr=0.0000268 inv=0.4257812 sub=0.0000000
17-03-26 09:39:13 [1] Step: 84000 Eval acc: 0.68242 0.85486 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 09:39:13 [1] Eval Extra: inv=0.3945451
17-03-26 09:40:38 [1] Step: 84100 Acc: 0.72188 0.84735 Cost: 0.99675 0.65744 0.23001 0.10930 Time: 0.00075
17-03-26 09:40:38 [1] Train Extra: lr=0.0000267 inv=0.4201563 sub=0.0000000
17-03-26 09:42:02 [1] Step: 84200 Acc: 0.71281 0.84976 Cost: 0.92896 0.65956 0.15995 0.10945 Time: 0.00076
17-03-26 09:42:02 [1] Train Extra: lr=0.0000266 inv=0.4198438 sub=0.0000000
17-03-26 09:43:30 [1] Step: 84300 Acc: 0.72656 0.85151 Cost: 0.95881 0.64590 0.20342 0.10950 Time: 0.00076
17-03-26 09:43:30 [1] Train Extra: lr=0.0000265 inv=0.4437500 sub=0.0000000
17-03-26 09:44:49 [1] Step: 84400 Acc: 0.72750 0.84924 Cost: 0.65595 0.48007 0.06629 0.10959 Time: 0.00074
17-03-26 09:44:49 [1] Train Extra: lr=0.0000265 inv=0.4050000 sub=0.0000000
17-03-26 09:46:07 [1] Step: 84500 Acc: 0.71719 0.84439 Cost: 1.05719 0.67371 0.27374 0.10974 Time: 0.00072
17-03-26 09:46:07 [1] Train Extra: lr=0.0000264 inv=0.4357813 sub=0.0000000
17-03-26 09:47:03 [1] Step: 84500 Eval acc: 0.68198 0.85510 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 09:47:03 [1] Eval Extra: inv=0.4252981
17-03-26 09:48:29 [1] Step: 84600 Acc: 0.72250 0.85145 Cost: 0.97932 0.65326 0.21623 0.10983 Time: 0.00076
17-03-26 09:48:29 [1] Train Extra: lr=0.0000263 inv=0.4239062 sub=0.0000000
17-03-26 09:49:43 [1] Step: 84700 Acc: 0.71937 0.85074 Cost: 0.76434 0.45183 0.20249 0.11002 Time: 0.00072
17-03-26 09:49:43 [1] Train Extra: lr=0.0000262 inv=0.4153125 sub=0.0000000
17-03-26 09:51:08 [1] Step: 84800 Acc: 0.73062 0.84900 Cost: 0.96308 0.62425 0.22871 0.11012 Time: 0.00078
17-03-26 09:51:08 [1] Train Extra: lr=0.0000262 inv=0.4143750 sub=0.0000000
17-03-26 09:52:31 [1] Step: 84900 Acc: 0.71156 0.84967 Cost: 1.11947 0.73788 0.27141 0.11018 Time: 0.00073
17-03-26 09:52:31 [1] Train Extra: lr=0.0000261 inv=0.4273438 sub=0.0000000
17-03-26 09:53:48 [1] Step: 85000 Acc: 0.71437 0.84899 Cost: 1.09661 0.73773 0.24863 0.11025 Time: 0.00073
17-03-26 09:53:48 [1] Train Extra: lr=0.0000260 inv=0.4510938 sub=0.0000000
17-03-26 09:54:46 [1] Step: 85000 Eval acc: 0.68297 0.85295 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 09:54:46 [1] Eval Extra: inv=0.4112191
17-03-26 09:54:46 [1] Checkpointing.
17-03-26 09:56:05 [1] Step: 85100 Acc: 0.71156 0.84629 Cost: 0.87608 0.65275 0.11299 0.11033 Time: 0.00073
17-03-26 09:56:05 [1] Train Extra: lr=0.0000259 inv=0.4184375 sub=0.0000000
17-03-26 09:57:26 [1] Step: 85200 Acc: 0.71313 0.85635 Cost: 1.22315 0.84657 0.26604 0.11054 Time: 0.00076
17-03-26 09:57:26 [1] Train Extra: lr=0.0000259 inv=0.3837500 sub=0.0000000
17-03-26 09:58:50 [1] Step: 85300 Acc: 0.70969 0.84875 Cost: 1.15367 0.74424 0.29882 0.11061 Time: 0.00076
17-03-26 09:58:50 [1] Train Extra: lr=0.0000258 inv=0.4212500 sub=0.0000000
17-03-26 10:00:09 [1] Step: 85400 Acc: 0.70844 0.84950 Cost: 1.07045 0.81620 0.14354 0.11071 Time: 0.00074
17-03-26 10:00:09 [1] Train Extra: lr=0.0000257 inv=0.4215625 sub=0.0000000
17-03-26 10:01:34 [1] Step: 85500 Acc: 0.72062 0.85790 Cost: 0.74735 0.54153 0.09505 0.11077 Time: 0.00077
17-03-26 10:01:34 [1] Train Extra: lr=0.0000256 inv=0.4292187 sub=0.0000000
17-03-26 10:02:32 [1] Step: 85500 Eval acc: 0.68739 0.84982 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 10:02:32 [1] Eval Extra: inv=0.4147527
17-03-26 10:03:47 [1] Step: 85600 Acc: 0.71750 0.85213 Cost: 0.72843 0.42565 0.19187 0.11091 Time: 0.00071
17-03-26 10:03:47 [1] Train Extra: lr=0.0000256 inv=0.4121875 sub=0.0000000
17-03-26 10:05:11 [1] Step: 85700 Acc: 0.71094 0.84474 Cost: 1.01776 0.70821 0.19855 0.11100 Time: 0.00075
17-03-26 10:05:11 [1] Train Extra: lr=0.0000255 inv=0.4507813 sub=0.0000000
17-03-26 10:06:30 [1] Step: 85800 Acc: 0.71250 0.85531 Cost: 1.08113 0.83918 0.13089 0.11106 Time: 0.00077
17-03-26 10:06:30 [1] Train Extra: lr=0.0000254 inv=0.3979687 sub=0.0000000
17-03-26 10:07:53 [1] Step: 85900 Acc: 0.72875 0.84834 Cost: 1.34012 0.83977 0.38918 0.11118 Time: 0.00074
17-03-26 10:07:53 [1] Train Extra: lr=0.0000253 inv=0.4026562 sub=0.0000000
17-03-26 10:09:17 [1] Step: 86000 Acc: 0.70312 0.84811 Cost: 1.01289 0.67718 0.22443 0.11128 Time: 0.00076
17-03-26 10:09:17 [1] Train Extra: lr=0.0000253 inv=0.4343750 sub=0.0000000
17-03-26 10:10:15 [1] Step: 86000 Eval acc: 0.67745 0.84924 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 10:10:15 [1] Eval Extra: inv=0.4193905
17-03-26 10:11:29 [1] Step: 86100 Acc: 0.72375 0.85302 Cost: 0.93516 0.65525 0.16855 0.11136 Time: 0.00074
17-03-26 10:11:29 [1] Train Extra: lr=0.0000252 inv=0.3935938 sub=0.0000000
17-03-26 10:12:55 [1] Step: 86200 Acc: 0.71937 0.84939 Cost: 0.99926 0.72987 0.15794 0.11145 Time: 0.00077
17-03-26 10:12:55 [1] Train Extra: lr=0.0000251 inv=0.4009375 sub=0.0000000
17-03-26 10:14:20 [1] Step: 86300 Acc: 0.71594 0.85000 Cost: 1.06625 0.71625 0.23845 0.11154 Time: 0.00076
17-03-26 10:14:20 [1] Train Extra: lr=0.0000251 inv=0.4181250 sub=0.0000000
17-03-26 10:15:39 [1] Step: 86400 Acc: 0.71844 0.84865 Cost: 0.96780 0.56571 0.29045 0.11164 Time: 0.00074
17-03-26 10:15:39 [1] Train Extra: lr=0.0000250 inv=0.3987500 sub=0.0000000
17-03-26 10:16:58 [1] Step: 86500 Acc: 0.69437 0.84191 Cost: 0.93708 0.57560 0.24976 0.11172 Time: 0.00073
17-03-26 10:16:58 [1] Train Extra: lr=0.0000249 inv=0.4101562 sub=0.0000000
17-03-26 10:17:56 [1] Step: 86500 Eval acc: 0.68110 0.85205 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 10:17:56 [1] Eval Extra: inv=0.4129859
17-03-26 10:19:23 [1] Step: 86600 Acc: 0.69969 0.84795 Cost: 1.16826 0.89478 0.16175 0.11173 Time: 0.00076
17-03-26 10:19:23 [1] Train Extra: lr=0.0000248 inv=0.4409375 sub=0.0000000
17-03-26 10:20:54 [1] Step: 86700 Acc: 0.71188 0.84870 Cost: 1.05852 0.72647 0.22022 0.11183 Time: 0.00077
17-03-26 10:20:54 [1] Train Extra: lr=0.0000248 inv=0.4348437 sub=0.0000000
17-03-26 10:22:14 [1] Step: 86800 Acc: 0.69500 0.84850 Cost: 0.78936 0.52534 0.15212 0.11191 Time: 0.00072
17-03-26 10:22:14 [1] Train Extra: lr=0.0000247 inv=0.4293750 sub=0.0000000
17-03-26 10:23:38 [1] Step: 86900 Acc: 0.71750 0.85792 Cost: 1.01819 0.67083 0.23536 0.11200 Time: 0.00078
17-03-26 10:23:38 [1] Train Extra: lr=0.0000246 inv=0.4203125 sub=0.0000000
17-03-26 10:25:10 [1] Step: 87000 Acc: 0.70375 0.85507 Cost: 0.79008 0.51571 0.16229 0.11208 Time: 0.00079
17-03-26 10:25:10 [1] Train Extra: lr=0.0000246 inv=0.4056250 sub=0.0000000
17-03-26 10:26:09 [1] Step: 87000 Eval acc: 0.68275 0.85064 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 10:26:09 [1] Eval Extra: inv=0.4057531
17-03-26 10:27:36 [1] Step: 87100 Acc: 0.71313 0.84855 Cost: 0.85306 0.54663 0.19432 0.11212 Time: 0.00076
17-03-26 10:27:36 [1] Train Extra: lr=0.0000245 inv=0.4312500 sub=0.0000000
17-03-26 10:28:59 [1] Step: 87200 Acc: 0.70531 0.85232 Cost: 1.01663 0.65523 0.24921 0.11219 Time: 0.00077
17-03-26 10:28:59 [1] Train Extra: lr=0.0000244 inv=0.3921875 sub=0.0000000
17-03-26 10:30:17 [1] Step: 87300 Acc: 0.70469 0.84534 Cost: 1.11595 0.77171 0.23209 0.11216 Time: 0.00073
17-03-26 10:30:17 [1] Train Extra: lr=0.0000243 inv=0.4106250 sub=0.0000000
17-03-26 10:31:25 [1] Step: 87400 Acc: 0.72375 0.85445 Cost: 0.79195 0.52119 0.15855 0.11221 Time: 0.00068
17-03-26 10:31:25 [1] Train Extra: lr=0.0000243 inv=0.4039063 sub=0.0000000
17-03-26 10:32:43 [1] Step: 87500 Acc: 0.70031 0.84648 Cost: 1.04135 0.76795 0.16114 0.11226 Time: 0.00072
17-03-26 10:32:43 [1] Train Extra: lr=0.0000242 inv=0.4140625 sub=0.0000000
17-03-26 10:33:42 [1] Step: 87500 Eval acc: 0.68209 0.85162 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 10:33:42 [1] Eval Extra: inv=0.4538428
17-03-26 10:35:12 [1] Step: 87600 Acc: 0.70719 0.85138 Cost: 0.90494 0.67831 0.11433 0.11230 Time: 0.00077
17-03-26 10:35:12 [1] Train Extra: lr=0.0000241 inv=0.4543750 sub=0.0000000
17-03-26 10:36:36 [1] Step: 87700 Acc: 0.70375 0.85546 Cost: 0.97089 0.68348 0.17505 0.11236 Time: 0.00077
17-03-26 10:36:36 [1] Train Extra: lr=0.0000241 inv=0.4204688 sub=0.0000000
17-03-26 10:37:50 [1] Step: 87800 Acc: 0.71562 0.84849 Cost: 0.86987 0.64353 0.11390 0.11245 Time: 0.00072
17-03-26 10:37:50 [1] Train Extra: lr=0.0000240 inv=0.4023438 sub=0.0000000
17-03-26 10:39:10 [1] Step: 87900 Acc: 0.70375 0.84529 Cost: 0.93483 0.56150 0.26069 0.11263 Time: 0.00072
17-03-26 10:39:10 [1] Train Extra: lr=0.0000239 inv=0.4353125 sub=0.0000000
17-03-26 10:40:29 [1] Step: 88000 Acc: 0.70219 0.84371 Cost: 0.80507 0.46720 0.22514 0.11273 Time: 0.00073
17-03-26 10:40:29 [1] Train Extra: lr=0.0000239 inv=0.4103125 sub=0.0000000
17-03-26 10:41:26 [1] Step: 88000 Eval acc: 0.68110 0.85150 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 10:41:26 [1] Eval Extra: inv=0.4178445
17-03-26 10:42:44 [1] Step: 88100 Acc: 0.70281 0.86343 Cost: 0.81165 0.54445 0.15438 0.11281 Time: 0.00075
17-03-26 10:42:44 [1] Train Extra: lr=0.0000238 inv=0.4006250 sub=0.0000000
17-03-26 10:44:03 [1] Step: 88200 Acc: 0.70719 0.85278 Cost: 0.97609 0.65393 0.20935 0.11281 Time: 0.00075
17-03-26 10:44:03 [1] Train Extra: lr=0.0000237 inv=0.4160937 sub=0.0000000
17-03-26 10:45:27 [1] Step: 88300 Acc: 0.70594 0.84150 Cost: 0.89622 0.60384 0.17953 0.11285 Time: 0.00074
17-03-26 10:45:27 [1] Train Extra: lr=0.0000237 inv=0.4448437 sub=0.0000000
17-03-26 10:46:52 [1] Step: 88400 Acc: 0.69875 0.84731 Cost: 0.96144 0.63496 0.21367 0.11281 Time: 0.00076
17-03-26 10:46:52 [1] Train Extra: lr=0.0000236 inv=0.4376563 sub=0.0000000
17-03-26 10:48:21 [1] Step: 88500 Acc: 0.69500 0.85429 Cost: 1.17986 0.83921 0.22784 0.11281 Time: 0.00077
17-03-26 10:48:21 [1] Train Extra: lr=0.0000235 inv=0.4206250 sub=0.0000000
17-03-26 10:49:18 [1] Step: 88500 Eval acc: 0.68253 0.85341 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 10:49:18 [1] Eval Extra: inv=0.4179549
17-03-26 10:50:37 [1] Step: 88600 Acc: 0.71344 0.85260 Cost: 1.26296 0.79630 0.35374 0.11292 Time: 0.00075
17-03-26 10:50:37 [1] Train Extra: lr=0.0000235 inv=0.4048438 sub=0.0000000
17-03-26 10:52:04 [1] Step: 88700 Acc: 0.70813 0.84860 Cost: 0.94278 0.62733 0.20247 0.11298 Time: 0.00076
17-03-26 10:52:04 [1] Train Extra: lr=0.0000234 inv=0.4173438 sub=0.0000000
17-03-26 10:53:28 [1] Step: 88800 Acc: 0.68750 0.84872 Cost: 0.95571 0.66947 0.17323 0.11301 Time: 0.00075
17-03-26 10:53:28 [1] Train Extra: lr=0.0000233 inv=0.4129687 sub=0.0000000
17-03-26 10:54:52 [1] Step: 88900 Acc: 0.69812 0.84895 Cost: 1.13129 0.76536 0.25286 0.11307 Time: 0.00076
17-03-26 10:54:52 [1] Train Extra: lr=0.0000232 inv=0.4300000 sub=0.0000000
17-03-26 10:56:07 [1] Step: 89000 Acc: 0.71562 0.85273 Cost: 1.03618 0.64408 0.27903 0.11307 Time: 0.00073
17-03-26 10:56:07 [1] Train Extra: lr=0.0000232 inv=0.4295312 sub=0.0000000
17-03-26 10:57:05 [1] Step: 89000 Eval acc: 0.68187 0.85283 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 10:57:05 [1] Eval Extra: inv=0.4598609
17-03-26 10:58:23 [1] Step: 89100 Acc: 0.69094 0.84584 Cost: 0.95233 0.63337 0.20575 0.11321 Time: 0.00074
17-03-26 10:58:23 [1] Train Extra: lr=0.0000231 inv=0.4309375 sub=0.0000000
17-03-26 10:59:43 [1] Step: 89200 Acc: 0.69188 0.84696 Cost: 0.86288 0.52070 0.22891 0.11327 Time: 0.00073
17-03-26 10:59:43 [1] Train Extra: lr=0.0000230 inv=0.4373437 sub=0.0000000
17-03-26 11:00:56 [1] Step: 89300 Acc: 0.72313 0.85657 Cost: 0.85144 0.61385 0.12420 0.11339 Time: 0.00074
17-03-26 11:00:56 [1] Train Extra: lr=0.0000230 inv=0.3909375 sub=0.0000000
17-03-26 11:02:20 [1] Step: 89400 Acc: 0.68969 0.85109 Cost: 0.98784 0.57172 0.30271 0.11341 Time: 0.00078
17-03-26 11:02:20 [1] Train Extra: lr=0.0000229 inv=0.4206250 sub=0.0000000
17-03-26 11:03:47 [1] Step: 89500 Acc: 0.70813 0.83963 Cost: 0.84661 0.53048 0.20262 0.11351 Time: 0.00074
17-03-26 11:03:47 [1] Train Extra: lr=0.0000229 inv=0.4610938 sub=0.0000000
17-03-26 11:04:43 [1] Step: 89500 Eval acc: 0.68286 0.85521 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 11:04:43 [1] Eval Extra: inv=0.4361197
17-03-26 11:06:03 [1] Step: 89600 Acc: 0.70469 0.84896 Cost: 1.27901 0.92933 0.23612 0.11356 Time: 0.00073
17-03-26 11:06:03 [1] Train Extra: lr=0.0000228 inv=0.4289062 sub=0.0000000
17-03-26 11:07:34 [1] Step: 89700 Acc: 0.69375 0.84161 Cost: 1.32575 0.96257 0.24963 0.11355 Time: 0.00076
17-03-26 11:07:34 [1] Train Extra: lr=0.0000227 inv=0.4754687 sub=0.0000000
17-03-26 11:08:53 [1] Step: 89800 Acc: 0.69469 0.84609 Cost: 1.04761 0.68821 0.24586 0.11354 Time: 0.00072
17-03-26 11:08:53 [1] Train Extra: lr=0.0000227 inv=0.4392187 sub=0.0000000
17-03-26 11:10:22 [1] Step: 89900 Acc: 0.71656 0.85186 Cost: 0.85801 0.48174 0.26276 0.11351 Time: 0.00076
17-03-26 11:10:22 [1] Train Extra: lr=0.0000226 inv=0.4306250 sub=0.0000000
17-03-26 11:11:40 [1] Step: 90000 Acc: 0.71031 0.83923 Cost: 0.99669 0.71075 0.17245 0.11349 Time: 0.00072
17-03-26 11:11:40 [1] Train Extra: lr=0.0000225 inv=0.4454688 sub=0.0000000
17-03-26 11:12:38 [1] Step: 90000 Eval acc: 0.68441 0.84982 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 11:12:38 [1] Eval Extra: inv=0.4675905
17-03-26 11:12:38 [1] Checkpointing.
17-03-26 11:14:02 [1] Step: 90100 Acc: 0.70219 0.84799 Cost: 0.99906 0.71455 0.17097 0.11354 Time: 0.00077
17-03-26 11:14:02 [1] Train Extra: lr=0.0000225 inv=0.4460938 sub=0.0000000
17-03-26 11:15:29 [1] Step: 90200 Acc: 0.69063 0.84759 Cost: 1.04752 0.68657 0.24740 0.11355 Time: 0.00075
17-03-26 11:15:29 [1] Train Extra: lr=0.0000224 inv=0.4570312 sub=0.0000000
17-03-26 11:16:55 [1] Step: 90300 Acc: 0.69875 0.84785 Cost: 1.17156 0.80981 0.24817 0.11358 Time: 0.00075
17-03-26 11:16:55 [1] Train Extra: lr=0.0000223 inv=0.4581250 sub=0.0000000
17-03-26 11:18:23 [1] Step: 90400 Acc: 0.71156 0.84784 Cost: 0.96261 0.59615 0.25275 0.11371 Time: 0.00075
17-03-26 11:18:23 [1] Train Extra: lr=0.0000223 inv=0.4390625 sub=0.0000000
17-03-26 11:19:46 [1] Step: 90500 Acc: 0.70125 0.84906 Cost: 1.08227 0.74768 0.22076 0.11383 Time: 0.00077
17-03-26 11:19:46 [1] Train Extra: lr=0.0000222 inv=0.4243750 sub=0.0000000
17-03-26 11:20:44 [1] Step: 90500 Eval acc: 0.68629 0.85013 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 11:20:44 [1] Eval Extra: inv=0.4301016
17-03-26 11:22:07 [1] Step: 90600 Acc: 0.69844 0.84930 Cost: 1.06466 0.76637 0.18451 0.11378 Time: 0.00074
17-03-26 11:22:07 [1] Train Extra: lr=0.0000221 inv=0.4273438 sub=0.0000000
17-03-26 11:23:26 [1] Step: 90700 Acc: 0.69719 0.85203 Cost: 1.07622 0.77208 0.19035 0.11379 Time: 0.00076
17-03-26 11:23:26 [1] Train Extra: lr=0.0000221 inv=0.4120313 sub=0.0000000
17-03-26 11:24:49 [1] Step: 90800 Acc: 0.69719 0.85461 Cost: 1.00388 0.62657 0.26352 0.11379 Time: 0.00077
17-03-26 11:24:49 [1] Train Extra: lr=0.0000220 inv=0.4250000 sub=0.0000000
17-03-26 11:26:12 [1] Step: 90900 Acc: 0.69125 0.85967 Cost: 0.90685 0.60429 0.18873 0.11383 Time: 0.00076
17-03-26 11:26:12 [1] Train Extra: lr=0.0000219 inv=0.4057812 sub=0.0000000
17-03-26 11:27:40 [1] Step: 91000 Acc: 0.68688 0.84780 Cost: 0.98904 0.61291 0.26228 0.11385 Time: 0.00076
17-03-26 11:27:40 [1] Train Extra: lr=0.0000219 inv=0.4470312 sub=0.0000000
17-03-26 11:28:37 [1] Step: 91000 Eval acc: 0.68198 0.85722 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 11:28:37 [1] Eval Extra: inv=0.3917845
17-03-26 11:30:04 [1] Step: 91100 Acc: 0.70188 0.85626 Cost: 1.03079 0.82786 0.08910 0.11383 Time: 0.00077
17-03-26 11:30:04 [1] Train Extra: lr=0.0000218 inv=0.4334375 sub=0.0000000
17-03-26 11:31:36 [1] Step: 91200 Acc: 0.70250 0.85894 Cost: 1.37868 0.94458 0.32026 0.11384 Time: 0.00079
17-03-26 11:31:36 [1] Train Extra: lr=0.0000218 inv=0.4409375 sub=0.0000000
17-03-26 11:33:03 [1] Step: 91300 Acc: 0.69531 0.84914 Cost: 1.00101 0.60316 0.28393 0.11391 Time: 0.00075
17-03-26 11:33:03 [1] Train Extra: lr=0.0000217 inv=0.4392187 sub=0.0000000
17-03-26 11:34:23 [1] Step: 91400 Acc: 0.69219 0.85332 Cost: 1.08767 0.70155 0.27222 0.11390 Time: 0.00075
17-03-26 11:34:23 [1] Train Extra: lr=0.0000216 inv=0.4103125 sub=0.0000000
17-03-26 11:35:47 [1] Step: 91500 Acc: 0.71437 0.86119 Cost: 1.17474 0.75814 0.30270 0.11389 Time: 0.00081
17-03-26 11:35:47 [1] Train Extra: lr=0.0000216 inv=0.3953125 sub=0.0000000
17-03-26 11:36:42 [1] Step: 91500 Eval acc: 0.68860 0.85237 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-26 11:36:42 [1] Eval Extra: inv=0.4379417
17-03-26 11:38:11 [1] Step: 91600 Acc: 0.70250 0.85265 Cost: 0.96665 0.66462 0.18805 0.11398 Time: 0.00077
17-03-26 11:38:11 [1] Train Extra: lr=0.0000215 inv=0.4192187 sub=0.0000000
17-03-26 11:39:36 [1] Step: 91700 Acc: 0.70719 0.85076 Cost: 0.72470 0.47573 0.13495 0.11402 Time: 0.00075
17-03-26 11:39:36 [1] Train Extra: lr=0.0000215 inv=0.4246875 sub=0.0000000
17-03-26 11:40:49 [1] Step: 91800 Acc: 0.70437 0.84622 Cost: 0.85113 0.53520 0.20195 0.11398 Time: 0.00071
17-03-26 11:40:49 [1] Train Extra: lr=0.0000214 inv=0.3960938 sub=0.0000000
17-03-26 11:42:16 [1] Step: 91900 Acc: 0.69312 0.85347 Cost: 1.15725 0.78156 0.26168 0.11400 Time: 0.00076
17-03-26 11:42:16 [1] Train Extra: lr=0.0000213 inv=0.4220313 sub=0.0000000
17-03-26 11:43:42 [1] Step: 92000 Acc: 0.68906 0.84505 Cost: 1.46213 1.02260 0.32559 0.11394 Time: 0.00074
17-03-26 11:43:42 [1] Train Extra: lr=0.0000213 inv=0.4356250 sub=0.0000000
17-03-26 11:44:40 [1] Step: 92000 Eval acc: 0.68209 0.85657 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 11:44:40 [1] Eval Extra: inv=0.4027164
17-03-26 11:45:53 [1] Step: 92100 Acc: 0.70281 0.85027 Cost: 0.97927 0.60475 0.26053 0.11399 Time: 0.00072
17-03-26 11:45:53 [1] Train Extra: lr=0.0000212 inv=0.4128125 sub=0.0000000
17-03-26 11:47:12 [1] Step: 92200 Acc: 0.70312 0.85108 Cost: 1.19979 0.79706 0.28872 0.11400 Time: 0.00074
17-03-26 11:47:12 [1] Train Extra: lr=0.0000211 inv=0.4192187 sub=0.0000000
17-03-26 11:48:34 [1] Step: 92300 Acc: 0.71000 0.84783 Cost: 0.93438 0.66550 0.15478 0.11411 Time: 0.00076
17-03-26 11:48:34 [1] Train Extra: lr=0.0000211 inv=0.4156250 sub=0.0000000
17-03-26 11:49:54 [1] Step: 92400 Acc: 0.72625 0.84986 Cost: 1.03168 0.68592 0.23153 0.11423 Time: 0.00073
17-03-26 11:49:54 [1] Train Extra: lr=0.0000210 inv=0.4100000 sub=0.0000000
17-03-26 11:51:18 [1] Step: 92500 Acc: 0.74156 0.84447 Cost: 1.23142 0.91199 0.20504 0.11439 Time: 0.00075
17-03-26 11:51:18 [1] Train Extra: lr=0.0000210 inv=0.4364062 sub=0.0000000
17-03-26 11:52:16 [1] Step: 92500 Eval acc: 0.68485 0.85408 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 11:52:16 [1] Eval Extra: inv=0.3852142
17-03-26 11:53:36 [1] Step: 92600 Acc: 0.73031 0.84355 Cost: 0.88982 0.54241 0.23283 0.11459 Time: 0.00072
17-03-26 11:53:36 [1] Train Extra: lr=0.0000209 inv=0.4450000 sub=0.0000000
17-03-26 11:55:00 [1] Step: 92700 Acc: 0.74250 0.84431 Cost: 0.69150 0.48444 0.09237 0.11469 Time: 0.00075
17-03-26 11:55:00 [1] Train Extra: lr=0.0000208 inv=0.4412500 sub=0.0000000
17-03-26 11:56:15 [1] Step: 92800 Acc: 0.72000 0.85143 Cost: 1.13957 0.78834 0.23643 0.11480 Time: 0.00072
17-03-26 11:56:15 [1] Train Extra: lr=0.0000208 inv=0.4193750 sub=0.0000000
17-03-26 11:57:40 [1] Step: 92900 Acc: 0.72062 0.85171 Cost: 0.99603 0.67291 0.20824 0.11487 Time: 0.00078
17-03-26 11:57:40 [1] Train Extra: lr=0.0000207 inv=0.4389062 sub=0.0000000
17-03-26 11:58:59 [1] Step: 93000 Acc: 0.73469 0.84702 Cost: 1.02164 0.66461 0.24208 0.11496 Time: 0.00073
17-03-26 11:58:59 [1] Train Extra: lr=0.0000207 inv=0.4318750 sub=0.0000000
17-03-26 11:59:57 [1] Step: 93000 Eval acc: 0.68286 0.85280 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 11:59:57 [1] Eval Extra: inv=0.4252981
17-03-26 12:01:11 [1] Step: 93100 Acc: 0.71344 0.85395 Cost: 0.92470 0.65001 0.15968 0.11501 Time: 0.00071
17-03-26 12:01:11 [1] Train Extra: lr=0.0000206 inv=0.4067188 sub=0.0000000
17-03-26 12:02:24 [1] Step: 93200 Acc: 0.71375 0.85276 Cost: 0.81497 0.52742 0.17242 0.11512 Time: 0.00072
17-03-26 12:02:24 [1] Train Extra: lr=0.0000205 inv=0.4204688 sub=0.0000000
17-03-26 12:03:45 [1] Step: 93300 Acc: 0.70500 0.85530 Cost: 0.83765 0.57129 0.15115 0.11521 Time: 0.00076
17-03-26 12:03:45 [1] Train Extra: lr=0.0000205 inv=0.4134375 sub=0.0000000
17-03-26 12:05:01 [1] Step: 93400 Acc: 0.73813 0.85044 Cost: 1.12985 0.78541 0.22921 0.11523 Time: 0.00075
17-03-26 12:05:01 [1] Train Extra: lr=0.0000204 inv=0.4071875 sub=0.0000000
17-03-26 12:06:27 [1] Step: 93500 Acc: 0.72250 0.84336 Cost: 1.00302 0.73975 0.14786 0.11541 Time: 0.00075
17-03-26 12:06:27 [1] Train Extra: lr=0.0000204 inv=0.4385938 sub=0.0000000
17-03-26 12:07:25 [1] Step: 93500 Eval acc: 0.68507 0.85749 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 12:07:25 [1] Eval Extra: inv=0.4400950
17-03-26 12:08:50 [1] Step: 93600 Acc: 0.71500 0.84004 Cost: 0.85136 0.51348 0.22238 0.11550 Time: 0.00073
17-03-26 12:08:50 [1] Train Extra: lr=0.0000203 inv=0.4468750 sub=0.0000000
17-03-26 12:10:10 [1] Step: 93700 Acc: 0.72375 0.84540 Cost: 0.91460 0.57332 0.22569 0.11560 Time: 0.00074
17-03-26 12:10:10 [1] Train Extra: lr=0.0000203 inv=0.4039063 sub=0.0000000
17-03-26 12:11:30 [1] Step: 93800 Acc: 0.71750 0.85026 Cost: 0.78550 0.56592 0.10391 0.11567 Time: 0.00074
17-03-26 12:11:30 [1] Train Extra: lr=0.0000202 inv=0.4560938 sub=0.0000000
17-03-26 12:12:45 [1] Step: 93900 Acc: 0.72656 0.84931 Cost: 1.29740 0.92167 0.25994 0.11579 Time: 0.00072
17-03-26 12:12:45 [1] Train Extra: lr=0.0000201 inv=0.4231250 sub=0.0000000
17-03-26 12:14:09 [1] Step: 94000 Acc: 0.70625 0.84994 Cost: 1.08568 0.70355 0.26624 0.11589 Time: 0.00076
17-03-26 12:14:09 [1] Train Extra: lr=0.0000201 inv=0.4171875 sub=0.0000000
17-03-26 12:15:07 [1] Step: 94000 Eval acc: 0.68021 0.85103 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 12:15:07 [1] Eval Extra: inv=0.4323101
17-03-26 12:16:32 [1] Step: 94100 Acc: 0.71375 0.84620 Cost: 1.04526 0.78976 0.13955 0.11596 Time: 0.00076
17-03-26 12:16:32 [1] Train Extra: lr=0.0000200 inv=0.4242187 sub=0.0000000
17-03-26 12:17:49 [1] Step: 94200 Acc: 0.71781 0.85468 Cost: 1.01897 0.69613 0.20684 0.11600 Time: 0.00073
17-03-26 12:17:49 [1] Train Extra: lr=0.0000200 inv=0.3976562 sub=0.0000000
17-03-26 12:19:22 [1] Step: 94300 Acc: 0.71125 0.85240 Cost: 0.90087 0.54127 0.24353 0.11606 Time: 0.00079
17-03-26 12:19:22 [1] Train Extra: lr=0.0000199 inv=0.4446875 sub=0.0000000
17-03-26 12:20:41 [1] Step: 94400 Acc: 0.72937 0.84851 Cost: 1.02956 0.74147 0.17191 0.11618 Time: 0.00075
17-03-26 12:20:41 [1] Train Extra: lr=0.0000198 inv=0.4037500 sub=0.0000000
17-03-26 12:22:00 [1] Step: 94500 Acc: 0.72469 0.84673 Cost: 0.80228 0.56033 0.12571 0.11624 Time: 0.00072
17-03-26 12:22:00 [1] Train Extra: lr=0.0000198 inv=0.4267187 sub=0.0000000
17-03-26 12:22:57 [1] Step: 94500 Eval acc: 0.68706 0.85134 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 12:22:57 [1] Eval Extra: inv=0.4103357
17-03-26 12:24:21 [1] Step: 94600 Acc: 0.71031 0.84923 Cost: 1.06681 0.87455 0.07595 0.11631 Time: 0.00075
17-03-26 12:24:21 [1] Train Extra: lr=0.0000197 inv=0.4040625 sub=0.0000000
17-03-26 12:25:49 [1] Step: 94700 Acc: 0.71313 0.84818 Cost: 1.31267 0.90627 0.28999 0.11640 Time: 0.00076
17-03-26 12:25:49 [1] Train Extra: lr=0.0000197 inv=0.4248438 sub=0.0000000
17-03-26 12:27:12 [1] Step: 94800 Acc: 0.72437 0.84984 Cost: 0.92969 0.72062 0.09254 0.11653 Time: 0.00078
17-03-26 12:27:12 [1] Train Extra: lr=0.0000196 inv=0.4007812 sub=0.0000000
17-03-26 12:28:26 [1] Step: 94900 Acc: 0.72594 0.85609 Cost: 1.19047 0.79627 0.27759 0.11661 Time: 0.00074
17-03-26 12:28:26 [1] Train Extra: lr=0.0000196 inv=0.4050000 sub=0.0000000
17-03-26 12:29:51 [1] Step: 95000 Acc: 0.71219 0.84823 Cost: 1.22412 0.84653 0.26086 0.11673 Time: 0.00076
17-03-26 12:29:51 [1] Train Extra: lr=0.0000195 inv=0.4265625 sub=0.0000000
17-03-26 12:30:46 [1] Step: 95000 Eval acc: 0.68463 0.85549 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-26 12:30:46 [1] Eval Extra: inv=0.4149183
17-03-26 12:30:46 [1] Checkpointing.
17-03-26 12:32:04 [1] Step: 95100 Acc: 0.72375 0.84297 Cost: 0.93966 0.64123 0.18169 0.11674 Time: 0.00073
17-03-26 12:32:04 [1] Train Extra: lr=0.0000195 inv=0.4104687 sub=0.0000000
17-03-26 12:33:30 [1] Step: 95200 Acc: 0.71625 0.84076 Cost: 1.22881 0.77848 0.33355 0.11678 Time: 0.00074
17-03-26 12:33:30 [1] Train Extra: lr=0.0000194 inv=0.4359375 sub=0.0000000
17-03-26 12:34:49 [1] Step: 95300 Acc: 0.69750 0.85096 Cost: 0.88927 0.60305 0.16939 0.11683 Time: 0.00073
17-03-26 12:34:49 [1] Train Extra: lr=0.0000193 inv=0.4095313 sub=0.0000000
17-03-26 12:36:15 [1] Step: 95400 Acc: 0.71906 0.85359 Cost: 0.97702 0.57449 0.28560 0.11693 Time: 0.00077
17-03-26 12:36:15 [1] Train Extra: lr=0.0000193 inv=0.4268750 sub=0.0000000
17-03-26 12:37:41 [1] Step: 95500 Acc: 0.71688 0.85067 Cost: 0.87182 0.55570 0.19908 0.11704 Time: 0.00076
17-03-26 12:37:41 [1] Train Extra: lr=0.0000192 inv=0.4279688 sub=0.0000000
17-03-26 12:38:36 [1] Step: 95500 Eval acc: 0.68330 0.85593 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-26 12:38:36 [1] Eval Extra: inv=0.4071886
17-03-26 12:39:55 [1] Step: 95600 Acc: 0.71375 0.85258 Cost: 0.95546 0.61835 0.22000 0.11711 Time: 0.00075
17-03-26 12:39:55 [1] Train Extra: lr=0.0000192 inv=0.4096875 sub=0.0000000
17-03-26 12:41:22 [1] Step: 95700 Acc: 0.69656 0.85872 Cost: 0.83163 0.56147 0.15296 0.11721 Time: 0.00077
17-03-26 12:41:22 [1] Train Extra: lr=0.0000191 inv=0.4265625 sub=0.0000000
17-03-26 12:42:54 [1] Step: 95800 Acc: 0.69688 0.85766 Cost: 0.97562 0.59703 0.26135 0.11723 Time: 0.00080
17-03-26 12:42:54 [1] Train Extra: lr=0.0000191 inv=0.4220313 sub=0.0000000
17-03-26 12:44:21 [1] Step: 95900 Acc: 0.70688 0.84774 Cost: 1.10305 0.72083 0.26492 0.11730 Time: 0.00075
17-03-26 12:44:21 [1] Train Extra: lr=0.0000190 inv=0.4285937 sub=0.0000000
17-03-26 12:45:40 [1] Step: 96000 Acc: 0.72281 0.84658 Cost: 1.15840 0.77771 0.26331 0.11738 Time: 0.00074
17-03-26 12:45:40 [1] Train Extra: lr=0.0000190 inv=0.4165625 sub=0.0000000
17-03-26 12:46:36 [1] Step: 96000 Eval acc: 0.68905 0.85613 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 12:46:36 [1] Eval Extra: inv=0.3995693
17-03-26 12:48:00 [1] Step: 96100 Acc: 0.71625 0.84736 Cost: 0.81497 0.56846 0.12912 0.11739 Time: 0.00075
17-03-26 12:48:00 [1] Train Extra: lr=0.0000189 inv=0.4295312 sub=0.0000000
17-03-26 12:49:15 [1] Step: 96200 Acc: 0.71125 0.85208 Cost: 0.78915 0.57706 0.09462 0.11748 Time: 0.00073
17-03-26 12:49:15 [1] Train Extra: lr=0.0000188 inv=0.4165625 sub=0.0000000
17-03-26 12:50:40 [1] Step: 96300 Acc: 0.72500 0.85024 Cost: 0.89598 0.57158 0.20685 0.11754 Time: 0.00078
17-03-26 12:50:40 [1] Train Extra: lr=0.0000188 inv=0.4035937 sub=0.0000000
17-03-26 12:51:55 [1] Step: 96400 Acc: 0.72875 0.84659 Cost: 0.88487 0.55310 0.21412 0.11766 Time: 0.00070
17-03-26 12:51:55 [1] Train Extra: lr=0.0000187 inv=0.4206250 sub=0.0000000
17-03-26 12:53:15 [1] Step: 96500 Acc: 0.72094 0.84576 Cost: 1.30216 0.89582 0.28860 0.11774 Time: 0.00076
17-03-26 12:53:15 [1] Train Extra: lr=0.0000187 inv=0.4295312 sub=0.0000000
17-03-26 12:54:13 [1] Step: 96500 Eval acc: 0.68364 0.85459 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 12:54:13 [1] Eval Extra: inv=0.4384386
17-03-26 12:55:40 [1] Step: 96600 Acc: 0.70656 0.84647 Cost: 0.71879 0.52372 0.07733 0.11774 Time: 0.00074
17-03-26 12:55:40 [1] Train Extra: lr=0.0000186 inv=0.4551562 sub=0.0000000
17-03-26 12:56:58 [1] Step: 96700 Acc: 0.71250 0.85508 Cost: 0.79023 0.60899 0.06344 0.11780 Time: 0.00075
17-03-26 12:56:58 [1] Train Extra: lr=0.0000186 inv=0.4092188 sub=0.0000000
17-03-26 12:58:15 [1] Step: 96800 Acc: 0.71875 0.84915 Cost: 0.78830 0.55107 0.11947 0.11776 Time: 0.00074
17-03-26 12:58:15 [1] Train Extra: lr=0.0000185 inv=0.4120313 sub=0.0000000
17-03-26 12:59:40 [1] Step: 96900 Acc: 0.71906 0.85061 Cost: 0.88066 0.59664 0.16615 0.11787 Time: 0.00076
17-03-26 12:59:40 [1] Train Extra: lr=0.0000185 inv=0.4148438 sub=0.0000000
17-03-26 13:01:02 [1] Step: 97000 Acc: 0.70875 0.84513 Cost: 1.08884 0.73858 0.23237 0.11789 Time: 0.00073
17-03-26 13:01:02 [1] Train Extra: lr=0.0000184 inv=0.4148438 sub=0.0000000
17-03-26 13:02:00 [1] Step: 97000 Eval acc: 0.68617 0.85289 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 13:02:00 [1] Eval Extra: inv=0.4142005
17-03-26 13:03:28 [1] Step: 97100 Acc: 0.71031 0.85416 Cost: 0.96061 0.72979 0.11286 0.11796 Time: 0.00078
17-03-26 13:03:28 [1] Train Extra: lr=0.0000184 inv=0.4117188 sub=0.0000000
17-03-26 13:04:46 [1] Step: 97200 Acc: 0.71844 0.84953 Cost: 1.34607 1.00172 0.22635 0.11800 Time: 0.00074
17-03-26 13:04:46 [1] Train Extra: lr=0.0000183 inv=0.3985938 sub=0.0000000
17-03-26 13:06:17 [1] Step: 97300 Acc: 0.70500 0.85032 Cost: 0.82727 0.60789 0.10129 0.11809 Time: 0.00078
17-03-26 13:06:17 [1] Train Extra: lr=0.0000183 inv=0.4300000 sub=0.0000000
17-03-26 13:07:36 [1] Step: 97400 Acc: 0.70312 0.85069 Cost: 1.22045 0.83679 0.26558 0.11808 Time: 0.00073
17-03-26 13:07:36 [1] Train Extra: lr=0.0000182 inv=0.4212500 sub=0.0000000
17-03-26 13:09:11 [1] Step: 97500 Acc: 0.70844 0.85677 Cost: 1.04129 0.67458 0.24855 0.11815 Time: 0.00082
17-03-26 13:09:11 [1] Train Extra: lr=0.0000182 inv=0.4068750 sub=0.0000000
17-03-26 13:10:06 [1] Step: 97500 Eval acc: 0.68275 0.85262 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00016
17-03-26 13:10:06 [1] Eval Extra: inv=0.4255190
17-03-26 13:11:29 [1] Step: 97600 Acc: 0.69750 0.84974 Cost: 1.03621 0.65518 0.26286 0.11817 Time: 0.00076
17-03-26 13:11:29 [1] Train Extra: lr=0.0000181 inv=0.4279688 sub=0.0000000
17-03-26 13:12:47 [1] Step: 97700 Acc: 0.70875 0.85126 Cost: 1.19963 0.82406 0.25733 0.11824 Time: 0.00074
17-03-26 13:12:47 [1] Train Extra: lr=0.0000180 inv=0.4246875 sub=0.0000000
17-03-26 13:14:07 [1] Step: 97800 Acc: 0.71156 0.85220 Cost: 1.34708 0.86978 0.35901 0.11829 Time: 0.00076
17-03-26 13:14:07 [1] Train Extra: lr=0.0000180 inv=0.4200000 sub=0.0000000
17-03-26 13:15:34 [1] Step: 97900 Acc: 0.70813 0.84592 Cost: 1.18399 0.79530 0.27037 0.11832 Time: 0.00074
17-03-26 13:15:34 [1] Train Extra: lr=0.0000179 inv=0.4459375 sub=0.0000000
17-03-26 13:16:53 [1] Step: 98000 Acc: 0.71656 0.84989 Cost: 0.87784 0.49676 0.26266 0.11841 Time: 0.00074
17-03-26 13:16:53 [1] Train Extra: lr=0.0000179 inv=0.4309375 sub=0.0000000
17-03-26 13:17:50 [1] Step: 98000 Eval acc: 0.68617 0.85582 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 13:17:50 [1] Eval Extra: inv=0.4336904
17-03-26 13:19:11 [1] Step: 98100 Acc: 0.71437 0.85026 Cost: 0.83518 0.52378 0.19298 0.11842 Time: 0.00075
17-03-26 13:19:11 [1] Train Extra: lr=0.0000178 inv=0.4210937 sub=0.0000000
17-03-26 13:20:43 [1] Step: 98200 Acc: 0.69969 0.84563 Cost: 1.40089 0.97715 0.30525 0.11849 Time: 0.00077
17-03-26 13:20:43 [1] Train Extra: lr=0.0000178 inv=0.4379688 sub=0.0000000
17-03-26 13:22:15 [1] Step: 98300 Acc: 0.69437 0.85356 Cost: 0.87453 0.61600 0.13997 0.11857 Time: 0.00081
17-03-26 13:22:15 [1] Train Extra: lr=0.0000177 inv=0.4062500 sub=0.0000000
17-03-26 13:23:37 [1] Step: 98400 Acc: 0.71656 0.85077 Cost: 0.79037 0.52054 0.15131 0.11852 Time: 0.00073
17-03-26 13:23:37 [1] Train Extra: lr=0.0000177 inv=0.4442187 sub=0.0000000
17-03-26 13:24:56 [1] Step: 98500 Acc: 0.70656 0.85426 Cost: 0.92499 0.70310 0.10333 0.11856 Time: 0.00075
17-03-26 13:24:56 [1] Train Extra: lr=0.0000176 inv=0.4104687 sub=0.0000000
17-03-26 13:25:55 [1] Step: 98500 Eval acc: 0.68949 0.84893 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 13:25:55 [1] Eval Extra: inv=0.4145870
17-03-26 13:27:22 [1] Step: 98600 Acc: 0.70500 0.84469 Cost: 1.39102 1.00061 0.27178 0.11863 Time: 0.00075
17-03-26 13:27:22 [1] Train Extra: lr=0.0000176 inv=0.4285937 sub=0.0000000
17-03-26 13:28:45 [1] Step: 98700 Acc: 0.71406 0.85010 Cost: 0.79330 0.52680 0.14788 0.11862 Time: 0.00076
17-03-26 13:28:45 [1] Train Extra: lr=0.0000175 inv=0.3978125 sub=0.0000000
17-03-26 13:30:18 [1] Step: 98800 Acc: 0.70312 0.84379 Cost: 0.89354 0.66733 0.10761 0.11860 Time: 0.00075
17-03-26 13:30:18 [1] Train Extra: lr=0.0000175 inv=0.4518750 sub=0.0000000
17-03-26 13:31:45 [1] Step: 98900 Acc: 0.71469 0.85164 Cost: 1.12957 0.81107 0.19987 0.11864 Time: 0.00074
17-03-26 13:31:45 [1] Train Extra: lr=0.0000174 inv=0.4584375 sub=0.0000000
17-03-26 13:33:04 [1] Step: 99000 Acc: 0.70781 0.84970 Cost: 0.79467 0.56829 0.10771 0.11867 Time: 0.00075
17-03-26 13:33:04 [1] Train Extra: lr=0.0000174 inv=0.4006250 sub=0.0000000
17-03-26 13:34:02 [1] Step: 99000 Eval acc: 0.68231 0.84676 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 13:34:02 [1] Eval Extra: inv=0.4112743
17-03-26 13:35:16 [1] Step: 99100 Acc: 0.69594 0.85430 Cost: 0.92651 0.66442 0.14339 0.11870 Time: 0.00073
17-03-26 13:35:16 [1] Train Extra: lr=0.0000173 inv=0.3962500 sub=0.0000000
17-03-26 13:36:28 [1] Step: 99200 Acc: 0.69281 0.84907 Cost: 1.20328 0.78565 0.29890 0.11873 Time: 0.00073
17-03-26 13:36:28 [1] Train Extra: lr=0.0000173 inv=0.4110937 sub=0.0000000
17-03-26 13:37:54 [1] Step: 99300 Acc: 0.69719 0.84325 Cost: 0.95566 0.66471 0.17220 0.11875 Time: 0.00073
17-03-26 13:37:54 [1] Train Extra: lr=0.0000172 inv=0.4525000 sub=0.0000000
17-03-26 13:39:21 [1] Step: 99400 Acc: 0.71188 0.84363 Cost: 1.19452 0.86897 0.20679 0.11875 Time: 0.00075
17-03-26 13:39:21 [1] Train Extra: lr=0.0000172 inv=0.4350000 sub=0.0000000
17-03-26 13:40:44 [1] Step: 99500 Acc: 0.72094 0.85757 Cost: 1.05065 0.74023 0.19168 0.11873 Time: 0.00078
17-03-26 13:40:44 [1] Train Extra: lr=0.0000171 inv=0.4245313 sub=0.0000000
17-03-26 13:41:41 [1] Step: 99500 Eval acc: 0.68242 0.85191 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 13:41:41 [1] Eval Extra: inv=0.4355124
17-03-26 13:43:12 [1] Step: 99600 Acc: 0.71125 0.84876 Cost: 1.50719 1.06284 0.32560 0.11875 Time: 0.00077
17-03-26 13:43:12 [1] Train Extra: lr=0.0000171 inv=0.4432813 sub=0.0000000
17-03-26 13:44:45 [1] Step: 99700 Acc: 0.69969 0.85808 Cost: 0.54435 0.35236 0.07313 0.11886 Time: 0.00079
17-03-26 13:44:45 [1] Train Extra: lr=0.0000170 inv=0.4084375 sub=0.0000000
17-03-26 13:46:06 [1] Step: 99800 Acc: 0.70344 0.85345 Cost: 0.98802 0.66099 0.20818 0.11885 Time: 0.00074
17-03-26 13:46:06 [1] Train Extra: lr=0.0000170 inv=0.4051562 sub=0.0000000
17-03-26 13:47:26 [1] Step: 99900 Acc: 0.71562 0.85743 Cost: 1.02052 0.66699 0.23463 0.11890 Time: 0.00076
17-03-26 13:47:26 [1] Train Extra: lr=0.0000169 inv=0.4121875 sub=0.0000000
17-03-26 13:48:38 [1] Step: 100000 Acc: 0.72281 0.85445 Cost: 1.24126 0.86152 0.26085 0.11889 Time: 0.00075
17-03-26 13:48:38 [1] Train Extra: lr=0.0000169 inv=0.3893750 sub=0.0000000
17-03-26 13:49:35 [1] Step: 100000 Eval acc: 0.68209 0.85454 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 13:49:35 [1] Eval Extra: inv=0.4079064
17-03-26 13:49:35 [1] Checkpointing.
17-03-26 13:51:00 [1] Step: 100100 Acc: 0.69781 0.84767 Cost: 1.17913 0.71065 0.34954 0.11895 Time: 0.00075
17-03-26 13:51:00 [1] Train Extra: lr=0.0000168 inv=0.4501562 sub=0.0000000
17-03-26 13:52:37 [1] Step: 100200 Acc: 0.69500 0.84927 Cost: 0.87828 0.54796 0.21125 0.11908 Time: 0.00079
17-03-26 13:52:37 [1] Train Extra: lr=0.0000168 inv=0.4476562 sub=0.0000000
17-03-26 13:53:52 [1] Step: 100300 Acc: 0.70375 0.85137 Cost: 0.81044 0.48257 0.20877 0.11910 Time: 0.00072
17-03-26 13:53:52 [1] Train Extra: lr=0.0000167 inv=0.4142188 sub=0.0000000
17-03-26 13:55:17 [1] Step: 100400 Acc: 0.69906 0.85521 Cost: 1.44001 1.02463 0.29624 0.11914 Time: 0.00077
17-03-26 13:55:17 [1] Train Extra: lr=0.0000167 inv=0.4160937 sub=0.0000000
17-03-26 13:56:42 [1] Step: 100500 Acc: 0.70156 0.84750 Cost: 1.13423 0.79157 0.22356 0.11910 Time: 0.00075
17-03-26 13:56:42 [1] Train Extra: lr=0.0000167 inv=0.4498437 sub=0.0000000
17-03-26 13:57:41 [1] Step: 100500 Eval acc: 0.68485 0.85090 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 13:57:41 [1] Eval Extra: inv=0.4295495
17-03-26 13:59:09 [1] Step: 100600 Acc: 0.70844 0.85197 Cost: 1.53558 1.11135 0.30509 0.11914 Time: 0.00079
17-03-26 13:59:09 [1] Train Extra: lr=0.0000166 inv=0.4203125 sub=0.0000000
17-03-26 14:00:39 [1] Step: 100700 Acc: 0.70156 0.85691 Cost: 1.04694 0.68333 0.24447 0.11914 Time: 0.00081
17-03-26 14:00:39 [1] Train Extra: lr=0.0000166 inv=0.4356250 sub=0.0000000
17-03-26 14:02:11 [1] Step: 100800 Acc: 0.72250 0.85341 Cost: 0.96689 0.58363 0.26398 0.11928 Time: 0.00078
17-03-26 14:02:11 [1] Train Extra: lr=0.0000165 inv=0.4489063 sub=0.0000000
17-03-26 14:03:25 [1] Step: 100900 Acc: 0.74031 0.84973 Cost: 0.87622 0.57558 0.18120 0.11944 Time: 0.00071
17-03-26 14:03:25 [1] Train Extra: lr=0.0000165 inv=0.3834375 sub=0.0000000
17-03-26 14:04:43 [1] Step: 101000 Acc: 0.72313 0.84845 Cost: 1.01136 0.64326 0.24853 0.11957 Time: 0.00073
17-03-26 14:04:43 [1] Train Extra: lr=0.0000164 inv=0.4093750 sub=0.0000000
17-03-26 14:05:40 [1] Step: 101000 Eval acc: 0.67977 0.85126 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 14:05:40 [1] Eval Extra: inv=0.4268441
17-03-26 14:06:56 [1] Step: 101100 Acc: 0.73594 0.84828 Cost: 0.97834 0.59921 0.25952 0.11961 Time: 0.00073
17-03-26 14:06:56 [1] Train Extra: lr=0.0000164 inv=0.3946875 sub=0.0000000
17-03-26 14:08:12 [1] Step: 101200 Acc: 0.73156 0.85236 Cost: 1.08921 0.80287 0.16660 0.11974 Time: 0.00071
17-03-26 14:08:12 [1] Train Extra: lr=0.0000163 inv=0.4237500 sub=0.0000000
17-03-26 14:09:46 [1] Step: 101300 Acc: 0.72000 0.84845 Cost: 0.78216 0.53093 0.13137 0.11987 Time: 0.00080
17-03-26 14:09:46 [1] Train Extra: lr=0.0000163 inv=0.4481250 sub=0.0000000
17-03-26 14:11:17 [1] Step: 101400 Acc: 0.72969 0.84264 Cost: 0.81478 0.55087 0.14392 0.12000 Time: 0.00076
17-03-26 14:11:17 [1] Train Extra: lr=0.0000162 inv=0.4535938 sub=0.0000000
17-03-26 14:12:38 [1] Step: 101500 Acc: 0.74531 0.84356 Cost: 0.87407 0.59390 0.16007 0.12010 Time: 0.00071
17-03-26 14:12:38 [1] Train Extra: lr=0.0000162 inv=0.4337500 sub=0.0000000
17-03-26 14:13:35 [1] Step: 101500 Eval acc: 0.68154 0.85383 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 14:13:35 [1] Eval Extra: inv=0.4236418
17-03-26 14:14:58 [1] Step: 101600 Acc: 0.73281 0.84237 Cost: 1.13734 0.72889 0.28820 0.12025 Time: 0.00075
17-03-26 14:14:58 [1] Train Extra: lr=0.0000161 inv=0.4415625 sub=0.0000000
17-03-26 14:16:23 [1] Step: 101700 Acc: 0.71688 0.85677 Cost: 1.19865 0.86135 0.21692 0.12038 Time: 0.00076
17-03-26 14:16:23 [1] Train Extra: lr=0.0000161 inv=0.4218750 sub=0.0000000
17-03-26 14:17:41 [1] Step: 101800 Acc: 0.72781 0.84622 Cost: 1.17239 0.78962 0.26237 0.12040 Time: 0.00072
17-03-26 14:17:41 [1] Train Extra: lr=0.0000160 inv=0.4248438 sub=0.0000000
17-03-26 14:19:09 [1] Step: 101900 Acc: 0.71906 0.84839 Cost: 1.06793 0.72782 0.21962 0.12049 Time: 0.00075
17-03-26 14:19:09 [1] Train Extra: lr=0.0000160 inv=0.4360938 sub=0.0000000
17-03-26 14:20:26 [1] Step: 102000 Acc: 0.73687 0.84763 Cost: 0.72622 0.47770 0.12792 0.12060 Time: 0.00073
17-03-26 14:20:26 [1] Train Extra: lr=0.0000159 inv=0.4135937 sub=0.0000000
17-03-26 14:21:24 [1] Step: 102000 Eval acc: 0.68165 0.85301 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 14:21:24 [1] Eval Extra: inv=0.4011705
17-03-26 14:22:41 [1] Step: 102100 Acc: 0.73406 0.84892 Cost: 0.84969 0.51918 0.20972 0.12079 Time: 0.00074
17-03-26 14:22:41 [1] Train Extra: lr=0.0000159 inv=0.3925000 sub=0.0000000
17-03-26 14:24:06 [1] Step: 102200 Acc: 0.71875 0.85397 Cost: 0.97085 0.57595 0.27399 0.12091 Time: 0.00076
17-03-26 14:24:06 [1] Train Extra: lr=0.0000159 inv=0.4112500 sub=0.0000000
17-03-26 14:25:26 [1] Step: 102300 Acc: 0.72344 0.84083 Cost: 0.88522 0.64614 0.11812 0.12096 Time: 0.00072
17-03-26 14:25:26 [1] Train Extra: lr=0.0000158 inv=0.4153125 sub=0.0000000
17-03-26 14:26:47 [1] Step: 102400 Acc: 0.72875 0.84996 Cost: 1.16144 0.71981 0.32060 0.12103 Time: 0.00075
17-03-26 14:26:47 [1] Train Extra: lr=0.0000158 inv=0.3935938 sub=0.0000000
17-03-26 14:28:06 [1] Step: 102500 Acc: 0.72719 0.85343 Cost: 0.78750 0.59364 0.07265 0.12120 Time: 0.00076
17-03-26 14:28:06 [1] Train Extra: lr=0.0000157 inv=0.4020313 sub=0.0000000
17-03-26 14:29:03 [1] Step: 102500 Eval acc: 0.68065 0.84982 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 14:29:03 [1] Eval Extra: inv=0.3843308
17-03-26 14:30:31 [1] Step: 102600 Acc: 0.72406 0.85327 Cost: 0.93532 0.57407 0.23993 0.12133 Time: 0.00077
17-03-26 14:30:31 [1] Train Extra: lr=0.0000157 inv=0.4376563 sub=0.0000000
17-03-26 14:32:02 [1] Step: 102700 Acc: 0.71469 0.85278 Cost: 0.86136 0.56937 0.17056 0.12142 Time: 0.00079
17-03-26 14:32:02 [1] Train Extra: lr=0.0000156 inv=0.4334375 sub=0.0000000
17-03-26 14:33:16 [1] Step: 102800 Acc: 0.71688 0.85552 Cost: 1.05448 0.69816 0.23479 0.12153 Time: 0.00073
17-03-26 14:33:16 [1] Train Extra: lr=0.0000156 inv=0.4123438 sub=0.0000000
17-03-26 14:34:36 [1] Step: 102900 Acc: 0.72875 0.84456 Cost: 0.97327 0.58896 0.26276 0.12154 Time: 0.00075
17-03-26 14:34:36 [1] Train Extra: lr=0.0000155 inv=0.4304688 sub=0.0000000
17-03-26 14:35:57 [1] Step: 103000 Acc: 0.72594 0.85616 Cost: 1.03788 0.68567 0.23055 0.12166 Time: 0.00075
17-03-26 14:35:57 [1] Train Extra: lr=0.0000155 inv=0.4285937 sub=0.0000000
17-03-26 14:36:54 [1] Step: 103000 Eval acc: 0.68805 0.85878 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 14:36:54 [1] Eval Extra: inv=0.3923918
17-03-26 14:38:12 [1] Step: 103100 Acc: 0.73156 0.85509 Cost: 0.71920 0.42585 0.17162 0.12173 Time: 0.00075
17-03-26 14:38:12 [1] Train Extra: lr=0.0000155 inv=0.3892188 sub=0.0000000
17-03-26 14:39:31 [1] Step: 103200 Acc: 0.72281 0.84849 Cost: 1.15814 0.76688 0.26947 0.12180 Time: 0.00073
17-03-26 14:39:31 [1] Train Extra: lr=0.0000154 inv=0.4210937 sub=0.0000000
17-03-26 14:41:03 [1] Step: 103300 Acc: 0.72469 0.85160 Cost: 1.19356 0.79340 0.27823 0.12193 Time: 0.00078
17-03-26 14:41:03 [1] Train Extra: lr=0.0000154 inv=0.4421875 sub=0.0000000
17-03-26 14:42:28 [1] Step: 103400 Acc: 0.71562 0.84853 Cost: 0.83777 0.61267 0.10312 0.12199 Time: 0.00077
17-03-26 14:42:28 [1] Train Extra: lr=0.0000153 inv=0.4354688 sub=0.0000000
17-03-26 14:43:43 [1] Step: 103500 Acc: 0.71750 0.84586 Cost: 1.02493 0.63067 0.27220 0.12207 Time: 0.00072
17-03-26 14:43:43 [1] Train Extra: lr=0.0000153 inv=0.4179688 sub=0.0000000
17-03-26 14:44:42 [1] Step: 103500 Eval acc: 0.67833 0.85566 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 14:44:42 [1] Eval Extra: inv=0.4358989
17-03-26 14:46:00 [1] Step: 103600 Acc: 0.72437 0.85087 Cost: 1.02778 0.66547 0.24013 0.12218 Time: 0.00075
17-03-26 14:46:00 [1] Train Extra: lr=0.0000152 inv=0.4093750 sub=0.0000000
17-03-26 14:47:21 [1] Step: 103700 Acc: 0.72875 0.85498 Cost: 1.12699 0.77974 0.22506 0.12219 Time: 0.00075
17-03-26 14:47:21 [1] Train Extra: lr=0.0000152 inv=0.4110937 sub=0.0000000
17-03-26 14:48:46 [1] Step: 103800 Acc: 0.72094 0.84675 Cost: 1.10252 0.68604 0.29426 0.12222 Time: 0.00075
17-03-26 14:48:46 [1] Train Extra: lr=0.0000151 inv=0.4425000 sub=0.0000000
17-03-26 14:50:16 [1] Step: 103900 Acc: 0.72375 0.84798 Cost: 0.94052 0.61038 0.20782 0.12232 Time: 0.00077
17-03-26 14:50:16 [1] Train Extra: lr=0.0000151 inv=0.4456250 sub=0.0000000
17-03-26 14:51:58 [1] Step: 104000 Acc: 0.69437 0.85186 Cost: 1.07132 0.74641 0.20253 0.12239 Time: 0.00078
17-03-26 14:51:58 [1] Train Extra: lr=0.0000151 inv=0.4700000 sub=0.0000000
17-03-26 14:52:55 [1] Step: 104000 Eval acc: 0.67193 0.85361 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 14:52:55 [1] Eval Extra: inv=0.4190592
17-03-26 14:54:26 [1] Step: 104100 Acc: 0.71437 0.85142 Cost: 0.85921 0.51676 0.22000 0.12245 Time: 0.00078
17-03-26 14:54:26 [1] Train Extra: lr=0.0000150 inv=0.4479688 sub=0.0000000
17-03-26 14:55:38 [1] Step: 104200 Acc: 0.72188 0.85067 Cost: 1.44495 1.07018 0.25225 0.12253 Time: 0.00073
17-03-26 14:55:38 [1] Train Extra: lr=0.0000150 inv=0.3931250 sub=0.0000000
17-03-26 14:57:04 [1] Step: 104300 Acc: 0.72781 0.84831 Cost: 0.90318 0.63061 0.14998 0.12259 Time: 0.00076
17-03-26 14:57:04 [1] Train Extra: lr=0.0000149 inv=0.4129687 sub=0.0000000
17-03-26 14:58:25 [1] Step: 104400 Acc: 0.70094 0.85273 Cost: 0.85297 0.49397 0.23639 0.12261 Time: 0.00073
17-03-26 14:58:25 [1] Train Extra: lr=0.0000149 inv=0.4385938 sub=0.0000000
17-03-26 14:59:44 [1] Step: 104500 Acc: 0.72344 0.84350 Cost: 0.90269 0.54300 0.23695 0.12274 Time: 0.00072
17-03-26 14:59:44 [1] Train Extra: lr=0.0000148 inv=0.4293750 sub=0.0000000
17-03-26 15:00:42 [1] Step: 104500 Eval acc: 0.68330 0.85298 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 15:00:42 [1] Eval Extra: inv=0.4132619
17-03-26 15:02:00 [1] Step: 104600 Acc: 0.71313 0.85602 Cost: 0.95590 0.61126 0.22189 0.12275 Time: 0.00074
17-03-26 15:02:00 [1] Train Extra: lr=0.0000148 inv=0.4068750 sub=0.0000000
17-03-26 15:03:20 [1] Step: 104700 Acc: 0.72281 0.85176 Cost: 0.98094 0.64883 0.20930 0.12281 Time: 0.00074
17-03-26 15:03:20 [1] Train Extra: lr=0.0000148 inv=0.4342187 sub=0.0000000
17-03-26 15:04:52 [1] Step: 104800 Acc: 0.71781 0.85020 Cost: 0.95032 0.63403 0.19344 0.12285 Time: 0.00078
17-03-26 15:04:52 [1] Train Extra: lr=0.0000147 inv=0.4503125 sub=0.0000000
17-03-26 15:06:14 [1] Step: 104900 Acc: 0.72500 0.84940 Cost: 1.21699 0.80366 0.29040 0.12293 Time: 0.00076
17-03-26 15:06:14 [1] Train Extra: lr=0.0000147 inv=0.4056250 sub=0.0000000
17-03-26 15:07:38 [1] Step: 105000 Acc: 0.72062 0.85560 Cost: 0.78330 0.46826 0.19204 0.12300 Time: 0.00079
17-03-26 15:07:38 [1] Train Extra: lr=0.0000146 inv=0.4001562 sub=0.0000000
17-03-26 15:08:36 [1] Step: 105000 Eval acc: 0.67966 0.85369 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 15:08:36 [1] Eval Extra: inv=0.4615172
17-03-26 15:08:36 [1] Checkpointing.
17-03-26 15:09:53 [1] Step: 105100 Acc: 0.71844 0.85931 Cost: 0.99343 0.69239 0.17795 0.12310 Time: 0.00076
17-03-26 15:09:53 [1] Train Extra: lr=0.0000146 inv=0.3810938 sub=0.0000000
17-03-26 15:11:19 [1] Step: 105200 Acc: 0.72281 0.85527 Cost: 0.76698 0.59015 0.05367 0.12316 Time: 0.00080
17-03-26 15:11:19 [1] Train Extra: lr=0.0000145 inv=0.4339062 sub=0.0000000
17-03-26 15:12:52 [1] Step: 105300 Acc: 0.70625 0.84740 Cost: 1.00366 0.63474 0.24576 0.12316 Time: 0.00077
17-03-26 15:12:52 [1] Train Extra: lr=0.0000145 inv=0.4732812 sub=0.0000000
17-03-26 15:14:09 [1] Step: 105400 Acc: 0.72000 0.84821 Cost: 0.86764 0.57961 0.16485 0.12319 Time: 0.00073
17-03-26 15:14:09 [1] Train Extra: lr=0.0000145 inv=0.4142188 sub=0.0000000
17-03-26 15:15:28 [1] Step: 105500 Acc: 0.72875 0.86110 Cost: 0.76096 0.42376 0.21394 0.12325 Time: 0.00077
17-03-26 15:15:28 [1] Train Extra: lr=0.0000144 inv=0.3926562 sub=0.0000000
17-03-26 15:16:26 [1] Step: 105500 Eval acc: 0.68595 0.85233 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 15:16:26 [1] Eval Extra: inv=0.4392668
17-03-26 15:17:50 [1] Step: 105600 Acc: 0.71375 0.85656 Cost: 0.96206 0.68442 0.15435 0.12329 Time: 0.00077
17-03-26 15:17:50 [1] Train Extra: lr=0.0000144 inv=0.4293750 sub=0.0000000
17-03-26 15:19:04 [1] Step: 105700 Acc: 0.71656 0.84949 Cost: 0.99337 0.71725 0.15277 0.12336 Time: 0.00071
17-03-26 15:19:04 [1] Train Extra: lr=0.0000143 inv=0.4150000 sub=0.0000000
17-03-26 15:20:35 [1] Step: 105800 Acc: 0.71625 0.85258 Cost: 0.87341 0.58310 0.16687 0.12345 Time: 0.00078
17-03-26 15:20:35 [1] Train Extra: lr=0.0000143 inv=0.4403125 sub=0.0000000
17-03-26 15:21:47 [1] Step: 105900 Acc: 0.72406 0.85105 Cost: 0.70589 0.51685 0.06553 0.12351 Time: 0.00070
17-03-26 15:21:47 [1] Train Extra: lr=0.0000143 inv=0.4300000 sub=0.0000000
17-03-26 15:23:05 [1] Step: 106000 Acc: 0.71844 0.84774 Cost: 0.98946 0.69307 0.17279 0.12360 Time: 0.00073
17-03-26 15:23:05 [1] Train Extra: lr=0.0000142 inv=0.4451562 sub=0.0000000
17-03-26 15:24:02 [1] Step: 106000 Eval acc: 0.68098 0.84960 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 15:24:02 [1] Eval Extra: inv=0.4574867
17-03-26 15:25:26 [1] Step: 106100 Acc: 0.72000 0.84646 Cost: 0.80700 0.50585 0.17744 0.12371 Time: 0.00074
17-03-26 15:25:26 [1] Train Extra: lr=0.0000142 inv=0.4615625 sub=0.0000000
17-03-26 15:26:41 [1] Step: 106200 Acc: 0.73125 0.84555 Cost: 0.93357 0.62740 0.18241 0.12376 Time: 0.00071
17-03-26 15:26:41 [1] Train Extra: lr=0.0000141 inv=0.4329688 sub=0.0000000
17-03-26 15:28:09 [1] Step: 106300 Acc: 0.70188 0.84664 Cost: 0.93873 0.65051 0.16445 0.12376 Time: 0.00076
17-03-26 15:28:09 [1] Train Extra: lr=0.0000141 inv=0.4348437 sub=0.0000000
17-03-26 15:29:28 [1] Step: 106400 Acc: 0.71562 0.85535 Cost: 1.11536 0.90722 0.08438 0.12376 Time: 0.00077
17-03-26 15:29:28 [1] Train Extra: lr=0.0000141 inv=0.4118750 sub=0.0000000
17-03-26 15:30:54 [1] Step: 106500 Acc: 0.70594 0.85129 Cost: 1.03167 0.68727 0.22061 0.12379 Time: 0.00076
17-03-26 15:30:54 [1] Train Extra: lr=0.0000140 inv=0.4484375 sub=0.0000000
17-03-26 15:31:52 [1] Step: 106500 Eval acc: 0.68408 0.85461 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 15:31:52 [1] Eval Extra: inv=0.4449536
17-03-26 15:33:11 [1] Step: 106600 Acc: 0.71813 0.85346 Cost: 1.19405 0.81103 0.25918 0.12384 Time: 0.00074
17-03-26 15:33:11 [1] Train Extra: lr=0.0000140 inv=0.4048438 sub=0.0000000
17-03-26 15:34:41 [1] Step: 106700 Acc: 0.71062 0.84983 Cost: 0.89370 0.59739 0.17241 0.12391 Time: 0.00078
17-03-26 15:34:41 [1] Train Extra: lr=0.0000139 inv=0.4284375 sub=0.0000000
17-03-26 15:36:14 [1] Step: 106800 Acc: 0.71188 0.84747 Cost: 0.95349 0.61960 0.21003 0.12386 Time: 0.00075
17-03-26 15:36:14 [1] Train Extra: lr=0.0000139 inv=0.4625000 sub=0.0000000
17-03-26 15:37:30 [1] Step: 106900 Acc: 0.72656 0.84939 Cost: 0.83988 0.58614 0.12984 0.12390 Time: 0.00074
17-03-26 15:37:30 [1] Train Extra: lr=0.0000139 inv=0.3942188 sub=0.0000000
17-03-26 15:38:50 [1] Step: 107000 Acc: 0.72219 0.84257 Cost: 1.18046 0.82347 0.23310 0.12389 Time: 0.00073
17-03-26 15:38:50 [1] Train Extra: lr=0.0000138 inv=0.4415625 sub=0.0000000
17-03-26 15:39:48 [1] Step: 107000 Eval acc: 0.68463 0.85448 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 15:39:48 [1] Eval Extra: inv=0.3815702
17-03-26 15:41:08 [1] Step: 107100 Acc: 0.70125 0.84734 Cost: 1.06248 0.69156 0.24693 0.12399 Time: 0.00075
17-03-26 15:41:08 [1] Train Extra: lr=0.0000138 inv=0.4271875 sub=0.0000000
17-03-26 15:42:27 [1] Step: 107200 Acc: 0.71062 0.85636 Cost: 0.93558 0.69230 0.11928 0.12399 Time: 0.00076
17-03-26 15:42:27 [1] Train Extra: lr=0.0000137 inv=0.4250000 sub=0.0000000
17-03-26 15:43:55 [1] Step: 107300 Acc: 0.71313 0.85013 Cost: 1.12180 0.68107 0.31674 0.12399 Time: 0.00076
17-03-26 15:43:55 [1] Train Extra: lr=0.0000137 inv=0.4471875 sub=0.0000000
17-03-26 15:45:20 [1] Step: 107400 Acc: 0.70813 0.84869 Cost: 1.09992 0.69228 0.28356 0.12408 Time: 0.00078
17-03-26 15:45:20 [1] Train Extra: lr=0.0000137 inv=0.4054687 sub=0.0000000
17-03-26 15:46:43 [1] Step: 107500 Acc: 0.72688 0.85381 Cost: 1.09113 0.80495 0.16212 0.12405 Time: 0.00076
17-03-26 15:46:43 [1] Train Extra: lr=0.0000136 inv=0.4160937 sub=0.0000000
17-03-26 15:47:42 [1] Step: 107500 Eval acc: 0.68352 0.85626 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 15:47:42 [1] Eval Extra: inv=0.4132067
17-03-26 15:49:04 [1] Step: 107600 Acc: 0.71031 0.84781 Cost: 0.92293 0.65760 0.14129 0.12404 Time: 0.00074
17-03-26 15:49:04 [1] Train Extra: lr=0.0000136 inv=0.4535938 sub=0.0000000
17-03-26 15:50:20 [1] Step: 107700 Acc: 0.71500 0.84688 Cost: 0.87240 0.51314 0.23520 0.12406 Time: 0.00074
17-03-26 15:50:20 [1] Train Extra: lr=0.0000135 inv=0.4135937 sub=0.0000000
17-03-26 15:51:46 [1] Step: 107800 Acc: 0.72313 0.85348 Cost: 1.14554 0.80518 0.21623 0.12412 Time: 0.00075
17-03-26 15:51:46 [1] Train Extra: lr=0.0000135 inv=0.4293750 sub=0.0000000
17-03-26 15:53:00 [1] Step: 107900 Acc: 0.71531 0.85414 Cost: 1.43633 0.96546 0.34677 0.12410 Time: 0.00072
17-03-26 15:53:00 [1] Train Extra: lr=0.0000135 inv=0.3984375 sub=0.0000000
17-03-26 15:54:25 [1] Step: 108000 Acc: 0.71656 0.84146 Cost: 1.04252 0.64123 0.27713 0.12416 Time: 0.00074
17-03-26 15:54:25 [1] Train Extra: lr=0.0000134 inv=0.4526562 sub=0.0000000
17-03-26 15:55:22 [1] Step: 108000 Eval acc: 0.68341 0.85791 /home/dexter/data/multinli_0.1/multinli_0.1_dev_matched.jsonl Time: 0.00017
17-03-26 15:55:22 [1] Eval Extra: inv=0.3849934
17-03-26 15:56:41 [1] Step: 108100 Acc: 0.71219 0.85214 Cost: 0.80909 0.48194 0.20293 0.12422 Time: 0.00074
17-03-26 15:56:41 [1] Train Extra: lr=0.0000134 inv=0.3892188 sub=0.0000000
17-03-26 15:58:02 [1] Step: 108200 Acc: 0.71500 0.85868 Cost: 1.18406 0.84517 0.21463 0.12426 Time: 0.00075
17-03-26 15:58:02 [1] Train Extra: lr=0.0000133 inv=0.4014063 sub=0.0000000
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment