dnth commited on
Commit
994d60a
·
verified ·
1 Parent(s): 1e6a3e2

Add new CrossEncoder model

Browse files
Files changed (6) hide show
  1. README.md +403 -0
  2. config.json +49 -0
  3. model.safetensors +3 -0
  4. special_tokens_map.json +37 -0
  5. tokenizer.json +0 -0
  6. tokenizer_config.json +945 -0
README.md ADDED
@@ -0,0 +1,403 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - sentence-transformers
4
+ - cross-encoder
5
+ - generated_from_trainer
6
+ - dataset_size:1129
7
+ - loss:CrossEntropyLoss
8
+ base_model: answerdotai/ModernBERT-base
9
+ datasets:
10
+ - dnth/cxs-eval-dataset
11
+ pipeline_tag: text-classification
12
+ library_name: sentence-transformers
13
+ metrics:
14
+ - f1_macro
15
+ - f1_micro
16
+ - f1_weighted
17
+ model-index:
18
+ - name: CrossEncoder based on answerdotai/ModernBERT-base
19
+ results:
20
+ - task:
21
+ type: cross-encoder-classification
22
+ name: Cross Encoder Classification
23
+ dataset:
24
+ name: validation set
25
+ type: validation-set
26
+ metrics:
27
+ - type: f1_macro
28
+ value: 0.7480758658603628
29
+ name: F1 Macro
30
+ - type: f1_micro
31
+ value: 0.8156028368794326
32
+ name: F1 Micro
33
+ - type: f1_weighted
34
+ value: 0.8193029448804775
35
+ name: F1 Weighted
36
+ ---
37
+
38
+ # CrossEncoder based on answerdotai/ModernBERT-base
39
+
40
+ This is a [Cross Encoder](https://www.sbert.net/docs/cross_encoder/usage/usage.html) model finetuned from [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on the [cxs-eval-dataset](https://huggingface.co/datasets/dnth/cxs-eval-dataset) dataset using the [sentence-transformers](https://www.SBERT.net) library. It computes scores for pairs of texts, which can be used for text pair classification.
41
+
42
+ ## Model Details
43
+
44
+ ### Model Description
45
+ - **Model Type:** Cross Encoder
46
+ - **Base model:** [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) <!-- at revision 8949b909ec900327062f0ebf497f51aef5e6f0c8 -->
47
+ - **Maximum Sequence Length:** 8192 tokens
48
+ - **Number of Output Labels:** 2 labels
49
+ - **Training Dataset:**
50
+ - [cxs-eval-dataset](https://huggingface.co/datasets/dnth/cxs-eval-dataset)
51
+ <!-- - **Language:** Unknown -->
52
+ <!-- - **License:** Unknown -->
53
+
54
+ ### Model Sources
55
+
56
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
57
+ - **Documentation:** [Cross Encoder Documentation](https://www.sbert.net/docs/cross_encoder/usage/usage.html)
58
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
59
+ - **Hugging Face:** [Cross Encoders on Hugging Face](https://huggingface.co/models?library=sentence-transformers&other=cross-encoder)
60
+
61
+ ## Usage
62
+
63
+ ### Direct Usage (Sentence Transformers)
64
+
65
+ First install the Sentence Transformers library:
66
+
67
+ ```bash
68
+ pip install -U sentence-transformers
69
+ ```
70
+
71
+ Then you can load this model and run inference.
72
+ ```python
73
+ from sentence_transformers import CrossEncoder
74
+
75
+ # Download from the 🤗 Hub
76
+ model = CrossEncoder("dnth/cxs-eval-modernbert-base-classifier")
77
+ # Get scores for pairs of texts
78
+ pairs = [
79
+ ["Inflight Quality Assurance Inspector - As an Inflight Quality Assurance Inspector at Santan, you are tasked with ensuring the accuracy and efficiency of meal and product handling for inflight services. Your responsibilities include verifying cart quantities, monitoring picking and staging processes, and maintaining inventory through CMS and WMS systems. You'll oversee packers and warehouse agents, manage stock adjustments and wastage reporting, and ensure compliance with FIFO and quality assurance standards. Key tasks include verifying outbound and inbound cart quantities, ensuring accurate meal picking, confirming seal and flight details in CMS, performing inventory counts, monitoring product packing, and facilitating effective communication between all parties involved. Additionally, you'll manage stock movements, update supervisors on stock issues, and ensure timely mass production by Packers or Warehouse Agents. - Air Transport", 'Manager (Cargo) - The Manager (Cargo) leads the development of operational strategies to improve the effectiveness of air cargo and warehouse operations. He/She is responsible for driving continuous improvement and business development initiatives to optimise processes and meet customer needs. He develops Standard Operating Procedures (SOPs) and systems to mitigate safety and/or security risks and monitors the adherence to safety and/or security standards. He also advises authorities and other teams to manage incidents involving hazardous materials and/or dangerous goods. He develops the teams technical capabilities and maintains positive morale within the teams.\n\nThe Manager (Cargo) has an extensive knowledge of cargo handling procedures, international airfreight regulations and policy requirements. He is also well-versed in organisational safety regulations and practices. He possesses strong interpersonal and stakeholder management skills in order to build and maintain stakeholder relationships. In addition, he is adaptable and manages stressful situations within a tight time frame. He also has strong communication and people management skills with outstanding computer literacy and problem-solving skills.'],
80
+ ["Manager, Pricing Systems - Manage the systems and tools that power our pricing and revenue management strategies. As Manager of Pricing Systems, you will oversee the implementation, maintenance, and enhancement of our pricing engines and related software. You'll work with IT and commercial teams to ensure system integrity, support new pricing initiatives, and troubleshoot issues. Technical expertise in airline pricing systems is a must. - Air Transport", 'Terminal Manager - The Terminal Manager is responsible for planning the delivery of airport services and implementing projects to improve the airport user experience. He/She collaborates with internal and external stakeholders to develop and establish compliance standards for airport operations. Besides leading research to identify new service offerings for the airport, he also enforces safety and/or security standards for the organisation. He sources for contractors to perform airport operations and establishes key indicators to track their performances. He also develops on-the-job training programmes for the department.\n\nThe Terminal Manager possesses a sound knowledge of airport and airside operations, Standard Operating Procedures (SOPs) and airport security and emergency response plans. He travels across the terminals to review operational processes and resolve incidents at the airport. To ensure operational plans and projects are implemented efficiently and effectively, he displays excellent project management skills with strong communication and stakeholder management skills to interact and engage all customers and stakeholders of the organisation.'],
81
+ ['Executive, Inflight Operation Team Lead - As an Executive, Inflight Operation Team Lead, you will excel in leading the Hi-Lift Driver team by demonstrating expertise in operational leadership while upholding high standards of safety and quality. You will contribute significantly to departmental and company objectives, ensuring strict adherence to industry regulations and operational procedures. Your role champions driving excellence in operational and engineering tasks, supporting initiatives to boost efficiency, and collaborating with cross-functional teams to achieve organizational goals, all while ensuring compliance with regulatory and organizational standards. - Air Transport', 'Executive (Ramp / Technical Ramp) - The Executive (Ramp/Technical Ramp) performs regular audits to ensure that ramp/technical ramp operations comply with Standard Operating Procedures (SOPs). He/She recommends enhancements to address areas of non-compliance, improves operational efficiencies and develops action plans to prevent safety and/or security issues from recurring. He also develops workplace learning plans and on-the-job training programmes to enhance the capabilities of teams.\n \nThe Executive (Ramp/Technical Ramp) has a sound knowledge of airport operations and ramp/technical ramp procedures. He also has strong analytical abilities to identify root causes of ground handling related delays and their potential implications. He is results-oriented and possesses excellent communication and interpersonal skills. Furthermore, he has strong computer literacy skills and competence in learning orientation to determine learning and developmental needs of his team members.'],
82
+ ['Executive, Customs & Logistics - As an Executive, Customs & Logistics, you will showcase your expertise as an Inflight Customs Officer by ensuring high safety and quality standards in all assignments. Your role involves contributing to the department and company objectives while adhering to industry regulations and operational procedures. You will champion excellence in operational or engineering tasks, support initiatives that enhance operational efficiency, and collaborate with cross-functional teams to achieve organizational goals, maintaining compliance with all regulatory and organizational standards. - Air Transport', 'Executive (Load Control) - The Executive (Load Control) determines load sheet requirements with airlines and develops manpower plans to support load control operations. He/She performs regular audits to ensure that the calculation of load limits, distribution and flight performance data complies with Standard Operating Procedures (SOPs). He also recommends enhancements to address areas of non-compliance and improves operational efficiencies. He develops SOPs to ensure alignment with safety and regulatory requirements, and is responsible for manpower deployment, developing on-the-job training programmes and workplace learning plans.\n\nBesides holding an Airlines Load and Trim Certification to perform his duties, the Executive (Load Control) also possesses outstanding data management and analytical skills. Moreover, he is comfortable with number crunching and is resilient to unforeseen interruptions and stressful situations. He also has excellent verbal and written communication skills and strong people management skills in order to develop effective training programmes and learning plans for the teams.'],
83
+ ['Quality Assurance Engineer - As a Quality Assurance Engineer in the aviation sector, you will conduct audits and inspections to ensure adherence to safety standards, review maintenance and operational processes, and assist in quality issue investigations. Your role includes supporting safety management systems, promoting continuous quality improvement, ensuring timely resolution of audit findings, and coordinating with regulatory bodies. You will also maintain up-to-date documentation of all quality assurance activities. - Aerospace', "Technician (Assembly) - The Technician (Assembly) performs assembly tasks for aircraft components in accordance with technical manuals and standard operating procedures (SOPs). He/She operates workshop equipment, tools and machines for the assembly of aircraft components. He also keeps abreast of latest developments of related systems by updating himself through relevant manuals and other publications. He may be authorised by the organisation to perform quality control functions, including inspection of incoming materials and assembled components and parts, and registration of non-conformances. He may also be authorised to perform level 1 non-destructive testing (NDT) functions under supervision, evaluate for acceptance or rejection, and record results as specified in the work instructions. He complies with airworthiness and legislative requirements, and the organisation's safety, health and quality systems. He supports in implementation of continuous improvement initiatives and lean practices. He works in a hangar or workshop and may be required to work in shifts. He should be systematic and detail-oriented, and able to work independently and in a team to accomplish assigned tasks."],
84
+ ]
85
+ scores = model.predict(pairs)
86
+ print(scores.shape)
87
+ # (5, 2)
88
+ ```
89
+
90
+ <!--
91
+ ### Direct Usage (Transformers)
92
+
93
+ <details><summary>Click to see the direct usage in Transformers</summary>
94
+
95
+ </details>
96
+ -->
97
+
98
+ <!--
99
+ ### Downstream Usage (Sentence Transformers)
100
+
101
+ You can finetune this model on your own dataset.
102
+
103
+ <details><summary>Click to expand</summary>
104
+
105
+ </details>
106
+ -->
107
+
108
+ <!--
109
+ ### Out-of-Scope Use
110
+
111
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
112
+ -->
113
+
114
+ ## Evaluation
115
+
116
+ ### Metrics
117
+
118
+ #### Cross Encoder Classification
119
+
120
+ * Dataset: `validation-set`
121
+ * Evaluated with [<code>CrossEncoderClassificationEvaluator</code>](https://sbert.net/docs/package_reference/cross_encoder/evaluation.html#sentence_transformers.cross_encoder.evaluation.CrossEncoderClassificationEvaluator)
122
+
123
+ | Metric | Value |
124
+ |:-------------|:-----------|
125
+ | **f1_macro** | **0.7481** |
126
+ | f1_micro | 0.8156 |
127
+ | f1_weighted | 0.8193 |
128
+
129
+ <!--
130
+ ## Bias, Risks and Limitations
131
+
132
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
133
+ -->
134
+
135
+ <!--
136
+ ### Recommendations
137
+
138
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
139
+ -->
140
+
141
+ ## Training Details
142
+
143
+ ### Training Dataset
144
+
145
+ #### cxs-eval-dataset
146
+
147
+ * Dataset: [cxs-eval-dataset](https://huggingface.co/datasets/dnth/cxs-eval-dataset) at [a3a8a16](https://huggingface.co/datasets/dnth/cxs-eval-dataset/tree/a3a8a167ee1ca5fd48f6c99cff7b61fc756fbb3f)
148
+ * Size: 1,129 training samples
149
+ * Columns: <code>query</code>, <code>result</code>, and <code>label</code>
150
+ * Approximate statistics based on the first 1000 samples:
151
+ | | query | result | label |
152
+ |:--------|:---------------------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------|:------------------------------------------------|
153
+ | type | string | string | int |
154
+ | details | <ul><li>min: 193 characters</li><li>mean: 661.12 characters</li><li>max: 2255 characters</li></ul> | <ul><li>min: 556 characters</li><li>mean: 1161.45 characters</li><li>max: 1836 characters</li></ul> | <ul><li>0: ~76.70%</li><li>1: ~23.30%</li></ul> |
155
+ * Samples:
156
+ | query | result | label |
157
+ |:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------|
158
+ | <code>Executive, Inflight Operation Team Lead - As an Executive, Inflight Operation Team Lead, you will excel in leading the Hi-Lift Driver team by demonstrating expertise in operational leadership while upholding high standards of safety and quality. You will contribute significantly to departmental and company objectives, ensuring strict adherence to industry regulations and operational procedures. Your role champions driving excellence in operational and engineering tasks, supporting initiatives to boost efficiency, and collaborating with cross-functional teams to achieve organizational goals, all while ensuring compliance with regulatory and organizational standards. - Air Transport</code> | <code>Executive (Baggage Services) - The Executive (Baggage Services) is the main point of contact for airline representatives, authorities and other departments to determine baggage handling standards and manpower requirements for baggage handling operations. He/She develops effective plans to mitigate compliance failures. He collates statistics and reports on baggage handling failures and is in charge of implementing new procedures to reduce these occurrences. He enforces a safety and security culture in the workplace and addresses internal or external audit issues. He is responsible for developing on-the-job training programmes and workplace learning plans.<br><br>A sound knowledge of airport operations, baggage handling procedures and associated administrative procedures to process requests and documentation is expected of the Executive (Baggage Services). He has good stakeholder management skills to handle internal and external stakeholders effectively. He is adept at customer service to hand...</code> | <code>0</code> |
159
+ | <code>VP of Passenger Experience - Customer Experience Strategy Development: Define and implement long-term customer service strategies aligned with brand positioning, regulatory requirements, and evolving passenger expectations.<br><br>Leadership and Department Oversight: Lead cross-functional teams across reservations, in-flight services, airport staff, and support centers to ensure unified and consistent service delivery across all locations.<br><br>Performance and Quality Management: Monitor customer service KPIs (NPS, CSAT, FCR), oversee service audits, and implement improvements based on passenger feedback and competitive benchmarking. - Air Transport</code> | <code>Vice President - The Vice President formulates strategies for passenger services operations to ensure performance levels are aligned with evolving customer needs, technological advancements, airline brand identity and business objectives. He/She is responsible for spearheading continuous improvement and business development initiatives to improve the profitability of the business. He leads the development of Standard Operating Procedures (SOPs) and systems to mitigate safety and/or security risks and plays a pivotal role in influencing organisational development by making key decisions on human capital needs.<br><br>The Vice President for Passenger Services possesses exceptional leadership and stakeholder management skills to cultivate local and international relationships with senior stakeholders in the industry and further the business objectives of the organisation. Furthermore, he has an in-depth knowledge of the airports organisational structures, ground handling operations and safety a...</code> | <code>1</code> |
160
+ | <code>Guest Services Assistant - As a Guest Services Assistant, you will uphold the highest standards of safety, security, and service in our ground operations. Your responsibilities will include verifying documentation through passport checks and ID matches, managing reservations, and issuing boarding passes. You will ensure smooth luggage tagging and inspections, oversee guest boarding and deplaning processes, and provide specialized assistance for guests with special needs, such as wheelchair services. Handling important documents like GD and crew sign-ins, as well as collecting immigration cards, is essential. You must also conduct the mandatory security questionnaire for passengers checking in luggage. Compliance with airline standards, safety protocols, and applicable laws in all operational areas is paramount. - Air Transport</code> | <code>Executive (Ground Services) - The Executive (Ground Services) oversees activities relating to the audit of ground service standards and develops partnerships with a wide range of audiences and stakeholders. He/She reviews service standard agreements and develops action plans to improve service efficiency for the airline. He conducts rate reviews for ground handlers regarding pricing, service quality and scope of work. In addition, he also plans cost estimates for contracts with ground handling organisations. He proposes process changes to improve passenger safety and/or security standards. He contributes to organisational development by developing on-the-job training programmes and workplace learning plans.<br><br>With foresight and highly developed analytical capabilities, the Executive (Ground Services) is able to develop action plans and provide recommendations to address any identified service delivery gaps. He develops strong rapport with the stakeholders by recognising their interests ...</code> | <code>0</code> |
161
+ * Loss: [<code>CrossEntropyLoss</code>](https://sbert.net/docs/package_reference/cross_encoder/losses.html#crossentropyloss)
162
+
163
+ ### Evaluation Dataset
164
+
165
+ #### cxs-eval-dataset
166
+
167
+ * Dataset: [cxs-eval-dataset](https://huggingface.co/datasets/dnth/cxs-eval-dataset) at [a3a8a16](https://huggingface.co/datasets/dnth/cxs-eval-dataset/tree/a3a8a167ee1ca5fd48f6c99cff7b61fc756fbb3f)
168
+ * Size: 141 evaluation samples
169
+ * Columns: <code>query</code>, <code>result</code>, and <code>label</code>
170
+ * Approximate statistics based on the first 141 samples:
171
+ | | query | result | label |
172
+ |:--------|:---------------------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------|:------------------------------------------------|
173
+ | type | string | string | int |
174
+ | details | <ul><li>min: 193 characters</li><li>mean: 655.76 characters</li><li>max: 2255 characters</li></ul> | <ul><li>min: 556 characters</li><li>mean: 1091.27 characters</li><li>max: 1810 characters</li></ul> | <ul><li>0: ~77.30%</li><li>1: ~22.70%</li></ul> |
175
+ * Samples:
176
+ | query | result | label |
177
+ |:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------|
178
+ | <code>Inflight Quality Assurance Inspector - As an Inflight Quality Assurance Inspector at Santan, you are tasked with ensuring the accuracy and efficiency of meal and product handling for inflight services. Your responsibilities include verifying cart quantities, monitoring picking and staging processes, and maintaining inventory through CMS and WMS systems. You'll oversee packers and warehouse agents, manage stock adjustments and wastage reporting, and ensure compliance with FIFO and quality assurance standards. Key tasks include verifying outbound and inbound cart quantities, ensuring accurate meal picking, confirming seal and flight details in CMS, performing inventory counts, monitoring product packing, and facilitating effective communication between all parties involved. Additionally, you'll manage stock movements, update supervisors on stock issues, and ensure timely mass production by Packers or Warehouse Agents. - Air Transport</code> | <code>Manager (Cargo) - The Manager (Cargo) leads the development of operational strategies to improve the effectiveness of air cargo and warehouse operations. He/She is responsible for driving continuous improvement and business development initiatives to optimise processes and meet customer needs. He develops Standard Operating Procedures (SOPs) and systems to mitigate safety and/or security risks and monitors the adherence to safety and/or security standards. He also advises authorities and other teams to manage incidents involving hazardous materials and/or dangerous goods. He develops the teams technical capabilities and maintains positive morale within the teams.<br><br>The Manager (Cargo) has an extensive knowledge of cargo handling procedures, international airfreight regulations and policy requirements. He is also well-versed in organisational safety regulations and practices. He possesses strong interpersonal and stakeholder management skills in order to build and maintain stakeholder re...</code> | <code>0</code> |
179
+ | <code>Manager, Pricing Systems - Manage the systems and tools that power our pricing and revenue management strategies. As Manager of Pricing Systems, you will oversee the implementation, maintenance, and enhancement of our pricing engines and related software. You'll work with IT and commercial teams to ensure system integrity, support new pricing initiatives, and troubleshoot issues. Technical expertise in airline pricing systems is a must. - Air Transport</code> | <code>Terminal Manager - The Terminal Manager is responsible for planning the delivery of airport services and implementing projects to improve the airport user experience. He/She collaborates with internal and external stakeholders to develop and establish compliance standards for airport operations. Besides leading research to identify new service offerings for the airport, he also enforces safety and/or security standards for the organisation. He sources for contractors to perform airport operations and establishes key indicators to track their performances. He also develops on-the-job training programmes for the department.<br><br>The Terminal Manager possesses a sound knowledge of airport and airside operations, Standard Operating Procedures (SOPs) and airport security and emergency response plans. He travels across the terminals to review operational processes and resolve incidents at the airport. To ensure operational plans and projects are implemented efficiently and effectively, he displa...</code> | <code>0</code> |
180
+ | <code>Executive, Inflight Operation Team Lead - As an Executive, Inflight Operation Team Lead, you will excel in leading the Hi-Lift Driver team by demonstrating expertise in operational leadership while upholding high standards of safety and quality. You will contribute significantly to departmental and company objectives, ensuring strict adherence to industry regulations and operational procedures. Your role champions driving excellence in operational and engineering tasks, supporting initiatives to boost efficiency, and collaborating with cross-functional teams to achieve organizational goals, all while ensuring compliance with regulatory and organizational standards. - Air Transport</code> | <code>Executive (Ramp / Technical Ramp) - The Executive (Ramp/Technical Ramp) performs regular audits to ensure that ramp/technical ramp operations comply with Standard Operating Procedures (SOPs). He/She recommends enhancements to address areas of non-compliance, improves operational efficiencies and develops action plans to prevent safety and/or security issues from recurring. He also develops workplace learning plans and on-the-job training programmes to enhance the capabilities of teams.<br> <br>The Executive (Ramp/Technical Ramp) has a sound knowledge of airport operations and ramp/technical ramp procedures. He also has strong analytical abilities to identify root causes of ground handling related delays and their potential implications. He is results-oriented and possesses excellent communication and interpersonal skills. Furthermore, he has strong computer literacy skills and competence in learning orientation to determine learning and developmental needs of his team members.</code> | <code>0</code> |
181
+ * Loss: [<code>CrossEntropyLoss</code>](https://sbert.net/docs/package_reference/cross_encoder/losses.html#crossentropyloss)
182
+
183
+ ### Training Hyperparameters
184
+ #### Non-Default Hyperparameters
185
+
186
+ - `eval_strategy`: epoch
187
+ - `per_device_train_batch_size`: 16
188
+ - `per_device_eval_batch_size`: 32
189
+ - `learning_rate`: 4e-05
190
+ - `weight_decay`: 0.01
191
+ - `num_train_epochs`: 10
192
+ - `warmup_ratio`: 0.1
193
+ - `bf16`: True
194
+ - `load_best_model_at_end`: True
195
+
196
+ #### All Hyperparameters
197
+ <details><summary>Click to expand</summary>
198
+
199
+ - `overwrite_output_dir`: False
200
+ - `do_predict`: False
201
+ - `eval_strategy`: epoch
202
+ - `prediction_loss_only`: True
203
+ - `per_device_train_batch_size`: 16
204
+ - `per_device_eval_batch_size`: 32
205
+ - `per_gpu_train_batch_size`: None
206
+ - `per_gpu_eval_batch_size`: None
207
+ - `gradient_accumulation_steps`: 1
208
+ - `eval_accumulation_steps`: None
209
+ - `torch_empty_cache_steps`: None
210
+ - `learning_rate`: 4e-05
211
+ - `weight_decay`: 0.01
212
+ - `adam_beta1`: 0.9
213
+ - `adam_beta2`: 0.999
214
+ - `adam_epsilon`: 1e-08
215
+ - `max_grad_norm`: 1.0
216
+ - `num_train_epochs`: 10
217
+ - `max_steps`: -1
218
+ - `lr_scheduler_type`: linear
219
+ - `lr_scheduler_kwargs`: {}
220
+ - `warmup_ratio`: 0.1
221
+ - `warmup_steps`: 0
222
+ - `log_level`: passive
223
+ - `log_level_replica`: warning
224
+ - `log_on_each_node`: True
225
+ - `logging_nan_inf_filter`: True
226
+ - `save_safetensors`: True
227
+ - `save_on_each_node`: False
228
+ - `save_only_model`: False
229
+ - `restore_callback_states_from_checkpoint`: False
230
+ - `no_cuda`: False
231
+ - `use_cpu`: False
232
+ - `use_mps_device`: False
233
+ - `seed`: 42
234
+ - `data_seed`: None
235
+ - `jit_mode_eval`: False
236
+ - `use_ipex`: False
237
+ - `bf16`: True
238
+ - `fp16`: False
239
+ - `fp16_opt_level`: O1
240
+ - `half_precision_backend`: auto
241
+ - `bf16_full_eval`: False
242
+ - `fp16_full_eval`: False
243
+ - `tf32`: None
244
+ - `local_rank`: 0
245
+ - `ddp_backend`: None
246
+ - `tpu_num_cores`: None
247
+ - `tpu_metrics_debug`: False
248
+ - `debug`: []
249
+ - `dataloader_drop_last`: False
250
+ - `dataloader_num_workers`: 0
251
+ - `dataloader_prefetch_factor`: None
252
+ - `past_index`: -1
253
+ - `disable_tqdm`: False
254
+ - `remove_unused_columns`: True
255
+ - `label_names`: None
256
+ - `load_best_model_at_end`: True
257
+ - `ignore_data_skip`: False
258
+ - `fsdp`: []
259
+ - `fsdp_min_num_params`: 0
260
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
261
+ - `fsdp_transformer_layer_cls_to_wrap`: None
262
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
263
+ - `deepspeed`: None
264
+ - `label_smoothing_factor`: 0.0
265
+ - `optim`: adamw_torch
266
+ - `optim_args`: None
267
+ - `adafactor`: False
268
+ - `group_by_length`: False
269
+ - `length_column_name`: length
270
+ - `ddp_find_unused_parameters`: None
271
+ - `ddp_bucket_cap_mb`: None
272
+ - `ddp_broadcast_buffers`: False
273
+ - `dataloader_pin_memory`: True
274
+ - `dataloader_persistent_workers`: False
275
+ - `skip_memory_metrics`: True
276
+ - `use_legacy_prediction_loop`: False
277
+ - `push_to_hub`: False
278
+ - `resume_from_checkpoint`: None
279
+ - `hub_model_id`: None
280
+ - `hub_strategy`: every_save
281
+ - `hub_private_repo`: None
282
+ - `hub_always_push`: False
283
+ - `hub_revision`: None
284
+ - `gradient_checkpointing`: False
285
+ - `gradient_checkpointing_kwargs`: None
286
+ - `include_inputs_for_metrics`: False
287
+ - `include_for_metrics`: []
288
+ - `eval_do_concat_batches`: True
289
+ - `fp16_backend`: auto
290
+ - `push_to_hub_model_id`: None
291
+ - `push_to_hub_organization`: None
292
+ - `mp_parameters`:
293
+ - `auto_find_batch_size`: False
294
+ - `full_determinism`: False
295
+ - `torchdynamo`: None
296
+ - `ray_scope`: last
297
+ - `ddp_timeout`: 1800
298
+ - `torch_compile`: False
299
+ - `torch_compile_backend`: None
300
+ - `torch_compile_mode`: None
301
+ - `include_tokens_per_second`: False
302
+ - `include_num_input_tokens_seen`: False
303
+ - `neftune_noise_alpha`: None
304
+ - `optim_target_modules`: None
305
+ - `batch_eval_metrics`: False
306
+ - `eval_on_start`: False
307
+ - `use_liger_kernel`: False
308
+ - `liger_kernel_config`: None
309
+ - `eval_use_gather_object`: False
310
+ - `average_tokens_across_devices`: False
311
+ - `prompts`: None
312
+ - `batch_sampler`: batch_sampler
313
+ - `multi_dataset_batch_sampler`: proportional
314
+
315
+ </details>
316
+
317
+ ### Training Logs
318
+ | Epoch | Step | Training Loss | Validation Loss | validation-set_f1_macro |
319
+ |:-------:|:-------:|:-------------:|:---------------:|:-----------------------:|
320
+ | 0.3521 | 25 | 0.5365 | - | - |
321
+ | 0.7042 | 50 | 0.5596 | - | - |
322
+ | 1.0 | 71 | - | 0.5498 | 0.5324 |
323
+ | 1.0563 | 75 | 0.5252 | - | - |
324
+ | 1.4085 | 100 | 0.4624 | - | - |
325
+ | 1.7606 | 125 | 0.4534 | - | - |
326
+ | 2.0 | 142 | - | 0.5547 | 0.4649 |
327
+ | 2.1127 | 150 | 0.4883 | - | - |
328
+ | 2.4648 | 175 | 0.3718 | - | - |
329
+ | 2.8169 | 200 | 0.4136 | - | - |
330
+ | 3.0 | 213 | - | 0.4693 | 0.5576 |
331
+ | 3.1690 | 225 | 0.3431 | - | - |
332
+ | 3.5211 | 250 | 0.2573 | - | - |
333
+ | 3.8732 | 275 | 0.3497 | - | - |
334
+ | 4.0 | 284 | - | 0.3733 | 0.7594 |
335
+ | 4.2254 | 300 | 0.2694 | - | - |
336
+ | 4.5775 | 325 | 0.2604 | - | - |
337
+ | 4.9296 | 350 | 0.1834 | - | - |
338
+ | 5.0 | 355 | - | 0.5547 | 0.7321 |
339
+ | 5.2817 | 375 | 0.1862 | - | - |
340
+ | 5.6338 | 400 | 0.1219 | - | - |
341
+ | 5.9859 | 425 | 0.1605 | - | - |
342
+ | 6.0 | 426 | - | 0.7399 | 0.7177 |
343
+ | 6.3380 | 450 | 0.1093 | - | - |
344
+ | 6.6901 | 475 | 0.1003 | - | - |
345
+ | 7.0 | 497 | - | 1.2497 | 0.7320 |
346
+ | 7.0423 | 500 | 0.0885 | - | - |
347
+ | 7.3944 | 525 | 0.0698 | - | - |
348
+ | 7.7465 | 550 | 0.0627 | - | - |
349
+ | **8.0** | **568** | **-** | **1.2357** | **0.7749** |
350
+ | 8.0986 | 575 | 0.0088 | - | - |
351
+ | 8.4507 | 600 | 0.0001 | - | - |
352
+ | 8.8028 | 625 | 0.0296 | - | - |
353
+ | 9.0 | 639 | - | 1.6408 | 0.7530 |
354
+ | 9.1549 | 650 | 0.0098 | - | - |
355
+ | 9.5070 | 675 | 0.0124 | - | - |
356
+ | 9.8592 | 700 | 0.0012 | - | - |
357
+ | 10.0 | 710 | - | 1.6398 | 0.7481 |
358
+
359
+ * The bold row denotes the saved checkpoint.
360
+
361
+ ### Framework Versions
362
+ - Python: 3.12.11
363
+ - Sentence Transformers: 4.1.0
364
+ - Transformers: 4.53.0
365
+ - PyTorch: 2.7.1+cu126
366
+ - Accelerate: 1.8.1
367
+ - Datasets: 3.6.0
368
+ - Tokenizers: 0.21.2
369
+
370
+ ## Citation
371
+
372
+ ### BibTeX
373
+
374
+ #### Sentence Transformers
375
+ ```bibtex
376
+ @inproceedings{reimers-2019-sentence-bert,
377
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
378
+ author = "Reimers, Nils and Gurevych, Iryna",
379
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
380
+ month = "11",
381
+ year = "2019",
382
+ publisher = "Association for Computational Linguistics",
383
+ url = "https://arxiv.org/abs/1908.10084",
384
+ }
385
+ ```
386
+
387
+ <!--
388
+ ## Glossary
389
+
390
+ *Clearly define terms in order to be accessible across audiences.*
391
+ -->
392
+
393
+ <!--
394
+ ## Model Card Authors
395
+
396
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
397
+ -->
398
+
399
+ <!--
400
+ ## Model Card Contact
401
+
402
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
403
+ -->
config.json ADDED
@@ -0,0 +1,49 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "architectures": [
3
+ "ModernBertForSequenceClassification"
4
+ ],
5
+ "attention_bias": false,
6
+ "attention_dropout": 0.0,
7
+ "bos_token_id": 50281,
8
+ "classifier_activation": "gelu",
9
+ "classifier_bias": false,
10
+ "classifier_dropout": 0.0,
11
+ "classifier_pooling": "mean",
12
+ "cls_token_id": 50281,
13
+ "decoder_bias": true,
14
+ "deterministic_flash_attn": false,
15
+ "embedding_dropout": 0.0,
16
+ "eos_token_id": 50282,
17
+ "global_attn_every_n_layers": 3,
18
+ "global_rope_theta": 160000.0,
19
+ "gradient_checkpointing": false,
20
+ "hidden_activation": "gelu",
21
+ "hidden_size": 768,
22
+ "initializer_cutoff_factor": 2.0,
23
+ "initializer_range": 0.02,
24
+ "intermediate_size": 1152,
25
+ "layer_norm_eps": 1e-05,
26
+ "local_attention": 128,
27
+ "local_rope_theta": 10000.0,
28
+ "max_position_embeddings": 8192,
29
+ "mlp_bias": false,
30
+ "mlp_dropout": 0.0,
31
+ "model_type": "modernbert",
32
+ "norm_bias": false,
33
+ "norm_eps": 1e-05,
34
+ "num_attention_heads": 12,
35
+ "num_hidden_layers": 22,
36
+ "pad_token_id": 50283,
37
+ "position_embedding_type": "absolute",
38
+ "repad_logits_with_grad": false,
39
+ "sentence_transformers": {
40
+ "activation_fn": "torch.nn.modules.linear.Identity",
41
+ "version": "4.1.0"
42
+ },
43
+ "sep_token_id": 50282,
44
+ "sparse_pred_ignore_index": -100,
45
+ "sparse_prediction": false,
46
+ "torch_dtype": "float32",
47
+ "transformers_version": "4.53.0",
48
+ "vocab_size": 50368
49
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3d0018266efbf34818d0deb715a25dfcd75f1313d2194a47a39e64bfa98458b2
3
+ size 598439784
special_tokens_map.json ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "cls_token": {
3
+ "content": "[CLS]",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "mask_token": {
10
+ "content": "[MASK]",
11
+ "lstrip": true,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "pad_token": {
17
+ "content": "[PAD]",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "sep_token": {
24
+ "content": "[SEP]",
25
+ "lstrip": false,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "unk_token": {
31
+ "content": "[UNK]",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ }
37
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,945 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "|||IP_ADDRESS|||",
5
+ "lstrip": false,
6
+ "normalized": true,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": false
10
+ },
11
+ "1": {
12
+ "content": "<|padding|>",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "50254": {
20
+ "content": " ",
21
+ "lstrip": false,
22
+ "normalized": true,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": false
26
+ },
27
+ "50255": {
28
+ "content": " ",
29
+ "lstrip": false,
30
+ "normalized": true,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": false
34
+ },
35
+ "50256": {
36
+ "content": " ",
37
+ "lstrip": false,
38
+ "normalized": true,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": false
42
+ },
43
+ "50257": {
44
+ "content": " ",
45
+ "lstrip": false,
46
+ "normalized": true,
47
+ "rstrip": false,
48
+ "single_word": false,
49
+ "special": false
50
+ },
51
+ "50258": {
52
+ "content": " ",
53
+ "lstrip": false,
54
+ "normalized": true,
55
+ "rstrip": false,
56
+ "single_word": false,
57
+ "special": false
58
+ },
59
+ "50259": {
60
+ "content": " ",
61
+ "lstrip": false,
62
+ "normalized": true,
63
+ "rstrip": false,
64
+ "single_word": false,
65
+ "special": false
66
+ },
67
+ "50260": {
68
+ "content": " ",
69
+ "lstrip": false,
70
+ "normalized": true,
71
+ "rstrip": false,
72
+ "single_word": false,
73
+ "special": false
74
+ },
75
+ "50261": {
76
+ "content": " ",
77
+ "lstrip": false,
78
+ "normalized": true,
79
+ "rstrip": false,
80
+ "single_word": false,
81
+ "special": false
82
+ },
83
+ "50262": {
84
+ "content": " ",
85
+ "lstrip": false,
86
+ "normalized": true,
87
+ "rstrip": false,
88
+ "single_word": false,
89
+ "special": false
90
+ },
91
+ "50263": {
92
+ "content": " ",
93
+ "lstrip": false,
94
+ "normalized": true,
95
+ "rstrip": false,
96
+ "single_word": false,
97
+ "special": false
98
+ },
99
+ "50264": {
100
+ "content": " ",
101
+ "lstrip": false,
102
+ "normalized": true,
103
+ "rstrip": false,
104
+ "single_word": false,
105
+ "special": false
106
+ },
107
+ "50265": {
108
+ "content": " ",
109
+ "lstrip": false,
110
+ "normalized": true,
111
+ "rstrip": false,
112
+ "single_word": false,
113
+ "special": false
114
+ },
115
+ "50266": {
116
+ "content": " ",
117
+ "lstrip": false,
118
+ "normalized": true,
119
+ "rstrip": false,
120
+ "single_word": false,
121
+ "special": false
122
+ },
123
+ "50267": {
124
+ "content": " ",
125
+ "lstrip": false,
126
+ "normalized": true,
127
+ "rstrip": false,
128
+ "single_word": false,
129
+ "special": false
130
+ },
131
+ "50268": {
132
+ "content": " ",
133
+ "lstrip": false,
134
+ "normalized": true,
135
+ "rstrip": false,
136
+ "single_word": false,
137
+ "special": false
138
+ },
139
+ "50269": {
140
+ "content": " ",
141
+ "lstrip": false,
142
+ "normalized": true,
143
+ "rstrip": false,
144
+ "single_word": false,
145
+ "special": false
146
+ },
147
+ "50270": {
148
+ "content": " ",
149
+ "lstrip": false,
150
+ "normalized": true,
151
+ "rstrip": false,
152
+ "single_word": false,
153
+ "special": false
154
+ },
155
+ "50271": {
156
+ "content": " ",
157
+ "lstrip": false,
158
+ "normalized": true,
159
+ "rstrip": false,
160
+ "single_word": false,
161
+ "special": false
162
+ },
163
+ "50272": {
164
+ "content": " ",
165
+ "lstrip": false,
166
+ "normalized": true,
167
+ "rstrip": false,
168
+ "single_word": false,
169
+ "special": false
170
+ },
171
+ "50273": {
172
+ "content": " ",
173
+ "lstrip": false,
174
+ "normalized": true,
175
+ "rstrip": false,
176
+ "single_word": false,
177
+ "special": false
178
+ },
179
+ "50274": {
180
+ "content": " ",
181
+ "lstrip": false,
182
+ "normalized": true,
183
+ "rstrip": false,
184
+ "single_word": false,
185
+ "special": false
186
+ },
187
+ "50275": {
188
+ "content": " ",
189
+ "lstrip": false,
190
+ "normalized": true,
191
+ "rstrip": false,
192
+ "single_word": false,
193
+ "special": false
194
+ },
195
+ "50276": {
196
+ "content": " ",
197
+ "lstrip": false,
198
+ "normalized": true,
199
+ "rstrip": false,
200
+ "single_word": false,
201
+ "special": false
202
+ },
203
+ "50277": {
204
+ "content": "|||EMAIL_ADDRESS|||",
205
+ "lstrip": false,
206
+ "normalized": true,
207
+ "rstrip": false,
208
+ "single_word": false,
209
+ "special": false
210
+ },
211
+ "50278": {
212
+ "content": "|||PHONE_NUMBER|||",
213
+ "lstrip": false,
214
+ "normalized": true,
215
+ "rstrip": false,
216
+ "single_word": false,
217
+ "special": false
218
+ },
219
+ "50279": {
220
+ "content": "<|endoftext|>",
221
+ "lstrip": false,
222
+ "normalized": false,
223
+ "rstrip": false,
224
+ "single_word": false,
225
+ "special": true
226
+ },
227
+ "50280": {
228
+ "content": "[UNK]",
229
+ "lstrip": false,
230
+ "normalized": false,
231
+ "rstrip": false,
232
+ "single_word": false,
233
+ "special": true
234
+ },
235
+ "50281": {
236
+ "content": "[CLS]",
237
+ "lstrip": false,
238
+ "normalized": false,
239
+ "rstrip": false,
240
+ "single_word": false,
241
+ "special": true
242
+ },
243
+ "50282": {
244
+ "content": "[SEP]",
245
+ "lstrip": false,
246
+ "normalized": false,
247
+ "rstrip": false,
248
+ "single_word": false,
249
+ "special": true
250
+ },
251
+ "50283": {
252
+ "content": "[PAD]",
253
+ "lstrip": false,
254
+ "normalized": false,
255
+ "rstrip": false,
256
+ "single_word": false,
257
+ "special": true
258
+ },
259
+ "50284": {
260
+ "content": "[MASK]",
261
+ "lstrip": true,
262
+ "normalized": false,
263
+ "rstrip": false,
264
+ "single_word": false,
265
+ "special": true
266
+ },
267
+ "50285": {
268
+ "content": "[unused0]",
269
+ "lstrip": false,
270
+ "normalized": true,
271
+ "rstrip": false,
272
+ "single_word": false,
273
+ "special": false
274
+ },
275
+ "50286": {
276
+ "content": "[unused1]",
277
+ "lstrip": false,
278
+ "normalized": true,
279
+ "rstrip": false,
280
+ "single_word": false,
281
+ "special": false
282
+ },
283
+ "50287": {
284
+ "content": "[unused2]",
285
+ "lstrip": false,
286
+ "normalized": true,
287
+ "rstrip": false,
288
+ "single_word": false,
289
+ "special": false
290
+ },
291
+ "50288": {
292
+ "content": "[unused3]",
293
+ "lstrip": false,
294
+ "normalized": true,
295
+ "rstrip": false,
296
+ "single_word": false,
297
+ "special": false
298
+ },
299
+ "50289": {
300
+ "content": "[unused4]",
301
+ "lstrip": false,
302
+ "normalized": true,
303
+ "rstrip": false,
304
+ "single_word": false,
305
+ "special": false
306
+ },
307
+ "50290": {
308
+ "content": "[unused5]",
309
+ "lstrip": false,
310
+ "normalized": true,
311
+ "rstrip": false,
312
+ "single_word": false,
313
+ "special": false
314
+ },
315
+ "50291": {
316
+ "content": "[unused6]",
317
+ "lstrip": false,
318
+ "normalized": true,
319
+ "rstrip": false,
320
+ "single_word": false,
321
+ "special": false
322
+ },
323
+ "50292": {
324
+ "content": "[unused7]",
325
+ "lstrip": false,
326
+ "normalized": true,
327
+ "rstrip": false,
328
+ "single_word": false,
329
+ "special": false
330
+ },
331
+ "50293": {
332
+ "content": "[unused8]",
333
+ "lstrip": false,
334
+ "normalized": true,
335
+ "rstrip": false,
336
+ "single_word": false,
337
+ "special": false
338
+ },
339
+ "50294": {
340
+ "content": "[unused9]",
341
+ "lstrip": false,
342
+ "normalized": true,
343
+ "rstrip": false,
344
+ "single_word": false,
345
+ "special": false
346
+ },
347
+ "50295": {
348
+ "content": "[unused10]",
349
+ "lstrip": false,
350
+ "normalized": true,
351
+ "rstrip": false,
352
+ "single_word": false,
353
+ "special": false
354
+ },
355
+ "50296": {
356
+ "content": "[unused11]",
357
+ "lstrip": false,
358
+ "normalized": true,
359
+ "rstrip": false,
360
+ "single_word": false,
361
+ "special": false
362
+ },
363
+ "50297": {
364
+ "content": "[unused12]",
365
+ "lstrip": false,
366
+ "normalized": true,
367
+ "rstrip": false,
368
+ "single_word": false,
369
+ "special": false
370
+ },
371
+ "50298": {
372
+ "content": "[unused13]",
373
+ "lstrip": false,
374
+ "normalized": true,
375
+ "rstrip": false,
376
+ "single_word": false,
377
+ "special": false
378
+ },
379
+ "50299": {
380
+ "content": "[unused14]",
381
+ "lstrip": false,
382
+ "normalized": true,
383
+ "rstrip": false,
384
+ "single_word": false,
385
+ "special": false
386
+ },
387
+ "50300": {
388
+ "content": "[unused15]",
389
+ "lstrip": false,
390
+ "normalized": true,
391
+ "rstrip": false,
392
+ "single_word": false,
393
+ "special": false
394
+ },
395
+ "50301": {
396
+ "content": "[unused16]",
397
+ "lstrip": false,
398
+ "normalized": true,
399
+ "rstrip": false,
400
+ "single_word": false,
401
+ "special": false
402
+ },
403
+ "50302": {
404
+ "content": "[unused17]",
405
+ "lstrip": false,
406
+ "normalized": true,
407
+ "rstrip": false,
408
+ "single_word": false,
409
+ "special": false
410
+ },
411
+ "50303": {
412
+ "content": "[unused18]",
413
+ "lstrip": false,
414
+ "normalized": true,
415
+ "rstrip": false,
416
+ "single_word": false,
417
+ "special": false
418
+ },
419
+ "50304": {
420
+ "content": "[unused19]",
421
+ "lstrip": false,
422
+ "normalized": true,
423
+ "rstrip": false,
424
+ "single_word": false,
425
+ "special": false
426
+ },
427
+ "50305": {
428
+ "content": "[unused20]",
429
+ "lstrip": false,
430
+ "normalized": true,
431
+ "rstrip": false,
432
+ "single_word": false,
433
+ "special": false
434
+ },
435
+ "50306": {
436
+ "content": "[unused21]",
437
+ "lstrip": false,
438
+ "normalized": true,
439
+ "rstrip": false,
440
+ "single_word": false,
441
+ "special": false
442
+ },
443
+ "50307": {
444
+ "content": "[unused22]",
445
+ "lstrip": false,
446
+ "normalized": true,
447
+ "rstrip": false,
448
+ "single_word": false,
449
+ "special": false
450
+ },
451
+ "50308": {
452
+ "content": "[unused23]",
453
+ "lstrip": false,
454
+ "normalized": true,
455
+ "rstrip": false,
456
+ "single_word": false,
457
+ "special": false
458
+ },
459
+ "50309": {
460
+ "content": "[unused24]",
461
+ "lstrip": false,
462
+ "normalized": true,
463
+ "rstrip": false,
464
+ "single_word": false,
465
+ "special": false
466
+ },
467
+ "50310": {
468
+ "content": "[unused25]",
469
+ "lstrip": false,
470
+ "normalized": true,
471
+ "rstrip": false,
472
+ "single_word": false,
473
+ "special": false
474
+ },
475
+ "50311": {
476
+ "content": "[unused26]",
477
+ "lstrip": false,
478
+ "normalized": true,
479
+ "rstrip": false,
480
+ "single_word": false,
481
+ "special": false
482
+ },
483
+ "50312": {
484
+ "content": "[unused27]",
485
+ "lstrip": false,
486
+ "normalized": true,
487
+ "rstrip": false,
488
+ "single_word": false,
489
+ "special": false
490
+ },
491
+ "50313": {
492
+ "content": "[unused28]",
493
+ "lstrip": false,
494
+ "normalized": true,
495
+ "rstrip": false,
496
+ "single_word": false,
497
+ "special": false
498
+ },
499
+ "50314": {
500
+ "content": "[unused29]",
501
+ "lstrip": false,
502
+ "normalized": true,
503
+ "rstrip": false,
504
+ "single_word": false,
505
+ "special": false
506
+ },
507
+ "50315": {
508
+ "content": "[unused30]",
509
+ "lstrip": false,
510
+ "normalized": true,
511
+ "rstrip": false,
512
+ "single_word": false,
513
+ "special": false
514
+ },
515
+ "50316": {
516
+ "content": "[unused31]",
517
+ "lstrip": false,
518
+ "normalized": true,
519
+ "rstrip": false,
520
+ "single_word": false,
521
+ "special": false
522
+ },
523
+ "50317": {
524
+ "content": "[unused32]",
525
+ "lstrip": false,
526
+ "normalized": true,
527
+ "rstrip": false,
528
+ "single_word": false,
529
+ "special": false
530
+ },
531
+ "50318": {
532
+ "content": "[unused33]",
533
+ "lstrip": false,
534
+ "normalized": true,
535
+ "rstrip": false,
536
+ "single_word": false,
537
+ "special": false
538
+ },
539
+ "50319": {
540
+ "content": "[unused34]",
541
+ "lstrip": false,
542
+ "normalized": true,
543
+ "rstrip": false,
544
+ "single_word": false,
545
+ "special": false
546
+ },
547
+ "50320": {
548
+ "content": "[unused35]",
549
+ "lstrip": false,
550
+ "normalized": true,
551
+ "rstrip": false,
552
+ "single_word": false,
553
+ "special": false
554
+ },
555
+ "50321": {
556
+ "content": "[unused36]",
557
+ "lstrip": false,
558
+ "normalized": true,
559
+ "rstrip": false,
560
+ "single_word": false,
561
+ "special": false
562
+ },
563
+ "50322": {
564
+ "content": "[unused37]",
565
+ "lstrip": false,
566
+ "normalized": true,
567
+ "rstrip": false,
568
+ "single_word": false,
569
+ "special": false
570
+ },
571
+ "50323": {
572
+ "content": "[unused38]",
573
+ "lstrip": false,
574
+ "normalized": true,
575
+ "rstrip": false,
576
+ "single_word": false,
577
+ "special": false
578
+ },
579
+ "50324": {
580
+ "content": "[unused39]",
581
+ "lstrip": false,
582
+ "normalized": true,
583
+ "rstrip": false,
584
+ "single_word": false,
585
+ "special": false
586
+ },
587
+ "50325": {
588
+ "content": "[unused40]",
589
+ "lstrip": false,
590
+ "normalized": true,
591
+ "rstrip": false,
592
+ "single_word": false,
593
+ "special": false
594
+ },
595
+ "50326": {
596
+ "content": "[unused41]",
597
+ "lstrip": false,
598
+ "normalized": true,
599
+ "rstrip": false,
600
+ "single_word": false,
601
+ "special": false
602
+ },
603
+ "50327": {
604
+ "content": "[unused42]",
605
+ "lstrip": false,
606
+ "normalized": true,
607
+ "rstrip": false,
608
+ "single_word": false,
609
+ "special": false
610
+ },
611
+ "50328": {
612
+ "content": "[unused43]",
613
+ "lstrip": false,
614
+ "normalized": true,
615
+ "rstrip": false,
616
+ "single_word": false,
617
+ "special": false
618
+ },
619
+ "50329": {
620
+ "content": "[unused44]",
621
+ "lstrip": false,
622
+ "normalized": true,
623
+ "rstrip": false,
624
+ "single_word": false,
625
+ "special": false
626
+ },
627
+ "50330": {
628
+ "content": "[unused45]",
629
+ "lstrip": false,
630
+ "normalized": true,
631
+ "rstrip": false,
632
+ "single_word": false,
633
+ "special": false
634
+ },
635
+ "50331": {
636
+ "content": "[unused46]",
637
+ "lstrip": false,
638
+ "normalized": true,
639
+ "rstrip": false,
640
+ "single_word": false,
641
+ "special": false
642
+ },
643
+ "50332": {
644
+ "content": "[unused47]",
645
+ "lstrip": false,
646
+ "normalized": true,
647
+ "rstrip": false,
648
+ "single_word": false,
649
+ "special": false
650
+ },
651
+ "50333": {
652
+ "content": "[unused48]",
653
+ "lstrip": false,
654
+ "normalized": true,
655
+ "rstrip": false,
656
+ "single_word": false,
657
+ "special": false
658
+ },
659
+ "50334": {
660
+ "content": "[unused49]",
661
+ "lstrip": false,
662
+ "normalized": true,
663
+ "rstrip": false,
664
+ "single_word": false,
665
+ "special": false
666
+ },
667
+ "50335": {
668
+ "content": "[unused50]",
669
+ "lstrip": false,
670
+ "normalized": true,
671
+ "rstrip": false,
672
+ "single_word": false,
673
+ "special": false
674
+ },
675
+ "50336": {
676
+ "content": "[unused51]",
677
+ "lstrip": false,
678
+ "normalized": true,
679
+ "rstrip": false,
680
+ "single_word": false,
681
+ "special": false
682
+ },
683
+ "50337": {
684
+ "content": "[unused52]",
685
+ "lstrip": false,
686
+ "normalized": true,
687
+ "rstrip": false,
688
+ "single_word": false,
689
+ "special": false
690
+ },
691
+ "50338": {
692
+ "content": "[unused53]",
693
+ "lstrip": false,
694
+ "normalized": true,
695
+ "rstrip": false,
696
+ "single_word": false,
697
+ "special": false
698
+ },
699
+ "50339": {
700
+ "content": "[unused54]",
701
+ "lstrip": false,
702
+ "normalized": true,
703
+ "rstrip": false,
704
+ "single_word": false,
705
+ "special": false
706
+ },
707
+ "50340": {
708
+ "content": "[unused55]",
709
+ "lstrip": false,
710
+ "normalized": true,
711
+ "rstrip": false,
712
+ "single_word": false,
713
+ "special": false
714
+ },
715
+ "50341": {
716
+ "content": "[unused56]",
717
+ "lstrip": false,
718
+ "normalized": true,
719
+ "rstrip": false,
720
+ "single_word": false,
721
+ "special": false
722
+ },
723
+ "50342": {
724
+ "content": "[unused57]",
725
+ "lstrip": false,
726
+ "normalized": true,
727
+ "rstrip": false,
728
+ "single_word": false,
729
+ "special": false
730
+ },
731
+ "50343": {
732
+ "content": "[unused58]",
733
+ "lstrip": false,
734
+ "normalized": true,
735
+ "rstrip": false,
736
+ "single_word": false,
737
+ "special": false
738
+ },
739
+ "50344": {
740
+ "content": "[unused59]",
741
+ "lstrip": false,
742
+ "normalized": true,
743
+ "rstrip": false,
744
+ "single_word": false,
745
+ "special": false
746
+ },
747
+ "50345": {
748
+ "content": "[unused60]",
749
+ "lstrip": false,
750
+ "normalized": true,
751
+ "rstrip": false,
752
+ "single_word": false,
753
+ "special": false
754
+ },
755
+ "50346": {
756
+ "content": "[unused61]",
757
+ "lstrip": false,
758
+ "normalized": true,
759
+ "rstrip": false,
760
+ "single_word": false,
761
+ "special": false
762
+ },
763
+ "50347": {
764
+ "content": "[unused62]",
765
+ "lstrip": false,
766
+ "normalized": true,
767
+ "rstrip": false,
768
+ "single_word": false,
769
+ "special": false
770
+ },
771
+ "50348": {
772
+ "content": "[unused63]",
773
+ "lstrip": false,
774
+ "normalized": true,
775
+ "rstrip": false,
776
+ "single_word": false,
777
+ "special": false
778
+ },
779
+ "50349": {
780
+ "content": "[unused64]",
781
+ "lstrip": false,
782
+ "normalized": true,
783
+ "rstrip": false,
784
+ "single_word": false,
785
+ "special": false
786
+ },
787
+ "50350": {
788
+ "content": "[unused65]",
789
+ "lstrip": false,
790
+ "normalized": true,
791
+ "rstrip": false,
792
+ "single_word": false,
793
+ "special": false
794
+ },
795
+ "50351": {
796
+ "content": "[unused66]",
797
+ "lstrip": false,
798
+ "normalized": true,
799
+ "rstrip": false,
800
+ "single_word": false,
801
+ "special": false
802
+ },
803
+ "50352": {
804
+ "content": "[unused67]",
805
+ "lstrip": false,
806
+ "normalized": true,
807
+ "rstrip": false,
808
+ "single_word": false,
809
+ "special": false
810
+ },
811
+ "50353": {
812
+ "content": "[unused68]",
813
+ "lstrip": false,
814
+ "normalized": true,
815
+ "rstrip": false,
816
+ "single_word": false,
817
+ "special": false
818
+ },
819
+ "50354": {
820
+ "content": "[unused69]",
821
+ "lstrip": false,
822
+ "normalized": true,
823
+ "rstrip": false,
824
+ "single_word": false,
825
+ "special": false
826
+ },
827
+ "50355": {
828
+ "content": "[unused70]",
829
+ "lstrip": false,
830
+ "normalized": true,
831
+ "rstrip": false,
832
+ "single_word": false,
833
+ "special": false
834
+ },
835
+ "50356": {
836
+ "content": "[unused71]",
837
+ "lstrip": false,
838
+ "normalized": true,
839
+ "rstrip": false,
840
+ "single_word": false,
841
+ "special": false
842
+ },
843
+ "50357": {
844
+ "content": "[unused72]",
845
+ "lstrip": false,
846
+ "normalized": true,
847
+ "rstrip": false,
848
+ "single_word": false,
849
+ "special": false
850
+ },
851
+ "50358": {
852
+ "content": "[unused73]",
853
+ "lstrip": false,
854
+ "normalized": true,
855
+ "rstrip": false,
856
+ "single_word": false,
857
+ "special": false
858
+ },
859
+ "50359": {
860
+ "content": "[unused74]",
861
+ "lstrip": false,
862
+ "normalized": true,
863
+ "rstrip": false,
864
+ "single_word": false,
865
+ "special": false
866
+ },
867
+ "50360": {
868
+ "content": "[unused75]",
869
+ "lstrip": false,
870
+ "normalized": true,
871
+ "rstrip": false,
872
+ "single_word": false,
873
+ "special": false
874
+ },
875
+ "50361": {
876
+ "content": "[unused76]",
877
+ "lstrip": false,
878
+ "normalized": true,
879
+ "rstrip": false,
880
+ "single_word": false,
881
+ "special": false
882
+ },
883
+ "50362": {
884
+ "content": "[unused77]",
885
+ "lstrip": false,
886
+ "normalized": true,
887
+ "rstrip": false,
888
+ "single_word": false,
889
+ "special": false
890
+ },
891
+ "50363": {
892
+ "content": "[unused78]",
893
+ "lstrip": false,
894
+ "normalized": true,
895
+ "rstrip": false,
896
+ "single_word": false,
897
+ "special": false
898
+ },
899
+ "50364": {
900
+ "content": "[unused79]",
901
+ "lstrip": false,
902
+ "normalized": true,
903
+ "rstrip": false,
904
+ "single_word": false,
905
+ "special": false
906
+ },
907
+ "50365": {
908
+ "content": "[unused80]",
909
+ "lstrip": false,
910
+ "normalized": true,
911
+ "rstrip": false,
912
+ "single_word": false,
913
+ "special": false
914
+ },
915
+ "50366": {
916
+ "content": "[unused81]",
917
+ "lstrip": false,
918
+ "normalized": true,
919
+ "rstrip": false,
920
+ "single_word": false,
921
+ "special": false
922
+ },
923
+ "50367": {
924
+ "content": "[unused82]",
925
+ "lstrip": false,
926
+ "normalized": true,
927
+ "rstrip": false,
928
+ "single_word": false,
929
+ "special": false
930
+ }
931
+ },
932
+ "clean_up_tokenization_spaces": true,
933
+ "cls_token": "[CLS]",
934
+ "extra_special_tokens": {},
935
+ "mask_token": "[MASK]",
936
+ "model_input_names": [
937
+ "input_ids",
938
+ "attention_mask"
939
+ ],
940
+ "model_max_length": 8192,
941
+ "pad_token": "[PAD]",
942
+ "sep_token": "[SEP]",
943
+ "tokenizer_class": "PreTrainedTokenizerFast",
944
+ "unk_token": "[UNK]"
945
+ }