python pipeline_track_a.py # 完整跑(CKIP+stanza+SBERT+DistilBERT) python pipeline_track_a.py --from-cache # 跳過 CKIP+stanza,直接從 cache/eval_cache.json python pipeline_track_a.py --save-cache # 額外存 D4 pairs ...
Abstract: While user-oriented service industries are rapidly growing, various network devices provide these services through different access paths. Accordingly, the network flow is also increasing ...
A production-quality NLP pipeline that fine-tunes DistilBERT on the Stanford Sentiment Treebank (SST-2) dataset and benchmarks it against two classical baselines — Logistic Regression and LinearSVC — ...
Abstract: Phishing continues to be a major and rapidly evolving challenge in cybersecurity. By disguising malicious links as legitimate ones, attackers trick users into revealing sensitive information ...
The system takes a PDF and automatically identifies: - Main results - Key figures - Relevant references Tech used: Python, PyMuPDF, Regex, and LLM-based prompt engineering The goal was to transform ...
🚀 𝗠𝗮𝘀𝘁𝗲𝗿𝗶𝗻𝗴 𝗠𝗮𝗰𝗵𝗶𝗻𝗲 𝗟𝗲𝗮𝗿𝗻𝗶𝗻𝗴: 𝗧𝗵𝗲 𝗔𝗹𝗴𝗼𝗿𝗶𝘁𝗵𝗺𝘀 ...