• Joined on 2024-10-11
fly6516 pushed to master at fly6516s_Educatio.../BD-exp-9 2025-04-19 19:06:52 +00:00
cace440204 test(5-1): 修正变量名大小写并进行代码格式化
fly6516 pushed to master at fly6516s_Educatio.../BD-exp-9 2025-04-19 19:05:51 +00:00
6e973586e0 refactor(5-1):重构代码以提高可读性和效率
fly6516 pushed to master at fly6516s_Educatio.../BD-exp-9 2025-04-19 19:04:44 +00:00
1d16bebe43 refactor(5-1):重构代码以提高可读性和效率
fly6516 pushed to master at fly6516s_Educatio.../BD-exp-9 2025-04-19 19:01:47 +00:00
036a740505 perf(5-1.py):优化 token 权重计算逻辑- 修改了计算 token 权重乘积的方式,仅在 token同时存在于 Amazon 和 Google 数据中时进行计算
fly6516 pushed to master at fly6516s_Educatio.../BD-exp-9 2025-04-19 19:00:42 +00:00
38917b896f refactor(5-1):重构代码以提高可读性和性能
fly6516 pushed to master at fly6516s_Educatio.../BD-exp-9 2025-04-19 18:55:30 +00:00
dc883eaf72 feat: 创建亚马逊和谷歌权重的广播变量
fly6516 pushed to master at fly6516s_Educatio.../BD-exp-9 2025-04-19 18:54:05 +00:00
8bccc2cad7 style(5-1): 移除测试断言中的 f-string- 删除了测试断言中不必要的 f-string 表达式
fly6516 pushed to master at fly6516s_Educatio.../BD-exp-9 2025-04-19 18:51:14 +00:00
8fcedbec41 style(5-1): 移除测试断言中的 f-string- 删除了测试断言中不必要的 f-string 表达式
fly6516 pushed to master at fly6516s_Educatio.../BD-exp-9 2025-04-19 18:49:45 +00:00
6e80f59ea1 style(5-1): 优化 print 语句格式
fly6516 pushed to master at fly6516s_Educatio.../BD-exp-9 2025-04-19 18:47:53 +00:00
9921a965ae feat(5-1.py): 实现可扩展实体匹配算法
fly6516 pushed to master at fly6516s_Educatio.../BD-exp-9 2025-04-19 18:38:20 +00:00
33687f9fcd feat(4-1): 重构代码以实现 Amazon 和 Google 数据集的余弦相似度计算
fly6516 pushed to master at fly6516s_Educatio.../BD-exp-9 2025-04-19 18:32:21 +00:00
15fcc21975 refactor(4-1):重构数据加载和解析逻辑
fly6516 pushed to master at fly6516s_Educatio.../BD-exp-9 2025-04-19 18:29:30 +00:00
e84c0ff633 style(4-1): 替换 f-string 为 format 方法- 将 f-string 替换为 str.format 方法,以适应 Python 3.6 以下版本- 修改了三处 print 语句和一处变量赋值,使用 str.format 方法替代 f-string
fly6516 pushed to master at fly6516s_Educatio.../BD-exp-9 2025-04-19 18:24:26 +00:00
250e1b99e0 feat(similarity): 添加文本相似度计算功能
fly6516 pushed to master at fly6516s_Educatio.../BD-exp-9 2025-04-16 02:08:54 +00:00
1043551309 feat(3-1.py): 实现 TF-IDF 权重计算并优化代码结构
fly6516 pushed to master at fly6516s_Educatio.../BD-exp-9 2025-04-16 01:53:55 +00:00
739ad9d6cb fix(3-1): 更新 HDFS 地址
fly6516 pushed to master at fly6516s_Educatio.../BD-exp-9 2025-04-16 01:46:44 +00:00
ceaf470df4 fix(3-1): 更新 HDFS 地址
fly6516 pushed to master at fly6516s_Educatio.../BD-exp-9 2025-04-16 01:45:26 +00:00
4c79cad322 fix(data): 更新数据文件路径- 将 GoogleProducts.csv 和 AmazonProducts.csv 替换为 Google.csv 和 Amazon.csv
fly6516 pushed to master at fly6516s_Educatio.../BD-exp-9 2025-04-16 01:44:17 +00:00
4233cfc792 fix(3-1): 更新 stopwords.txt 文件路径
fly6516 pushed to master at fly6516s_Educatio.../BD-exp-9 2025-04-16 01:43:38 +00:00
c699e3d21e feat(3-1.py):重构代码以构建倒排索引