交互基准

July 19, 2024 · View on GitHub

我们的交互实验结果参见这里

任务难度分级

我们综合考量两个因素,为任务划分难度级别:理想的交互步骤数 𝑆𝑡𝑒𝑝𝑖𝑑𝑒𝑎𝑙𝑆𝑡𝑒𝑝_{𝑖𝑑𝑒𝑎𝑙} 和任务描述的模糊性(即,模糊度指标 𝑆𝑐𝑜𝑟𝑒vag𝑆𝑐𝑜𝑟𝑒_{vag}, 定义为理想交互步骤数量 𝑆𝑡𝑒𝑝𝑖𝑑𝑒𝑎𝑙𝑆𝑡𝑒𝑝_{𝑖𝑑𝑒𝑎𝑙} 除以任务描述中的交互命令数量)。

interaction task level

任务合集

下表为我们用于测试工具交互能力的30个任务合集,涉及8个安卓应用。

App任务描述StepidealStep_{ideal}ScorevagScore_{vag}ScoreScore难度
美团1点击“外卖”按钮11 / 1 = 12L1
2点击“我的”,点击进入设置页面22 / 2 = 13L1
3点击左上角的定位信息,修改定位改为北京市22 / 2 = 13L1
4点击“外卖 ”按钮,点击“美食”分区,点击进入店铺列表第一家店铺33 / 3 = 14L2
5点击“电影演出”按钮,点击“正在热映”下的第一部电影,点击“想看”按钮33 / 3 = 14L2
6搜索“门票”,点击搜索推荐中的“景点门票频道”,点击“景点”按钮,点击景点列表的第一个景点,点击查看景点的评分66 / 5 = 1.257.25L2
7开启App的“长辈模式”44 / 1 = 48L3
8删除我的全部收藏55 / 1 = 510L3
9点击“我的”,点击设置,新增一个收货地址,点击选择收货地址,选择屏幕下方列表中的第一个地址,然后填写门牌号01,姓名小明,手机号13800000000,最后保存地址1010 / 9 = 1.1111.11L3
小红书10点击第一条推文,点击点赞按钮22 / 2 = 13L1
11点击“我”,点击设置按钮,点击“隐私设置”,点击“在线状态”,将其修改为“公开”55 / 5 = 16L2
12点击导航栏的“购物”按钮,点击一个商品卡片的商品,按照默认配置加入购物车,然后去购物车页面,删除购物车中的所有商品88 / 5 = 1.610.6L3
豆瓣13点击第一条推文的点赞按钮11 / 1 = 12L1
14点击“书影音”标签,点击豆瓣榜单,点击近期热门电影Top20榜单,点击榜单第一部电影的“想看”44 / 4 = 15L2
15点击“我”标签,点击“创建我的书影音”,在屏幕中央向左划,点击创建我的图书TOP10,依次勾选列表中的前三本书,点击确定,点击发布99 / 7 = 1.2910.29L3
Facebook16Click the Like button on the first post11 / 1 = 12L1
17Click the "What's on your mind?" input box, then send a post with the content "Hello everyone"44 / 2 = 26L2
18Click the top Profile tab, click Edit Profile, scroll until you find the Bio section, click the Add button in the Bio section, click "Describe yourself", edit the content to "Hello Sky" and save77 / 7 = 18L3
Gmail19Click to view the first email, then mark it as a favorite22 / 2 = 13L1
20Send an empty email to {address}44 / 2 = 26L2
21Send an email to {address1} and {address2} with the subject "Test" and the body "Hello"88 / 3 = 2.6610.66L3
LinkedIn22Click the avatar of the user who posted the first tweet11 / 1 = 12L1
23View the detail page of the first notification in the notifications list33 / 1 = 36L2
24Click the "Jobs" tab, search for QA Engineer in the search bar, click to view the details of any job from the search results, and then click Save66 / 4 = 1.57.5L3
Google Play25Click the Top Charts tab, then click to view the detail of "Honor of Kings"22 / 2 = 13L1
26Click the Books tab, then enter the ratings and reviews section of the first book33 / 2 = 1.54.5L2
27Download WhatsApp44 / 1 = 48L3
YouTube Music28Click the "Explore" tap, then click the "New releases"22 / 2 = 13L1
29Search singer Jay Chou33 / 1 = 36L2
30Search for "Hello World" and then play any song from the search list44 / 1 = 48L3