Homepage
Homepage
Home
Publications
CV
Light
Dark
Automatic
1
Does your AI agent get you? A personalizable framework for approximating human models from argumentation-based dialogue traces
Yinxu Tang
,
Stylianos Loukas Vasileiou
,
William Yeoh
PDF
Cite
DOI
Privacy-Preserving Edge Intelligence: A Perspective of Constrained Bandits
Tianyi Zhang
,
Shangshang Wang
,
Yinxu Tang
,
Ziyu Shao
,
Yang Yang
PDF
Cite
DOI
Social-Aware Distributed Meta-Learning: A Perspective of Constrained Graphical Bandits
Shangshang Wang
,
Simeng Bian
,
Yinxu Tang
,
Ziyu Shao
PDF
Cite
DOI
Social-Aware Edge Intelligence: A Constrained Graphical Bandit Approach
Simeng Bian
,
Shangshang Wang
,
Yinxu Tang
,
Ziyu Shao
PDF
Cite
DOI
Learning-Aided Stable Matching for Switch-Controller Association in SDN Systems
Yinxu Tang
,
Tao Huang
,
Xi Huang
,
Ziyu Shao
,
Yang Yang
PDF
Cite
DOI
Green Edge Intelligence Scheme for Mobile Keyboard Emoji Prediction
Jianfeng Hou
,
Yinxu Tang
,
Xi Huang
,
Ziyu Shao
,
Yang Yang
PDF
Cite
DOI
Proactive Cache Placement with Bandit Learning in Fog-Assisted IoT Systems
In fog-assisted IoT systems, it is a common practice to cache popular content at the network edge to achieve high quality of service. Due to various uncertainties such as unknown file popularities in practice, the design of effective cache placement scheme is still an open problem with two key challenges: 1) how to incorporate online learning into the cache placement process to minimize performance loss (a.k.a. regret), and 2) how to maintain caching costs under budgets in the long run. In this paper, we formulate the content cache placement problem with unknown file popularities as a combinatorial multi-armed bandit (CMAB) problem with long-term time-average constraints. We adopt bandit learning methods and virtual queue technique to deal with the exploration-exploitation tradeoff and long-term time-average constraints, respectively. With an effective integration of online learning and online control, we devise a learning-aided cache placement scheme called CPB (Cache Placement with Bandit Learning). Our theoretical analysis and simulation results show that CPB achieves a tunable sublinear regret over a finite time horizon and keeps caching costs within budgets in the long run.
Xin Gao
,
Xi Huang
,
Yinxu Tang
,
Ziyu Shao
,
Yang Yang
PDF
Cite
DOI
Joint Switch-Controller Association and Control Devolution for SDN Systems: An Integration of Online Control and Online Learning
Xi Huang
,
Yinxu Tang
,
Ziyu Shao
,
Yang Yang
,
Hong Xu
PDF
Cite
DOI
Cite
×