Agent Mediated Intelligence Research Group


Selected Publications

2024202320222021202020192018201720162015201420132012201120102009200820072006Thesis Supervised

2024

  • Wanqi Xue, Bo An, Shuicheng Yan, Zhongwen Xu. Reinforcement learning from diverse human preferences. Proceedings of the 33rd International Joint Conference on Artificial Intelligence (IJCAI'24).  

  • Pengjie Gu, Mengchen Zhao, Xu He, Yi Cai, Bo An. PoRank: A practical framework for learning to rank policies. Proceedings of the 33rd International Joint Conference on Artificial Intelligence (IJCAI'24).  

  • Pengdeng Li, Shuxin Li, Chang Yang, Xinrun Wang, Xiao Huang, Hau Chan, Bo An. Self-adaptive PSRO: Towards an automatic population-based game solver. Proceedings of the 33rd International Joint Conference on Artificial Intelligence (IJCAI'24).  

  • Xinrun Wang, Chang Yang, Shuxin Li, Pengdeng Li, Xiao Huang, Hau Chan, Bo An. Reinforcement Nash equilibrium solver. Proceedings of the 33rd International Joint Conference on Artificial Intelligence (IJCAI'24).  

  • Hui Niu, Siyuan Li, Jiahao Zheng, Zhouchi Lin, Jian Li, Jian Guo, Bo An. IMM: An imitative reinforcement learning approach with predictive representation learning for automatic market making. Proceedings of the 33rd International Joint Conference on Artificial Intelligence (IJCAI'24).  

  • Yiwen Zhu, Jinyi Liu, Wenya Wei, Qianyi Fu, Yujing Hu, Zhou Fang, Bo An, Jianye Hao, Tangjie Lv, Changjie Fan. vMFER: von Mises-Fisher experience resampling based on uncertainty of gradient directions for policy improvement. Proceedings of the 33rd International Joint Conference on Artificial Intelligence (IJCAI'24).  

  • Ruyi An, Yewen Li, Xu He, Pengjie Gu, Mengchen Zhao, Dong Li, Jianye Hao, Bo An, Chaojie Wang, Mingyuan Zhou. Improving unsupervised hierarchical representation with reinforcement learning. Proceedings of the 2024 IEEE Conference on Computer Vision and Pattern Recognition (CVPR'24).  

  • Wentao Zhang, Yilei Zhao, Shuo Sun, Jie Ying, Yonggang Xie, Zitao Song, Xinrun Wang, Bo An. Reinforcement learning with maskable stock representation for portfolio management in customizable stock pools. Proceedings of the 2024 Web Conference (WWW'24). [PDF] 

  • Yuzhou Cao, Lei Feng, Bo An. Consistent hierarchical classification with a generalized metric. Proceedings of the 27th International Conference on Artificial Intelligence and Statistics (AISTATS'24). [PDF] 

  • Shuqi Liu, Yuzhou Cao, Qiaozhen Zhang, Lei Feng, Bo An. Mitigating underfitting in learning to defer with consistent losses. Proceedings of the 27th International Conference on Artificial Intelligence and Statistics (AISTATS'24). [PDF] 

  • Longtao Zheng, Rundong Wang, Xinrun Wang, Bo An. Synapse: Trajectory-as-exemplar prompting with memory for computer control. Proceedings of the 2024 International Conference on Learning Representations (ICLR'24). [PDF] 

  • Weihao Tan, Wentao Zhang, Shanqi Liu, Longtao Zheng, Xinrun Wang, Bo An. True knowledge comes from practice: Aligning large language models with embodied environments via reinforcement learning. Proceedings of the 2024 International Conference on Learning Representations (ICLR'24). [PDF] 

  • Shanqi Liu, Dong Xing, Pengjie Gu, Bo An, Yong Liu, Xinrun Wang. Greedy sequential execution: Solving homogeneous and heterogeneous cooperative tasks with a unified framework. Proceedings of the 2024 International Conference on Learning Representations (ICLR'24). [PDF] 

  • Zixi Wei, Senlin Shu, Yuzhou Cao, Hongxin Wei, Bo An, Lei Feng. Consistent Multi-class classification from multiple unlabeled datasets. Proceedings of the 2024 International Conference on Learning Representations (ICLR'24). [PDF] 

  • Shengjie Zhou, Lue Tao, Yuzhou Cao, Tao Xiang, Bo An, Lei Feng. On the vulnerability of adversarially trained models against two-faced attacks. Proceedings of the 2024 International Conference on Learning Representations (ICLR'24). [PDF] 

  • Safa Messaoud, Billel Mokeddem, Zhenghai Xue, Linsey Pang, Bo An, Haipeng Chen, Sanjay Chawla. S2AC: Energy-based reinforcement learning with stein soft actor critic. Proceedings of the 2024 International Conference on Learning Representations (ICLR'24). [PDF] 

  • Pengdeng Li, Shuxin Li, Xinrun Wang, Jakub Cerny, Youzhi Zhang, Stephen McAleer, Hau Chan, Bo An. Grasper: A generalist pursuer for pursuit-evasion problems. Proceedings of the 23rd International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'24). [PDF] 

  • Pengdeng Li, Runsheng Yu, Xinrun Wang, Bo An. Transition-informed reinforcement learning for large-scale Stackelberg mean-field games. Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI'24). [PDF] 

  • Molei Qin, Shuo Sun, Wentao Zhang, Haochong Xia, Xinrun Wang, Bo An. EarnHFT: Efficient hierarchical reinforcement learning for high frequency trading. Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI'24). [PDF] 

  • Haochong Xia, Shuo Sun, Xinrun Wang, Bo An. Market-GAN: Adding control to financial market data generation with semantic context. Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI'24). [PDF] 

  • Senlin Shu, Haobo Wang, Zhuowei Wang, Bo Han, Tao Xiang, Bo An, Lei Feng. Online binary classifcation from similar and dissimilar data. Machine Learning. [PDF] 

  • Shifei Ding, Wei Du, Ling Ding, Jian Zhang, Lili Guo, Bo An. Robust multi-agent communication with graph information bottleneck optimization. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI). [PDF] 

  • Jiaqi Lv, Biao Liu, Lei Feng, Ning Xu, Miao Xu, Bo An, Gang Niu, Xin Geng, Masashi Sugiyama. On the robustness of average losses for partial-label learning. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI). [PDF] 

  • Shifei Ding, Wei Du, Ling Ding, Jian Zhang, Lili Guo, Bo An. Multi-agent reinforcement learning with graphical mutual information maximization. IEEE Transactions on Neural Networks and Learning Systems. [PDF] 

2023

  • Youzhi Zhang, Bo An, VS Subrahmanian. Computing optimal Nash equilibria in multiplayer games. Proceedings of the Thirty-seventh Annual Conference on Neural Information Processing Systems (NeurIPS'23). [PDF] 

  • Pengjie Gu, Xinyu Cai, Dong Xing, Xinrun Wang, Mengchen Zhao, Bo An. Offline RL with discrete proxy representations for generalizability in POMDPs. Proceedings of the Thirty-seventh Annual Conference on Neural Information Processing Systems (NeurIPS'23). [PDF] 

  • Zhenghai Xue, Qingpeng Cai, Shuchang Liu, Dong Zheng, Peng Jiang, Kun Gai, Bo An. State regularized policy optimization on data with dynamics shift. Proceedings of the Thirty-seventh Annual Conference on Neural Information Processing Systems (NeurIPS'23). [PDF] 

  • Renchunzi Xie, Hongxin Wei, Lei Feng, Yuzhou Cao, Bo An. On the importance of feature separability in predicting out-of-distribution error. Proceedings of the Thirty-seventh Annual Conference on Neural Information Processing Systems (NeurIPS'23). [PDF] 

  • Yuzhou Cao, Hussein Mozannar, Lei Feng, Hongxin Wei, Bo An. In defense of softmax parametrization for calibrated and consistent learning to defer. Proceedings of the Thirty-seventh Annual Conference on Neural Information Processing Systems (NeurIPS'23). [PDF] 

  • Xin Cheng, Yuzhou Cao, Haobo Wang, Hongxin Wei, Bo An, Lei Feng. Regression with cost-based rejection. Proceedings of the Thirty-seventh Annual Conference on Neural Information Processing Systems (NeurIPS'23). [PDF] 

  • Zhibin Duan, Zhiyi Lv, Chaojie Wang, Bo Chen, Bo An, Mingyuan Zhou. Few-shot generation via recalling the episodic-semantic memory like human being. Proceedings of the Thirty-seventh Annual Conference on Neural Information Processing Systems (NeurIPS'23). [PDF] 

  • Shuo Sun, Molei Qin, wentao zhang, Haochong Xia, Chuqiao Zong, Jie Ying, Yonggang Xie, Lingxuan Zhao, Xinrun Wang, Bo An. TradeMaster: A holistic quantitative trading platform empowered by reinforcement learning. Proceedings of the Thirty-seventh Annual Conference on Neural Information Processing Systems (NeurIPS'23). [PDF] 

  • Shuo Sun, Xinrun Wang, Wanqi Xue, Xiaoxuan Lou, Bo An. Mastering stock markets with efficient mixture of diversified trading experts. Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data (KDD'23), pp.2109-2119. [PDF] 

  • Wanqi Xue, Qingpeng Cai, Zhenghai Xue, Shuo Sun, Shuchang Liu, Dong Zheng, Peng Jiang, Kun Gai, Bo An. PrefRec: Recommender systems with human preferences for reinforcing long-term user engagement. Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data (KDD'23), pp.2874-2884. [PDF] 

  • Hongxin Wei, Huiping Zhuang, Renchunzi Xie, Lei Feng, Gang Niu, Bo An, Yixuan Li. Mitigating memorization of noisy labels by clipping the model prediction. Proceedings of the 40th International Conference on Machine Learning (ICML'23), pp.36868-36886. [PDF] 

  • Xin Cheng, Yuzhou Cao, Ximing Li, Bo An, Lei Feng. Weakly supervised regression with interval targets. Proceedings of the 40th International Conference on Machine Learning (ICML'23), pp.5428-5448. [PDF] 

  • Dong Xing, Pengjie Gu, Qian Zheng, Xinrun Wang, Shanqi Liu, Longtao Zheng, Bo An, Gang Pan. Controlling type confounding in ad hoc teamwork with instance-wise teammate feedback rectification. Proceedings of the 40th International Conference on Machine Learning (ICML'23), pp.38272-38285. [PDF] 

  • Hao Cheng, Shufeng Kong, Yanchen Deng, Caihua Liu, Xiaohu Wu, Bo An, Chongjun Wang. Exploring leximin principle for fair core-selecting combinatorial auctions: Payment rule design and implementation. Proceedings of the 32nd International Joint Conference on Artificial Intelligence (IJCAI'23), pp.2581-2588. [PDF] 

  • Haipeng Chen, Bryan Wilder, Wei Qiu, Bo An, Eric Rice, Milind Tambe. A learning approach to complex contagion influence maximization. Proceedings of the 32nd International Joint Conference on Artificial Intelligence (IJCAI'23), pp.5531-5540. [PDF] 

  • Wanqi Xue, Qingpeng Cai, Ruohan Zhan, Dong Zheng, Peng Jiang, Kun Gai, Bo An. ResAct: Reinforcing long-term engagement in sequential recommendation with residual actor. Proceedings of the 2023 International Conference on Learning Representations (ICLR'23). [PDF] 

  • Wei Qiu, Xiao Ma, Bo An, Svetlana Obraztsova, Shuicheng Yan, Zhongwen Xu. RPM: Generalizable behaviors for multi-agent reinforcement learning. Proceedings of the 2023 International Conference on Learning Representations (ICLR'23). [PDF] 

  • Pengdeng Li, Xinrun Wang, Shuxin Li, Hau Chan, Bo An. Scaling laws in mean-field games. Proceedings of the 2023 International Conference on Learning Representations (ICLR'23). [PDF] 

  • Shuqi Liu, Yuzhou Cao, Qiaozhen Zhang, Lei Feng, Bo An. Consistent complementary-label learning via order-preserving losses. Proceedings of the 26th International Conference on Artificial Intelligence and Statistics (AISTATS'23), pp.8734-8748. [PDF] 

  • Qian Che, Fengchen Wang, Tianchi Qiao, Xiang Liu, Jiuchuan Jiang, Bo An, Wanyuan Wang, Yichuan Jiang. Structural credit assignment-guided coordinated MCTS: An efficient and scalable method for online multiagent planning. Proceedings of the 22nd International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'23), pp.543-551. [PDF] 

  • Youzhi Zhang, Bo An, V.S. Subrahmanian. Finding optimal nash equilibria in multiplayer games via correlation plans. Proceedings of the 22nd International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'23), pp.2712-2714. [PDF] 

  • Haipeng Chen, Bryan Wilder, Wei Qiu, Bo An, Eric Rice, Milind Tambe. A learning approach to complex contagion influence maximization. Proceedings of the 22nd International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'23), pp.2622-2624. [PDF] 

  • Wei Qiu, Weixun Wang, Rundong Wang, Bo An, Yujing Hu, Svetlana Obraztsova, Zinovi Rabinovich, Jianye Hao, Yingfeng Chen, Changjie Fan. Off-beat multi-agent reinforcement learning. Proceedings of the 22nd International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'23), pp.2424-2426. [PDF] 

  • Shuxin Li, Xinrun Wang, Youzhi Zhang, Wanqi Xue, Jakub Cerny, Bo An. Solving large-scale pursuit-evasion games using pre-trained strategies. Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI'23), pp.11586-11594. [PDF] 

  • Linjian Meng, Zhenxing Ge, Pinzhuo Tian, Bo An, Yang Gao. Deep FTRL-ORW: An efficient FTRL-based deep reinforcement learning algorithm for solving imperfect information extensive-form game. Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI'23), pp.5823-5831. [PDF] 

  • Xin Cheng, Deng-Bao Wang, Lei Feng, Min-Ling Zhang, Bo An. Partial-label regression. Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI'23), pp.7140-7147. [PDF] 

  • Shijie Han, Siyuan Li, Bo An, Wei Zhao, Peng Liu. Classifying ambiguous identities in hidden-role Stochastic games with multi-agent reinforcement learning. Journal of Autonomous Agents and Multi-Agent Systems (JAAMAS), Vol.37, No.2, article 35, 2023. [PDF] 

  • Shuo Sun, Molei Qin, Xinrun Wang, Bo An. PRUDEX-Compass: Towards systematic evaluation of reinforcement learning in financial markets. Transactions on Machine Learning Research (TMLR). [PDF] 

  • Shuo Sun, Rundong Wang, Bo An. Reinforcement learning for quantitative trading. ACM Transactions on Intelligent Systems and Technology, Vol.14, No.3, pp.1-29, 2023. [PDF] 

  • Hongxin Wei, Renchunzi Xie, Lei Feng, Bo Han, Bo An. Deep learning from multiple noisy annotators as a union. IEEE Transactions on Neural Networks and Learning Systems, Vol.34, No.12, 10552-10562, 2023. [PDF] 

  • Lei Feng, Senlin Shu, Yuzhou Cao, Lue Tao, Hongxin Wei, Tao Xiang, Bo An, Gang Niu. Multiple-instance learning from unlabeled bags with pairwise similarity. IEEE Transactions on on Knowledge and Data Engineering, Vol.35, No.11, 11599-11609, 2023. [PDF] 

2022

  • Yanchen Deng, Shufeng Kong, Caihua Liu, Bo An. Deep attentive belief propagation: Integrating reasoning and learning for solving constraint optimization problems. Proceedings of the Thirty-sixth Annual Conference on Neural Information Processing Systems (NeurIPS'22). [PDF] 

  • Yewen Li, Chaojie Wang, Xiaobo Xia, Tongliang Liu, Xin Miao, Bo An. Out-of-distribution detection with an adaptive likelihood ratio on informative hierarchical VAE. Proceedings of the Thirty-sixth Annual Conference on Neural Information Processing Systems (NeurIPS'22). [PDF] 

  • Yewen Li, Chaojie Wang, Zhibin Duan, Dongsheng Wang, Bo Chen, Mingyuan Zhou, Bo An. Alleviating ``posterior collapse'' in deep topic models via policy gradient. Proceedings of the Thirty-sixth Annual Conference on Neural Information Processing Systems (NeurIPS'22). [PDF] 

  • Yuzhou Cao, Lei Feng, Tianchi Cai, Lihong Gu, Jinjie Gu, Bo An, Gang Niu, Masashi Sugiyama. Generalizing consistent multi-class classification with rejection to be compatible with arbitrary losses. Proceedings of the Thirty-sixth Annual Conference on Neural Information Processing Systems (NeurIPS'22). [PDF] 

  • Shuo Sun, Rundong Wang, Wanqi Xue, Xu He, Junlei Zhu, Jian Li, Bo An. DeepScalper: A risk-aware reinforcement learning framework to capture fleeting intraday trading opportunities. Proceedings of the 31st ACM International Conference on Information and Knowledge Management (CIKM'22), pp.1858-1867. [PDF] 

  • Junning Liu, Xinjian Li, Bo An, Zijie Xia and Xu Wan. Multi-faceted hierarchical multi-task learning for recommender systems. Proceedings of the 31st ACM International Conference on Information and Knowledge Management (CIKM'22), pp.3332-3341. [PDF] 

  • Hongxin Wei, Lue Tao, Renchunzi Xie, Lei Feng, Bo An. Open-sampling: Exploring out-of-distribution data for re-balancing long-tailed datasets. Proceedings of the 39th International Conference on Machine Learning (ICML'22), pp.23615-23630. [PDF] 

  • Pengjie Gu, Mengchen Zhao, Chen Chen, Dong Li, Jianye Hao, Bo An. Learning pseudometric-based action representations for offline reinforcement learning. Proceedings of the 39th International Conference on Machine Learning (ICML'22), pp.7902-7918. [PDF] 

  • Hongxin Wei, Renchunzi Xie, Hao Cheng, Lei Feng, Bo An, Yixuan Li. Mitigating neural network overconfidence with logit normalization. Proceedings of the 39th International Conference on Machine Learning (ICML'22), pp.23631-23644. [PDF] 

  • Jakub Cerny, Bo An, Allan N. Zhang. Quantal correlated equilibrium in normal form games. Proceedings of the 23rd ACM Conference on Economics and Computation (EC'22), pp.210-239. [PDF] 

  • Youzhi Zhang, Bo An, V.S. Subrahmanian. Correlation-based algorithm for team-maxmin equilibrium in multiplayer extensive-form games. Proceedings of the 31st International Joint Conference on Artificial Intelligence (IJCAI'22), pp.606-612. [PDF] 

  • Aye Phyu Phyu Aung, Xinrun Wang, Runsheng Yu, Bo An, Senthilnath Jayavelu, Xiaoli Li. DO-GAN: A double oracle framework for generative adversarial networks. Proceedings of the 2022 IEEE Conference on Computer Vision and Pattern Recognition (CVPR'22), pp.11265-11274. [PDF] 

  • Pengjie Gu, Mengchen Zhao, Jianye Hao, Bo An. Online ad hoc teamwork under partial observability. Proceedings of the 2022 International Conference on Learning Representations (ICLR'22). [PDF] 

  • Wanqi Xue, Wei Qiu, Bo An, Zinovi Rabinovich, Svetlana Obraztsova, Chai Kiat Yeo. Mis-spoke or mis-lead: Achieving robustness in multi-agent communicative reinforcement learning. Proceedings of the 21st International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'22), pp.1418-1426. [PDF] 

  • Wanyuan Wang, Gerong Wu, Weiwei Wu, Yichuan Jiang, Bo An. Online collective multiagent planning by offline policy reuse with applications to city-scale mobility-on-demand systems. Proceedings of the 21st International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'22), pp.1364-1372. [PDF] 

  • Wanqi Xue, Bo An, Chai Kiat Yeo. NSGZero: Efficiently learning non-exploitable policy in large-scale network security games with neural monte carlo tree search. Proceedings of the 36th AAAI Conference on Artificial Intelligence (AAAI'22), pp.4646-4653. [PDF] 

  • Yanchen Deng, Shufeng Kong, Bo An. Pretrained cost model for distributed constraint optimization problems. Proceedings of the 36th AAAI Conference on Artificial Intelligence (AAAI'22), pp.9331-9340. [PDF] 

  • Renchunzi Xie, Hongxin Wei, Lei Feng, Bo An. GearNet: Stepwise dual learning for weakly supervised domain adaptation. Proceedings of the 36th AAAI Conference on Artificial Intelligence (AAAI'22), pp.8717-8725. [PDF] 

  • Bo An, Shuo Sun, Rundong Wang. Deep Reinforcement learning for quantitative trading: Challenges and opportunities. IEEE Intelligent Systems, Vol.37, No.2, pp.23-26, 2022. [PDF] 

  • Lei Feng, Jun Huang, Senlin Shu, Bo An. Regularized matrix factorization for multi-label learning with missing labels. IEEE Transactions on Cybernetics, Vol.52, No.5, pp.3710-3722, 2022. [PDF] 

  • Jiuchuan Jiang, Kai Di, Bo An, Yichuan Jiang, Zhan Bu, Jie Cao. Batch crowdsourcing for complex tasks based on distributed team formation in e-markets. IEEE Transactions on Transactions on Parallel and Distributed Systems, Vol.33, No.12, pp.3600-3615, 2022. [PDF] 

  • Zhuowei Wang, Jing Jiang, Bo Han, Lei Feng, Bo An, Gang Niu, Guodong Long. SemiNLL: A framework of noisy-label learning by semi-supervised learning. Transactions on Machine Learning Research (TMLR), August 2022. [PDF] 

2021

  • HongxinWei, Lue Tao, Renchunzi Xie, Bo An. Open-set label noise can improve robustness against inherent label noise. Proceedings of the Thirty-fifth Annual Conference on Neural Information Processing Systems (NeurIPS'21), pp.7978-7992. [PDF] 

  • Wei Qiu, Xinrun Wang, Runsheng Yu, Rundong Wang, Xu He, Bo An, Svetlana Obraztsova, Zinovi Rabinovich. RMIX: Learning risk-sensitive policies for cooperative reinforcement learning agents. Proceedings of the Thirty-fifth Annual Conference on Neural Information Processing Systems (NeurIPS'21), pp.23049-23062. [PDF] 

  • Lei Feng, Senlin Shu, Yuzhou Cao, Lue Tao, Hongxin Wei, Tao Xiang, Bo An, Gang Niu. Multiple-instance learning from similar and dissimilar bags. Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data (KDD'21), pp.374–382. [PDF] 

  • Haipeng Chen, Wei Qiu, Han-Ching Ou, Bo An, Milind Tambe. Contingency-aware influence maximization: A reinforcement learning approach. Proceedings of the 2021 Conference on Uncertainty in Artificial Intelligence (UAI'21), pp.1535-1545. [PDF] 

  • Yuzhou Cao, Lei Feng, Yitian Xu, Bo An, Gang Niu, Masashi Sugiyama. Learning from similarity-confidence data. Proceedings of the 38th International Conference on Machine Learning (ICML'21), pp.1272-1282. [PDF] 

  • Lei Feng, Senlin Shu, Nan Lu, Bo Han, Miao Xu, Gang Niu, Bo An, Masashi Sugiyama. Pointwise binary classification with pairwise confidence comparisons. Proceedings of the 38th International Conference on Machine Learning (ICML'21), pp.3252-3262. [PDF] 

  • Yanchen Deng, Runsheng Yu, Xinrun Wang, Bo An. Neural regret matching for distributed constraint optimization problems. Proceedings of the 30th International Joint Conference on Artificial Intelligence (IJCAI'21), pp.146-153. [PDF] [Appendix] 

  • Wanqi Xue, Youzhi Zhang, Shuxin Li, Bo An, Chai Kiat Yeo. Solving large-scale extensive-form network security games via neural fictitious self-play. Proceedings of the 30th International Joint Conference on Artificial Intelligence (IJCAI'21), pp.3713-3720. [PDF] 

  • Shuxin Li, Youzhi Zhang, Xinrun Wang, Wanqi Xue, Bo An. CFR-MIX: Solving imperfect information extensive-form games with combinatorial action space. Proceedings of the 30th International Joint Conference on Artificial Intelligence (IJCAI'21), pp.3663-3669. [PDF] 

  • Youzhi Zhang, Bo An, Jakub Cerny. Computing ex ante coordinated team-maxmin equilibria in zero-sum multiplayer extensive-form games. Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI'21), pp.5813-5821. [PDF] 

  • Rundong Wang, Hongxin Wei, Bo An, Zhouyan Feng, Jun Yao. Commission fee is not enough: A hierarchical reinforced framework for portfolio management. Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI'21), pp.626-633. [PDF] 

  • David Milec, Jakub Cerny, Viliam Lisy, Bo An. Complexity and algorithms for exploiting quantal opponents in large two-player games. Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI'21), pp.5575-5583. [PDF] 

  • Jakub Cerny, Viliam Lisy, Branislav Bosansky, Bo An. Computing quantal Stackelberg equilibrium in extensive-form games. Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI'21), pp.5260-5268. [PDF] 

  • Runsheng Yu, Yu Gong, Xu He, Bo An, Yu Zhu, Qingwen Liu, Wenwu Ou. Personalized adaptive meta learning for cold-start user preference prediction. Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI'21), pp.10772-10780. [PDF] 

  • Lei Feng, Hongxin Wei, Qingyu Guo, Zhuoyi Lin, Bo An. Embedding-augmented generalized matrix factorization for recommendation with implicit feedback. IEEE Intelligent Systems, Vol.36, No.6, pp.32-41, 2021. [PDF] 

  • Jiuchuan Jiang, Bo An, Yichuan Jiang, Chenyan Zhang, Zhan Bu, Jie Cao. Group-oriented task allocation for crowdsourcing in social networks. IEEE Transactions on Systems, Man, and Cybernetics: Systems, Vol.51, No.7, pp.4417-4432, 2021. [PDF] 

  • Yanchen Deng, Bo An. Utility distribution matters: Enabling fast belief propagation for multi‑agent optimization with dense local utility function. Journal of Autonomous Agents and Multi-Agent Systems, Vol.35, No.2, Article 24, 2021. [PDF] 

  • Yanhai Xiong, Bo An, Sarit Kraus. Electric vehicle charging strategy study and the application on charging station placement. Journal of Autonomous Agents and Multi-Agent Systems, Vol.35, No.1, Article 3, 2021. [PDF] 

  • Wanyuan Wang, Zichen Dong, Bo An, Yichuan Jiang. Toward efficient city-scale patrol planning using decomposition and grafting. IEEE Transactions on Intelligent Transportation Systems, Vol.22, No.2, pp.747-757, 2021. [PDF] 

2020

  • Lei Feng, Jiaqi Lv, Bo Han, Miao Xu, Gang Niu, Xin Geng, Bo An, Masashi Sugiyama. Provably consistent partial-label learning. Proceedings of the Thirty-fourth Annual Conference on Neural Information Processing Systems (NeurIPS'20). [PDF] 

  • Xu He, Bo An, Yanghua Li, Haikai Chen, Qingyu Guo, Xin Li, Zhirong Wang. Contextual user browsing bandits for large-scale online mobile recommendation. Proceedings of the 2020 ACM Recommender Systems conference (RecSys'20), pp.63-72. [PDF] 

  • Xu He, Bo An, Yanghua Li, Haikai Chen, Rundong Wang, Xinrun Wang, Runsheng Yu, Xin Li, Zhirong Wang. Learning to collaborate in multi-module recommendation via multi-agent reinforcement learning without communication. Proceedings of the 2020 ACM Recommender Systems conference (RecSys'20), pp.210-219. [PDF] 

  • Feifei Lin, Xu He, Bo An. Context-aware multi-agent coordination with loose couplings and repeated interaction. Proceedings of the 2nd International Conference on Distributed Artificial Intelligence (DAI'20), pp.103-125. [PDF] 

  • Yanchen Deng, Zongmin Qiu, Yong Wang, Yinghui Xu, Bo An. Battery management for automated warehouses via deep reinforcement learning. Proceedings of the 2nd International Conference on Distributed Artificial Intelligence (DAI'20), pp.126-139. [PDF] 

  • Youzhi Zhang, Bo An. Converging to team-maxmin equilibria in zero-sum multiplayer games. Proceedings of the 37th International Conference on Machine Learning (ICML'20), pp.11033-11043. [PDF] 

  • Rundong Wang, Xu He, Runsheng Yu, Wei Qiu, Bo An, Zinovi Rabinovich. Learning efficient multi-agent communication: An information bottleneck approach. Proceedings of the 37th International Conference on Machine Learning (ICML'20), pp.9908-9918. [PDF] 

  • Lei Feng, Takuo Kaneko, Bo Han, Gang Niu, Bo An, Masashi Sugiyama. Learning with multiple complementary labels. Proceedings of the 37th International Conference on Machine Learning (ICML'20), pp.3072-3081. [PDF] 

  • Xu He, Haipeng Chen, Bo An. Learning behaviors with uncertain human feedback. Proceedings of the 2020 Conference on Uncertainty in Artificial Intelligence (UAI'20), pp.131-140. [PDF] 

  • Jakub Cerny, Branislav Bosansky, Bo An. Finite state machines play extensive-form games. Proceedings of the 21st ACM Conference on Economics and Computation (EC'20), pp.509-533. [PDF] 

  • Yanchen Deng, Bo An. Speeding up incomplete GDL-based algorithms for multi-agent optimization with dense local utilities. Proceedings of the 29th International Joint Conference on Artificial Intelligence (IJCAI'20), pp.31-38. [PDF] 

  • Rundong Wang, Runsheng Yu, Bo An, Zinovi Rabinovich. I^2HRL: Interactive influence-based hierarchical reinforcement learning. Proceedings of the 29th International Joint Conference on Artificial Intelligence (IJCAI'20), pp.3131-3138. [PDF] 

  • Jakub Cerny, Viliam Lisy, Branislav Bošanský, Bo An. Dinkelbach-type algorithm for computing quantal Stackelberg equilibrium. Proceedings of the 29th International Joint Conference on Artificial Intelligence (IJCAI'20), pp.246-253. [PDF] 

  • Lei Feng, Senlin Shu, Zhuoyi Lin, Fengmao Lv, Li Li, Bo An. Can cross entropy loss be robust to label noise? Proceedings of the 29th International Joint Conference on Artificial Intelligence (IJCAI'20), pp.2206-2212. [PDF] 

  • Hongxin Wei, Lei Feng, Xiangyu Chen, Bo An. Combating noisy labels by agreement: A joint training method with co-regularization. Proceedings of the 2020 IEEE Conference on Computer Vision and Pattern Recognition (CVPR'20), pp.13726-13735. [PDF] 

  • Aye Phyu Phyu Aung, Xinrun Wang, Bo An, Xiaoli Li. We mind your well-being: Preventing depression in uncertain social networks by sequential interventions. Proceedings of the 30th International Conference on Automated Planning and Scheduling (ICAPS'20), pp.499-507. [PDF] 

  • Zhenyu Shi, Runsheng Yu, Xinrun Wang, Rundong Wang, Youzhi Zhang, Hanjiang Lai, Bo An. Learning expensive coordination: An event-based deep RL approach. Proceedings of the 2020 International Conference on Learning Representations (ICLR'20). [PDF] 

  • Youzhi Zhang, Bo An. Computing team-maxmin equilibria in zero-sum multiplayer extensive-form games. Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI'20), pp.2318-2325. [PDF] 

  • Xiaobo Ma, Bo An, Mengchen Zhao, Xiapu Luo, Lei Xue, Zhenhua Li, Tony Miu, Xiaohong Guan. Randomized security patrolling for link flooding attack detection. IEEE Transactions on Dependable and Secure Computing, Vol.7, No.4, pp.940-955, 2020. [PDF] 

  • Wanyuan Wang, Bo An, Yichuan Jiang. Optimal spot-checking for improving the evaluation quality of crowdsourcing: Application to peer grading systems. IEEE Transactions on Computational Social Systems, Vol.17, No.4, pp.795-812, 2020. [PDF] 

  • Jiuchuan Jiang, Bo An, Yichuan Jiang, Donghui Lin. Context-aware reliable crowdsourcing in social networks. IEEE Transactions on Systems, Man, and Cybernetics: Systems, Vol.50, No.2, pp.617-632, 2020. [PDF] 

2019

  • Jiarui Gan, Qingyu Guo, Long Tran-Thanh, Bo An, Michael Wooldridge. Manipulating a learning defender and ways to counteract. Proceedings of the 33rd Annual Conference on Neural Information Processing Systems (NeurIPS'19), pp.8272-8281. [PDF] 

  • Haipeng Chen, Yan Jiao, Zhiwei Qin, Xiaocheng Tang, Hao Li, Bo An, Hongtu Zhu, Jieping Ye. InBEDE: Integrating contextual bandit with td learning for joint pricing and dispatch of ride-hailing platforms. Proceedings of the 19th IEEE International Conference on Data Mining (ICDM'19), pp.61-70. [PDF] 

  • Xinrun Wang, Milind Tambe, Branislav Bosansky, Bo An. When players affect target values: Modeling and solving dynamic partially observable security games. Proceedings of the 10th Conference on Decision and Game Theory for Security (GameSec'19), pp.542-562. [PDF] 

  • Yoav Ben Yaakov, Xinrun Wang, Joachim Meyer, Bo An. Choosing protection: User investments in security measures for cyber risk management. Proceedings of the 10th Conference on Decision and Game Theory for Security (GameSec'19), pp.33-44. [PDF] 

  • Lei Feng, Bo An. Partial label learning by semantic difference maximization. Proceedings of the 28th International Joint Conference on Artificial Intelligence (IJCAI'19), pp.2294-2300. [PDF] 

  • Xinrun Wang, Bo An, Hau Chan. Who should pay the cost: A game-theoretic model for government subsidized investments to improve national cybersecurity. Proceedings of the 28th International Joint Conference on Artificial Intelligence (IJCAI'19), pp.6020-6027. [PDF] 

  • Wei Qiu, Haipeng Chen, Bo An. Dynamic electronic toll collection via multi-agent deep reinforcement learning with edge-based graph convolutional network representation. Proceedings of the 28th International Joint Conference on Artificial Intelligence (IJCAI'19), pp.4568-4574. [PDF] 

  • Jiang Rong, Tao Qin, Bo An. Competitive bridge bidding with deep neural networks. Proceedings of the 18th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'19), pp.16-24. [PDF] 

  • Wanyuan Wang, Zichen Dong, Bo An, Yichuan Jiang. Efficient city-scale patrolling using decomposition and grafting. Proceedings of the 18th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'19), pp.2259-2261. [PDF] 

  • Qingyu Guo, Zhao Li, Bo An, Pengrui Hui, Jiaming Huang, Long Zhang, Mengchen Zhao. Securing the deep fraud detector in large-scale e-commerce platform via adversarial machine learning approach. Proceedings of the 2019 Web Conference (WWW'19), pp.616-626. [PDF] 

  • Lei Feng, Bo An. Partial label learning with self-guided retraining. Proceedings of the 33rd AAAI Conference on Artificial Intelligence (AAAI'19), pp.3542-3549. [PDF] 

  • Lei Feng, Bo An, Shuo He. Collaboration based multi-label learning. Proceedings of the 33rd AAAI Conference on Artificial Intelligence (AAAI'19), pp.3550-3557. [PDF] 

  • Youzhi Zhang, Qingyu Guo, Bo An, Long Tran-Thanh, Nicholas Jennings. Optimal interdiction of urban criminals with the aid of real-time information. Proceedings of the 33rd AAAI Conference on Artificial Intelligence (AAAI'19), pp.1262-1269. [PDF] [Appendix] 

  • Qingyu Guo, Jiarui Gan, Fei Fang, Long Tran-Thanh, Milind Tambe, Bo An. On the inducibility of Stackelberg equilibrium in security games. Proceedings of the 33rd AAAI Conference on Artificial Intelligence (AAAI'19), pp.2020-2028. [PDF] 

  • Jan Karwowski, Jacek Mandziuk, Adam Zychowski, Filip Grajek, Bo An. A memetic approach for sequential security games on a plane with moving targets. Proceedings of the 33rd AAAI Conference on Artificial Intelligence (AAAI'19), pp.970-977. [PDF] 

  • Jiuchuan Jiang, Bo An, et al. Batch allocation for tasks with overlapping skill requirements in crowdsourcing. IEEE Transactions on Transactions on Parallel and Distributed Systems, Vol.30, No.8, pp.1722-1737, 2019. [PDF] 

  • Wanyuan Wang, Zhanpeng He, Peng Shi, Weiwei Wu, Yichuan Jiang, Bo An, et al. Strategic social team crowdsourcing: Forming a Team of truthful workers for crowdsourcing in social networks. IEEE Transactions on Mobile Computing, Vol.18, No.6, pp.1419-1432, 2019. [PDF] 

2018

  • Lei Feng, Bo An. Leveraging latent label distributions for partial label learning. Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI'18), pp.2107-2113. [PDF] 

  • Mengchen Zhao, Zhao Li, Bo An, Haifeng Lu, Yifan Yang, Chen Chu. Impression allocation for combating fraud in e-commerce via deep reinforcement learning with action norm penalty. Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI'18), pp.3940-3946. [PDF] 

  • Arunesh Sinha, Fei Fang, Bo An, Christopher Kiekintveld, Milind Tambe. Stackelberg security games: Looking beyond a decade of success. Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI'18), pp.5494-5501. [PDF] 

  • Kai Wang, Qingyu Guo, Phebe Vayanos, Milind Tambe, Bo An. Equilibrium refinement in security games with arbitrary scheduling constraints. Proceedings of the 17th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'18), pp.919-927. [PDF] 

  • Qingyu Guo, Jiarui Gan, Fei Fang, Long Tran-Thanh, Milind Tambe, Bo An. Inducible equilibrium for security games. Proceedings of the 17th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'18), pp.1947-1949. [PDF] 

  • Yanhai Xiong, Haipeng Chen, Mengchen Zhao, Bo An. HogRider: Champion agent of Microsoft Malmo collaborative AI challenge. Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI'18), pp.4767-4774. [PDF] 

  • Wanyuan Wang, Bo An, Yichuan Jiang. Optimal spot-checking for improving evaluation accuracy of peer grading systems. Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI'18), pp.833-840. [PDF] 

  • Mengchen Zhao, Bo An, Yaodong Yu, Sulin Liu, Sinno Jialin Pan. Data poisoning attacks on multi-task relationship learning. Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI'18), pp.2628-2635. [PDF] 

  • Xinrun Wang, Bo An, Martin Strobel, Fookwai Kong. Catching Captain Jack: Efficient time and space dependent patrols to combat oil-siphoning in international waters. Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI'18), pp.208-215. [PDF] 

  • Haipeng Chen, Bo An, Guni Sharon, Josiah Hanna, Peter Stone, Chunyan Miao, Yeng Chai Soh. DyETC: Dynamic electronic toll collection for traffic congestion alleviation. Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI'18), pp.757-765. [PDF] 

  • Jiang Rong, Tao Qin, Bo An. Dynamic pricing for reusable resources in competitive market with stochastic demand. Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI'18), pp.4718-4726. [PDF] 

  • Yue Yin, Yevgeniy Vorobeychik, Bo An, Noam Hazon. Optimal defense against election control by deleting voter groups. Artificial Intelligence, Vol.259, pp.32-51, 2018. [PDF]  

  • Yanhai Xiong, Jiarui Gan, Bo An, Chunyan Miao, Ana Bazzan. Optimal electric vehicle fast charging station placement based on game theoretical framework. IEEE Transactions on Intelligent Transportation Systems, Vol.19, No.8, pp.2493-2504, 2018. [PDF] 

  • Jiuchuan Jiang, Bo An, Yichuan Jiang, Donghui Lin, Zhan Bu, Jie Cao. Understanding crowdsourcing systems from a multiagent perspective and approach. ACM Transactions on Autonomous and Adaptive Systems, Vol.13, No.2, pp.1--32, 2018. [PDF] 

  • Bo An, Nicholas R. Jennings, Zhenhui Jessie Li. ACM TIST Special Issue on Urban Intelligence. ACM Transactions on Intelligent Systems and Technology, Vol.9, No.3, Article 4, 2018. [PDF]  

  • Bo An. Review of "Predicting Human Decision-Making" by Rosenfeld and Kraus. Artificial Intelligence, Vol.263, pp.1-2, 2018. [PDF] 

2017

  • Bo An. Game theoretic analysis of security and sustainability. Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI'17), pp.5111-5115, accompanying paper for the Early Career Spotlight invited talk. [PDF] [Slides] 

  • Qingyu Guo, Bo An, Long Tran-Than. Playing repeated network interdiction games with semi-bandit feedback. Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI'17), pp.3682-3690. [PDF] 

  • Qingyu Guo, Bo An, Branislav Bosansky, Christopher Kiekintveld. Comparing strategic secrecy and Stackelberg commitment in security games. Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI'17), pp.3691-3699. [PDF] 

  • Mengchen Zhao, Bo An, Wei Gao, Teng Zhang. Efficient label contamination attacks against black-box learning models. Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI'17), pp.3945-3951. [PDF] 

  • Youzhi Zhang, Bo An, Long Tran-Thanh, Nicholas R. Jennings, Zhen Wang, Jiarui Gan. Optimal escape interdiction on transportation networks. Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI'17), pp.3936-3944. [PDF] 

  • Shuxin Li, Xiaohong Li, Jianye Hao, Bo An, Zhiyong Feng, Kangjie chen, Chengwei Zhang. Defending against man-in-the-middle attack in repeated games. Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI'17), pp.3742-3748. [PDF] 

  • Xinrun Wang, Qingyu Guo, Bo An. Stop nuclear smuggling through efficient container inspection. Proceedings of the 16th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'17), pp.669-677. [PDF] 

  • Jiang Rong, Tao Qin, Bo An and Tie-Yan Liu. Pricing optimization for selling reusable resources. Proceedings of the 16th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'17), pp.1719-1721. [PDF] 

  • Jiarui Gan, Bo An, Yevgeniy Vorobeychik, Brian Gauch. Security games on a plane. Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI'17), pp.530-536. [PDF] 

  • Jiang Rong, Tao Qin, Bo An and Tie-Yan Liu. Revenue maximization for finitely repeated ad auctions. Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI'17), pp.663-669. [PDF] 

  • Shanshan Feng, Gao Cong, Bo An and Yeow Meng Chee. POI2Vec: Geographical latent representation for predicting future visitors. Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI'17), pp.102-108. [PDF] 

  • Xiaohong Li, Shuxin Li, Jianye Hao, Zhiyong Feng, Bo An. Optimal personalized defense strategy against man-in-the-middle attack main information. Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI'17), pp.593-599. [PDF] 

  • Galit Haim, Kobi Gal, Bo An, Sarit Kraus. Human-computer negotiation in a three player market setting. Artificial Intelligence, Vol.246, pp.34-52, 2017. [PDF]  

  • Wanyuan Wang, Jiuchuan Jiang, Bo An, Yichuan Jiang, Bing Chen. Toward efficient team formation for crowdsourcing in non-cooperative social networks. IEEE Transactions on Cybernetics, Vol.47, No.12, pp.4208-4222, 2017. [PDF] 

  • Haipeng Chen, Bo An, Dusit Niyato, Yengchai Soh, Chunyan Miao. Workload factoring and resource sharing via joint vertical and horizontal cloud federation networks. IEEE Journal on Selected Areas in Communications, Vol.35, No.3, pp.557-570, 2017. [PDF]  

  • Bo An, Haipeng Chen, Noseong Park, V.S. Subrahmanian. Data-driven frequency-based airline profit maximization. ACM Transactions on Intelligent Systems and Technology, Vol.8, No.4, Article 61, 2017. [PDF]  

  • Jiarui Gan, Bo An. Game-theoretic considerations for optimizing taxi system efficiency. IEEE Intelligent Systems, Vol.32, No.3, pp.46-52, 2017. [PDF]  

  • Fei Fang, Thanh H. Nguyen, Rob Pickles, Wai Y. Lam, Gopalasamy R. Clements, Bo An, Amandeep Singh, Milind Tambe, Andrew Lemieux. PAWS – A deployed game-theoretic application to combat poaching. AI Magazine, Vol.38, No.1, pp.23-36. [PDF] 

  • Bo An, Milind Tambe. Stackelberg Security Games (SSG) Basics and Application Overview. Improving Homeland Security Decisions, Cambridge University Press, pp.485-507, 2017. [PDF]  

2016

  • Bo An, Haipeng Chen, Noseong Park, V.S. Subrahmanian. MAP: Frequency-based maximization of airline profits based on an ensemble forecasting approach. Proceedings of the 22nd ACM SIGKDD Conference on Knowledge Discovery and Data (KDD'16), pp.421-430. [PDF]

  • Jiang Rong, Tao Qin, Bo An, Tie-Yan Liu. Modeling bounded rationality for sponsored search auctions. Proceedings of the 22nd European Conference on Artificial Intelligence (ECAI'16), pp.515-523. [PDF]

  • Yue Yin, Bo An. Efficient resource allocation for protecting coral reef ecosystems. Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI'16), pp.531-537. [PDF]

  • Qingyu Guo, Bo An, Yair Zick, Chunyan Miao. Optimal interdiction of illegal network flow. Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI'16), pp.2507-2513. [PDF] [Appendix] 

  • Yue Yin, Yevgeniy Vorobeychik, Bo An, Noam Hazon. Optimally protecting elections. Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI'16), pp.538-545. [PDF]

  • Qingyu Guo, Bo An, Yevgeniy Vorobeychik, Long Tran-Thanh, Jiarui Gan, Chunyan Miao. Coalitional security games. Proceedings of the 15th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'16), pp.159-167. [PDF] [Appendix] 

  • Yanhai Xiong, Jiarui Gan, Bo An, Chunyan Miao, Soh Yeng Chai. Optimal pricing for efficient electric vehicle charging station management. Proceedings of the 15th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'16), pp.749-757. [PDF]

  • Jinhua Song, Yang Gao, Hao Wang, Bo An. Measuring the distance between finite Markov decision processes. Proceedings of the 15th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'16), pp.468-476. [PDF]

  • Jiang Rong, Tao Qin, Bo An, Tie-Yan Liu. Optimal sample size for adword auctions. Proceedings of the 15th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'16), pp.1459-1460. [PDF]

  • Zhen Wang, Yue Yin, Bo An. Computing optimal monitoring strategy for detecting terrorist plots. Proceedings of the 30th AAAI Conference on Artificial Intelligence (AAAI'16), pp.637-643, 2016. [PDF] [Appendix] 

  • Mengchen Zhao, Bo An, Christopher Kiekintveld. Optimizing personalized email filtering thresholds to mitigate sequential spear phishing attacks. Proceedings of the 30th AAAI Conference on Artificial Intelligence (AAAI'16), pp.658-665, 2016. [PDF] [Appendix] 

  • Shangdong Yang, Yang Gao, Bo An, Hao Wang, Xingguo Chen. Efficient average reward reinforcement learning using constant shifting values. Proceedings of the 30th AAAI Conference on Artificial Intelligence (AAAI'16), pp.2258-2264, 2016. [PDF]  

  • Fei Fang, Thanh H. Nguyen, Rob Pickles, Wai Y. Lam, Gopalasamy R. Clements, Bo An, Amandeep Singh, Milind Tambe, Andrew Lemieux. Deploying PAWS: Field optimization of the protection assistant for wildlife security. Proceedings of the 28th Annual Conference on Innovative Applications of Artificial Intelligence (IAAI'16), pp.3966-3973, 2016. Winner of Deployed Innovative Application Award. [PDF] 

  • Yuan Liu, Jie Zhang, Bo An, Sandip Sen. A simulation framework for measuring robustness of incentive mechanisms and its implementation in reputation systems. Journal of Autonomous Agents and Multi-Agent Systems, Vol.30, No.4, pp.581-600, 2016. [PDF] 

  • Bo An, Nicola Gatti, Victor Lesser. Alternating-offers bargaining in one-to-many and many-to-many settings. Annals of Mathematics and Artificial Intelligence, Vol.30, No.4, pp.581-600, 2016. [PDF] 

2015

  • Yue Yin, Haifeng Xu, Jiarui Gan, Bo An, Albert Jiang. Computing optimal mixed strategies for security games with dynamic payoffs. Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI'15), pp.681-687, 2015. [PDF] [Appendix] 

  • Yanhai Xiong, Jiarui Gan, Bo An, Chunyan Miao, Ana Bazzan. Optimal electric vehicle charging station placement. Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI'15), pp.2662-2668, 2015. [PDF]  

  • Jiarui Gan, Bo An, Chunyan Miao. Optimizing efficiency of taxi systems: Scaling-up and handling arbitrary constraints. Proceedings of the 14th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'15), pp.523-531, 2015. [PDF]  

  • Yujing Hu, Yang Gao, Bo An. Learning in multi-agent systems with sparse interactions by knowledge transfer and game abstraction. Proceedings of the 14th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'15), pp.753-761, 2015. [PDF]  

  • Jiang Rong, Tao Qin, Bo An. Computing quantal response equilibrium for sponsored search auctions. Proceedings of the 14th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'15), pp.1803-1804, 2015. [PDF]  

  • Jiarui Gan, Bo An, Yevgeniy Vorobeychik. Security games with protection externality. Proceedings of the 29th AAAI Conference on Artificial Intelligence (AAAI'15), pp.914-920, 2015. [PDF] [Appendix] 

  • Yujing Hu, Yang Gao, Bo An. Accelerating multiagent reinforcement learning by equilibrium transfer. IEEE Transactions on Cybernetics, Vol.45, No.7, pp.1289-1302, 2015. [PDF]  

  • Yujing Hu, Yang Gao, Bo An. Multi-agent reinforcement learning with unshared value functions. IEEE Transactions on Cybernetics, Vol.45, No.4, pp.647-462, 2015. [PDF]  

  • Mengchen Zhao, Bo An, Christopher Kiekintveld. An Initial Study on Personalized Filtering Thresholds in Defending Sequential Spear Phishing Attacks. Proceedings of the 2015 IJCAI Workshop on Behavioral, Economic and Computational Intelligence for Security. [PDF]  

  • Qingyu Guo, Bo An, Andrey Kolobov. Approximation approaches for solving security games with surveillance cost: A preliminary study. Proceedings of the Issues with Deployment of Emerging Agent-based Systems (IDEAS) Workshop, in conjunction with AAMAS'15. [PDF]  

  • Debarun Kar, Fei Fang, Francesco Delle Fave, Nicole Sintov, Arunesh Sinha, Aram Galstyan, Bo An, Milind Tambe. Learning bounded rationality models of the adversary in repeated Stackelberg security games. Proceedings of the Adaptive and Learning Agents (ALA) Workshop, in conjunction with AAMAS'15. [PDF]  

2014

  • Yue Yin, Bo An, Manish Jain. Game-theoretic resource allocation for protecting large public events. Proceedings of the 28th AAAI Conference on Artificial Intelligence (AAAI'14), pp.826-834, 2014. [PDF] 

  • Thanh Nguyen, Amulya Yadav, Bo An, Milind Tambe, Craig Boutilier. Regret-based optimization and preference elicitation for Stackelberg security games with uncertainty. Proceedings of the 28th AAAI Conference on Artificial Intelligence (AAAI'14), pp.756-762, 2014. [PDF]  

  • Yevgeniy Vorobeychik, Bo An, Milind Tambe, Satinder Singh. Computing solutions in infinite-horizon discounted adversarial patrolling games. Proceedings of the International Conference on Automated Planning and Scheduling (ICAPS'14), pp.314-322, 2014. [PDF]  

  • Galit Haim, Kobi Gal, Sarit Kraus, Bo An. Equilibrium strategies for human-computer negotiation in 3-player market settings. Proceedings of the 21st European Conference on Artificial Intelligence (ECAI'14), pp.417-422, 2014. [PDF]  

  • Han Yu, Chunyan Miao, Bo An, Shen Zhiqi, Cyril Leung. Reputation-aware task allocation for human trustees. Proceedings of the 13th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'14), pp.357-364, 2014. [PDF] 

  • Jiarui Gan, Bo An, Chunyan Miao. A scalable algorithm for solving taxi system efficiency optimization. Proceedings of the 13th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'14), pp.1465-1466, 2014. [PDF] 

  • Yue Yin, Bo An, Manish Jain. Dynamic allocation of security resources for protecting targets with varying values. Proceedings of the 13th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'14), pp.1473-1474, 2014. [PDF]  

  • Yuan Liu, Jie Zhang, Bo An, Sandip Sen. A practical robustness measure of incentive mechanisms. Proceedings of the 13th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'14), pp.1379-1380, 2014. [PDF]  

  • Qiong Wu, Chunyan Miao, Bo An. Modeling curiosity in virtual companions to improve human learners' learning experience. Proceedings of the 13th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'14), pp.1401-1402, 2014. [PDF]  

  • Ya'akov (Kobi) Gal, Avi Rosenfeld, Sarit Kraus, Michele Gelfand, Bo An, Jun Lin. A new paradigm for the study of corruption in different cultures. Proceedings of the 2014 International Conference on Social Computing, Behavioral-Cultural Modeling, & Prediction (SBP'14), pp.261-268, 2014. [PDF] 

  • Jiang Rong, Tao Qin, Bo An. Quantal response equilibrium for sponsored search auctions: Computation and inference. Proceedings of The 10th workshop on Ad Auctions, in conjunction with the 15th ACM Conference on Electronic Commerce (EC), 2014. [PDF] 

  • Matthew Brown, Bo An, Christopher Kiekintveld, Fernando Ordonez, Milind Tambe. An extended study on multi-objective security games. Journal of Autonomous Agents and Multi-Agent Systems, Vol.28, No.1, pp.31-71, 2014. [PDF] 

  • Han Yu, Zhiqi Shen, Chunyan Miao, Bo An, Cyril Leung. Filtering trust opinions through reinforcement learning. Decision Support Systems, Vol.66, pp.102-113, 2014. [PDF]  

  • Milind Tambe, Albert Jiang, Bo An, Manish Jain. Computational game theory for security: Progress and challenges. AAAI Spring Symposium on Spring Symposium on Applied Computational Game Theory, March 2014. [PDF] 

  • Jiarui Gan, Bo An. Minimum support size of the defender's strong Stackelberg equilibrium strategies in security games. AAAI Spring Symposium on Spring Symposium on Applied Computational Game Theory, March 2014. [PDF] 

  • Yue Yin, Bo An, Yevgeniy Vorobeychik, Jun Zhuang. Optimal deceptive strategies in security games: A preliminary study. AAAI Spring Symposium on Spring Symposium on Applied Computational Game Theory, March 2014. [PDF] 

2013

  • Jiarui Gan, Bo An, HaizhongWang, Xiaoming Sun, Zhongzhi Shi. Optimal pricing for improving efficiency of taxi systems. Proceedings of the 23rd International Joint Conference on Artificial Intelligence (IJCAI'13), pp.2811-2818, 2013. [PDF] 

  • Han Yu, Miao Chunyan, Bo An. A reputation management model for resource constrained trustee agents. Proceedings of the 23rd International Joint Conference on Artificial Intelligence (IJCAI'13), pp.418-424, 2013. [PDF] 

  • Bo An, Matthew Brown, Yevgeniy Vorobeychik, Milind Tambe. Security games with surveillance cost and optimal timing of attack execution. Proceedings of the 12th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'13), pp.223-230, 2013. [PDF]  

  • Han Yu, Shen Zhiqi, Chunyan Miao, Bo An. A reputation-aware decision-making approach for improving the efficiency of crowdsourcing systems. Proceedings of the 12th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'13), pp.1315-1316, 2013. [PDF]  

  • Bo An, Fernando Ordonez, Milind Tambe, Eric Shieh, Rong Yang, Craig Baldwin, Joseph DiRenzo, Ben Maule, Garrett Meyer. A deployed quantal response based patrol planning system for the US Coast Guard. Interfaces, Vol.43, No.5, pp.400-420, 2013. [PDF]  

  • Bo An, Nicola Gatti, Victor Lesser. Bilateral bargaining with one--sided uncertain reserve prices. Journal of Autonomous Agents and Multi-Agent Systems, Vol.26, pp.420-455, 2013. [PDF] 

2012

  • Bo An, David Kempe, Christopher Kiekintveld, Eric Shieh, Satinder Singh, Milind Tambe, Yevgeniy Vorobeychik. Security games with limited surveillance. Proceedings of the 26th AAAI Conference on Artificial Intelligence (AAAI'12), pp.1241-1248, July 2012. [PDF] 

  • Eric Shieh, Bo An, Rong Yang, Milind Tambe, Craig Baldwin, Joseph DiRenzo, Ben Maule, Garrett Meyer. PROTECT: An application of computational game theory for the security of the ports of the United States. Proceedings of the 26th AAAI Conference on Artificial Intelligence (AAAI'12), pp.2173-2179, July 2012. [PDF] 

  • Matthew Brown, Bo An, Christopher Kiekintveld, Fernando Ordonez, Milind Tambe. Multi-objective optimization for security games. Proceedings of the 11th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'12), pp.863-870, June 2012. [PDF] 

  • Eric Shieh, Bo An, Rong Yang, Milind Tambe, Craig Baldwin, Joseph DiRenzo, Ben Maule, Garrett Meyer. PROTECT: A deployed game theoretic system to protect the ports of the United States. Proceedings of the 11th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'12), pp.13-20, June 2012. [PDF] 

  • Yevgeniy Vorobeychik, Bo An, Milind Tambe. Adversarial Patrolling Games. Proceedings of the 11th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'12), pp.1307-1308, June 2012. [PDF] 

  • Milind Tambe, Bo An. Game theory for security: A real-world challenge problem for multiagent systems and beyond. Proceedings of the AAAI Spring Symposium on Game Theory for Security, Sustainability and Health, pp.69-74, 2012. [PDF] 

  • Manish Jain, Bo An, Milind Tambe. An overview of recent application trends at the AAMAS conference: security, sustainability and safety. AI Magazine, Vol.33, No.3, pp.14-28, 2012. [PDF]  

  • Bo An, Eric Shieh, Rong Yang, Milind Tambe, Craig Baldwin, Joseph DiRenzo, Ben Maule, Garrett Meyer. PROTECT - A deployed game theoretic system for strategic security allocation for the United States Coast Guard. AI Magazine, Vol.33, No.4, pp.96-110, 2012. [PDF]  

2011

  • Bo An, Milind Tambe, Fernando Ordonez, Eric Shieh, Christopher Kiekintveld. Refinement of strong Stackelberg equilibria in security games. Proceedings of the 25th AAAI Conference on Artificial Intelligence (AAAI'11), pp.587-593, August 2011. [PDF] 

  • Bo An, Victor Lesser, David Westbrook, Michael Zink. Agent-mediated multi-step optimization for resource allocation in distributed sensor networks. Proceedings of the 10th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'11), pp.609-616, May 2011. [PDF] 

  • Bo An, Victor Lesser. Negotiation over decommitment penalty. Proceedings of the 10th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'11), pp.1101-1102, May 2011. [PDF] 

  • Bo An, James Pita, Eric Shieh, Milind Tambe, Christopher Kiekintveld, Janusz Marecki. GUARDS and PROTECT: Next Generation Applications of Security Games. ACM SIGecom Exchanges, Vol.10, No.1, pp.31-34, March 2011. [PDF] 

  • Bo An, Manish Jain, Milind Tambe, Christopher Kiekintveld. Mixed-initiative optimization in security games: A preliminary report. AAAI Spring Symposium on Help Me Help You: Bridging the Gaps in Human-Agent Collaboration, pp.8-11, March 2011. [PDF] 

  • Bo An, Victor Lesser, Kwang Mong Sim. Strategic agents for multi-resource negotiation. Journal of Autonomous Agents and Multi-Agent Systems, Vol.23, pp.114-153, 2011. [PDF]  

2010

  • Bo An, Victor Lesser, David Irwin, Michael Zink. Automated negotiation with decommitment for dynamic resource allocation in cloud computing. Proceedings of the 9th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'10), pp.981-988, May 2010. [PDF] 

  • Bo An, Nicola Gatti, Victor LesserSearching for pure strategy equilibria in bilateral bargaining with one-sided uncertainty. Proceedings of the 9th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'10), pp.1607-1608, May 2010. Technical Report.  

  • Bo An, Victor Lesser. Characterizing contract-based multi-agent resource allocation in networks. IEEE Transactions on Systems, Man and Cybernetics, Part B: Cybernetics, Vol. 40, No. 3, pp. 575-586, June 2010. [PDF] 

2009

  • Bo An, Nicola Gatti, Victor Lesser. Bilateral bargaining with one-sided two-type uncertainty. Proceedings of the 2009 IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT'09), Sep. 2009. [PDF] 

  • Bo An, Nicola Gatti, Victor Lesser. Extending alternating-offers bargaining in one-to-many and many-to-many settings. Proceedings of the 2009 IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT'09), Sep. 2009. [PDF] 

  • Kwang Mong Sim, Bo An. Evolving best-response strategies for market-driven agents using aggregative fitness GA. IEEE Transactions on Systems, Man and Cybernetics, Part C, Vol. 39, No. 3, pp. 284-298, May 2009. [PDF] 

2008

  • Bo An, Fred Douglis, Fan Ye. Heuristics for negotiation schedules in multi-plan optimization. Proceedings of the Seventh International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'08), pp.551-558, 2008. [PDF]  

  • Bo An, Victor Lesser, Kwang Mong Sim. Decommitment in multi-resource negotiation. Proceedings of the Seventh International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'08), pp.1553-1556, 2008. [PDF] 

  • Bo An, Kwang Mong Sim, Chunyan Miao, Zhiqi Shen. Decision making of negotiation agents using Markov Chains. Multiagent and Grid Systems Journal, Vol. 4, No. 1, pp. 5-23, 2008. [PDF]  

2007

  • Bo An, Chunyan Miao, Zhiqi Shen. Market based resource allocation with incomplete information. Proceedings of the Twentieth International Joint Conference on Artificial Intelligence (IJCAI'07), pp.1193-1198, 2007. [PDF] 

  • Michael Krainin, Bo An, Victor Lesser. An application of automated negotiation to distributed task allocation. Proceedings of the 2007 IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT'07), pp.138-145, 2007. [PDF] 

  • Bo An, Zhiqi Shen, Chunyan Miao, Daijie Cheng. Algorithms for transitive dependence based coalition formation. IEEE Transactions on Industrial Informatics, Vol. 3, No. 3, pp. 234-245, Aug. 2007. [PDF] 

  • Bo An, Kwang Mong Sim, Victor Lesser. Evolving the best-response strategy to decide when to make a proposal. Proceedings of the 2007 IEEE Congress on Evolutionary Computation, pp.1035-1042, 2007. [PDF] 

2006

  • Bo An, Kwang Mong Sim, Lianggui Tang, Shuangqing Li, Daijie Cheng. Continuous time negotiation mechanism for software agents. IEEE Transactions on Systems, Man and Cybernetics, Part B: Cybernetics, Vol. 36, No. 6, pp. 1261-1272, Dec. 2006. [PDF] 

  • Bo An, Chunyan Miao, Daijie Cheng. A coalition formation framework based on transitive dependence. IEICE Transactions on Information and Systems, Vol.E88-D, No.12, pp.2672-2680, Dec. 2005. [PDF] 


Thesis Supervised

  • Wanqi Xue, 2024. Robust and Adaptive Decision-Making: A Reinforcement Learning Perspective. PhD Thesis. [PDF] 

  • Yanchen Deng, 2023. Empowering Distributed Constraint Optimization with Deep Learning. PhD Thesis. [PDF] 

  • Wei Qiu, 2023. Multi-agent Reinforcement Learning for Complex Sequential Decision-making. PhD Thesis. [PDF] 

  • Rundong Wang, 2023. Towards Efficient Cooperation within Learning Agents. PhD Thesis. [PDF] 

  • Jakub Cerny, 2023. Commitment and Coordination in Boundedly Rational Interactions. PhD Thesis. [PDF] 

  • Hongxin Wei, 2023. Natural Robustness Of Machine Learning in the Open World. PhD Thesis. [PDF] 

  • Aye Phyu Phyu Aung, 2022. Solving Large-scale Planning and Deep Learning Problems. PhD Thesis. [PDF] 

  • Xu He, 2021. Recommendation via Reinforcement Learning Methods. PhD Thesis. [PDF] 

  • Lei Feng, 2021. Advanced Topics in Weakly Supervised Learning. PhD Thesis. [PDF] 

  • Youzhi Zhang, 2020. Computing Team-Maxmin Equilibria in Zero-Sum Multiplayer Games. PhD Thesis. [PDF] 

  • Xinrun Wang, 2019. Defending on Networks: Applying Game Theory to Prevent Illegal Activities in Structured Security Domains. PhD Thesis. [PDF] 

  • Jiuchuan Jiang, 2019. Complex Task Allocation for Crowdsourcing in Social Network Context. PhD Thesis. [PDF] 

  • Rong Jiang, 2018. Equilibrium Computation and Revenue Optimization in Internet Applications. PhD Thesis. [PDF] 

  • Mengchen Zhao, 2018. Advanced Attack and Defense Technique in Machine Learning Systems. PhD Thesis. [PDF] 

  • Haipeng Chen, 2018. Large Scale Strategic Decision Making in Multi-Agent Systems. PhD Thesis. [PDF] 

  • Qingyu Guo, 2018. Combating Adversaries in Network-structured Security Domains. PhD Thesis. [PDF] 

  • Yanhai Xiong, 2018. Electric Vehicle Charging Station Placement and Management. PhD Thesis. [PDF] 

  • Yue Yin, 2016. Security Games with Complex Payoff Structures. PhD Thesis. [PDF] 

  • Jiarui Gan, 2015. A Game Theoretic Approach for Optimal Pricing of Taxi Markets. Master Thesis. [PDF] 


Some Articles in Chinese

  • Bo An, Jiarui Gan, 2021. 安全博弈. [PDF] 

  • Bo An, 2021. 分布式人工智能简介. [PDF]