Skip to content

Commit 568595a

Browse files
paper update
1 parent dcd3aa0 commit 568595a

File tree

1 file changed

+5
-1
lines changed

1 file changed

+5
-1
lines changed

research.markdown

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,11 @@ Find us on [Github](https://github.com/Algorithmic-Alignment-Lab).
1111

1212
### 2022
1313

14-
Christoffersen, P.J.K., Haupt, A.A, Hadfield-Menell, D. (2022). [Get It in Writing: Formal Contracts Mitigate Social Dilemmas in Multi-Agent RL](https://arxiv.org/abs/2208.10469).
14+
Räukur, T., Ho, A., Casper, S., & Hadfield-Menell, D. (2022). [Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks](https://arxiv.org/abs/2207.13243). arXiv preprint arXiv:2207.13243. [BibTeX](https://scholar.googleusercontent.com/scholar.bib?q=info:6IDnKqjNOrcJ:scholar.google.com/&output=citation&scisdr=CgUBYGTzEPyMg5PIZuc:AAGBfm0AAAAAYxjOfudLK6ychKhzX_GGjk7JydhRaQBs&scisig=AAGBfm0AAAAAYxjOfpUSteParaaZUb0Baq11kd8bT7oX&scisf=4&ct=citation&cd=-1&hl=en)
15+
16+
Casper, S., Hadfield-Menell, D., Kreiman, G (2022). [White-Box Adversarial Policies in Deep Reinforcement Learning](https://arxiv.org/abs/2209.02167).
17+
18+
Christoffersen, P.J.K., Haupt, A.A, Hadfield-Menell, D. (2022). [Get It in Writing: Formal Contracts Mitigate Social Dilemmas in Multi-Agent RL](https://arxiv.org/abs/2208.10469). [BibTeX](https://scholar.googleusercontent.com/scholar.bib?q=info:rctroivbpiAJ:scholar.google.com/&output=citation&scisdr=CgUBYGTzEPyMg5PIHNw:AAGBfm0AAAAAYxjOBNz2-vpDhjCI_wJ1FUgMgTwUEa8f&scisig=AAGBfm0AAAAAYxjOBPuUsEPypykSIeu3v7C_ZNMSKwx8&scisf=4&ct=citation&cd=-1&hl=en)
1519

1620
Yew, R.J. and Hadfield-Menell, D. (2022). [A Penalty Default Approach to Preemptive Harm Disclosure and Mitigation for AI Systems](https://dl.acm.org/doi/10.1145/3514094.3534130). In Proceedings of the 5th AAAI/ACM Conference on AI, Ethics, and Society. [BibTeX](https://scholar.googleusercontent.com/scholar.bib?q=info:Zy8cJGbw9QUJ:scholar.google.com/&output=citation&scisdr=CgWTYX5AEPyMg45o47g:AAGBfm0AAAAAYwVu-7hfL7sgjbex8wF3U-g2nDKsY20o&scisig=AAGBfm0AAAAAYwVu-y80HvtCEX2eXNg2NM7Ki7kE-BiC&scisf=4&ct=citation&cd=-1&hl=en)
1721

0 commit comments

Comments
 (0)