Let's conduct a survey on the quality of MIRI's implementation
By Robert_Wiblin @ 2016-02-19T07:18 (+11)
Many people here, myself included, are very concerned about the risks from rapidly improving artificial general intelligence (AGI). A significant fraction of people in that camp give to the Machine Intelligence Research Institute, or recommend others do so.
Unfortunately, for those who lack the necessary technical expertise, this is partly an act of faith. I am in some position to evaluate the arguments about whether safe AGI is an important cause. I'm also in some position to evaluate the general competence and trustworthiness of the people working at MIRI. On those counts I am satisfied, though I know not everyone is.
However, I am in a poor position to evaluate:
- The quality of MIRI's past research output.
- Whether their priorities are sensible or clearly dominated by alternatives.
One way to address this would be for someone to run a confidential survey of people who are better placed to judge. I suggest that whoever runs it should:
- Have an existing reputation for trustworthiness and confidentiality.
- Think that AI risk is an important cause, but have no particular convictions about the best approach or organisation for dealing with it. They shouldn't have worked for MIRI in the past, but will presumably have some association with the general rationality or AI community.
- Involve 10-20 people, including a sample of present and past MIRI staff, people at organisations working on related problems (CFAR, FHI, FLI, AI Impacts, CSER, OpenPhil, etc.), and largely unconnected math/AI/CS researchers.
- Results should be compiled by two or three people - ideally with different perspectives - who will summarise the results in such a way that nothing in the final report could identify what any individual wrote (unless they are happy to be named). Their goal should be purely to represent the findings faithfully, given the constraints of brevity and confidentiality.
- The survey should ask about:
- Quality of past output.
- Suitability of staff for their roles.
- Quality of current strategy/priorities.
- Quality of operations and other non-research aspects of implementation, etc.
- How useful more funding/staff would be.
- Comparison with the value of work done by other related organisations.
- Suggestions for how the work or strategy could be improved.
- Obviously participants should only comment on what they know about. The survey should link to MIRI's strategy and recent publications.
- MIRI should be able to suggest people to be contacted, but so should the general public through an announcement. They should also have a chance to comment on the survey itself before it goes out. Ideally it would be checked by someone who understands good survey design, as subtle aspects of wording can be important.
- It should be impressed on participants that being open and thoughtful in their answers will maximise the chances of solving the problem of AI risk in the long run.
undefined @ 2016-02-20T03:32 (+21)
Thanks for the write-up, Rob. OpenPhil actually decided to evaluate our technical agenda last summer, and Holden put Daniel Dewey on the job. The report isn't done yet, in part because it has proven very time-intensive to fully communicate the reasoning behind our research priorities, even to someone with as much understanding of the AI landscape as Daniel Dewey. Separately, we have plans to get an independent evaluation of our organizational efficacy started later in 2016, which I expect to be useful for our admin team as well as prospective donors.
FYI, when it comes to evaluating our research progress, I doubt that the methods you propose would get you much Bayesian evidence. Our published output will look like round pegs shoved into square holes regardless of whether we're doing our jobs well or poorly, because we're doing research that doesn't fit neatly into an existing academic niche. Our objective is to make direct progress on what appear to us to be the main neglected technical obstacles to developing reliable AI systems in the long term, with a goal of shifting the direction of AI research in a big way once we hit certain key research targets; and we're specifically targeting research that isn't compatible with industry's economic incentives or academia's publish-or-perish incentives. To get information about how well we're doing our jobs, I think the key questions to investigate are (1) whether we've chosen good research targets; and (2) whether we're making good progress towards them.
We've been focusing our communication efforts mainly on helping people evaluate (1): I've been working on explaining our approach and agenda, and OpenPhil is also on the job. To investigate (2), we'd need to spend a sizable chunk of time with mathematically adept evaluators — we still haven't hit any of our key research targets, which means that evaluating our progress requires understanding our smaller results and why we think they're progress towards the big results. In practice, we've found that explaining this usually requires explaining why we think the big targets are vital, as this informs (e.g.) which shortcuts are and are not acceptable. I plan to wait until after the OpenPhil report is finished before taking on another time-intensive eval.
Fortunately, (2) will become much easier to evaluate as we achieve (or persistently fail to achieve) those key targets. This also provides us with an opportunity to test our approach and methodology. People who understand our approach and find it uncompelling often predict that some of the results we're shooting for cannot be achieved. This means we'll get some evidence about (1) as we learn more about (2). For example, last year I mentioned "naturalized AIXI" as an ambitious 5-year research target. If we are not able to make concrete progress towards that goal, then over the next four years, I will lose confidence in our approach and eventually change our course dramatically. Conversely, if we make discoveries that are important pieces of that puzzle, I'll update in favor of us being onto something, especially if we find puzzle pieces that knowledgeable critics predicted we wouldn’t find. This data will hopefully start rolling in soon, now that our research team is getting up to size.
("Concrete progress" / "important puzzle pieces" in this case are satisfactory asymptotic algorithms for any of: (1) reasoning under logical uncertainty; (2) identifying the best available decision with respect to a utility function; (3) performing induction from inside an environment; (4) identifying the referents of goals in realistic world-models; and (5) reasoning about the behavior of smarter reasoners; the last of which is hopefully a subset of 1 and 2. The linked papers give rough descriptions of what counts as 'satisfactory' in each case; I'll work to make the desiderata more explicit as time goes on.)
undefined @ 2016-02-19T17:49 (+8)
I think it's probably quite important to define in advance what sorts of results would convince us that MIRI's performance is either sufficient or insufficient. Otherwise I expect people already committed to some belief about MIRI's performance to treat the survey as evidence for that belief, even while someone with the opposite belief treats it as evidence for theirs.
Relatedly, I also worry about the uniqueness of the problem and how it might change what we consider a cause worth donating to. Although you don't seem to be the sort of person who could understand MIRI's arguments, see no flaws, and still be inclined to say "I can't be sure this is the right way to go", I expect that many people are averse to donating to causes like MIRI because the effectiveness of the proposed interventions does not admit of simple testing.

With existential risks, empirical testing is often impossible in the traditional sense, although sometimes possible in a limited sense. Results about sub-existential pandemic risk are probably at least somewhat relevant to the study of existential pandemic risk, for example. But it's not the same as distributing bed nets, looking at malaria incidence, adjusting, reobserving, and so on. We can't perform an action, look through a time warp, and see whether or not the world ends in the future.

What I'm getting at is that, even if this is not really the nature of these problems, even if interventions on them are not actually untestable, we might imagine the implications if they were genuinely untestable. I think there are some people who would refuse to donate to existential risk charities merely because other charities have interventions that can be tested for effectiveness, and this concerns me. If it is not by human failing that we don't test the effectiveness of our interventions, but by the nature of the problem that such testing is impossible, do you choose to do nothing? That is not a rhetorical question. I genuinely believe that we are confused about this, and that MIRI is an example of a cause that may be difficult to evaluate without resolving this confusion.

This is related to ambiguity aversion in cognitive science and decision theory. Ambiguity aversion appears in choices between betting on known and unknown risks, not in choices about whether to bet on an unknown risk in a non-comparative context. But effective altruists consider almost all charitable decisions within the context of cause prioritization, so we might expect EAs to encounter more comparative contexts than a random philanthropist, and thus to show more bias against causes with ambiguity, even if the survey itself would technically focus on one cause. It's noteworthy that the expected utility formalism and human behavior differ here: the formalism prescribes indifference between bets with known and unknown probabilities when the bets have the same payoffs. (In reality the situation is not even this clean, since the payoffs of successfully intervening on malaria incidence and on human extinction are hardly equal.) I think we must genuinely ask whether we should be averse to ambiguity in general, attempt to explain why this heuristic was evolutionarily adaptive, and see whether existential risk is a case where we should, or should not, use ambiguity aversion as a heuristic. After all, a humanity that attempts no interventions on existential risk merely because it cannot test their effectiveness is a humanity that ignores existential risk and goes extinct for it, even if we believed we were being virtuous philanthropists the entire time.
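As a rough illustration of that indifference claim, here is a stylised Ellsberg-style example with hypothetical payoffs (my own toy numbers, nothing from the discussion above):

```latex
\documentclass{article}
\usepackage{amsmath,amssymb}
\begin{document}
% Stylised Ellsberg-style illustration with hypothetical payoffs.
% Bet A: win utility $u$ with a known probability of $1/2$.
% Bet B: win utility $u$ with an unknown probability $p$, under a symmetric
% prior with $\mathbb{E}[p] = 1/2$.
\[
  \mathbb{E}U(A) = \tfrac{1}{2}\,u,
  \qquad
  \mathbb{E}U(B) = \mathbb{E}[p]\,u = \tfrac{1}{2}\,u,
  \qquad\text{hence}\qquad
  \mathbb{E}U(A) = \mathbb{E}U(B).
\]
% The formalism is indifferent between A and B, even though in practice most
% people strictly prefer the bet with the known probability.
\end{document}
```

Ambiguity aversion is precisely the empirical tendency to prefer Bet A despite that equality.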
undefined @ 2016-02-19T11:55 (+5)
I admire the motivation, but worry about selection effects.
I'd guess the median computer science professor hasn't heard about MIRI's work. Within the class of people who know about MIRI-esque issues, I'd guess knowledge of MIRI and enthusiasm about MIRI will be correlated: if you think FAI is akin to overpopulation on Mars, you probably won't be paying close attention to the field. Thus those in a position to comment intelligently on MIRI's work will be selected (in part) for being favourably disposed to the idea behind it.
That isn't necessarily a showstopper, and it may be worth doing regardless. Perhaps multiple different attempts to gather relevant opinion ('survey' might be too strong a term) on the various points could be a good strategy. E.g.:
Similar to the FHI/MIRI timelines research, surveying computer scientists on their perceptions of AI risk and the importance of alignment would provide helpful data.
Folks at MIRI and peer organisations could provide impressions of their organisational efficacy. This sort of 'organisational peer review' could help MIRI improve. Reciprocal arrangements in which groups within EA review each other's performance and suggest improvements could be a valuable activity going forward.
For technical facility, one obvious port of call would be the academics who remarked on the probabilistic set theory paper, as well as MIRI workshop participants (especially those who did not end up working at MIRI). As a general metric, given MIRI's focus on research, a comparison of publications per dollar or per FTE research staff with other academic bodies would be interesting. My hunch is this would be unflattering to MIRI (especially when narrowing down to more technical/math-heavy work), but naively looking at publication count may do MIRI a disservice, given it is working in weird and emerging branches of science.
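To make that metric concrete, here is a toy sketch (every figure below is a placeholder I have made up, not a real number for MIRI or anyone else):

```python
# Toy publications-per-dollar / per-FTE comparison. All figures are
# placeholders; substitute real publication counts, budgets and researcher
# headcounts before drawing any conclusions.

orgs = {
    # name: (publications over the period, budget in USD, FTE researchers)
    "Org A": (12, 1_000_000, 5),
    "Org B": (45, 4_000_000, 20),
}

for name, (pubs, budget, fte) in orgs.items():
    per_100k = pubs / (budget / 100_000)
    per_fte = pubs / fte
    print(f"{name}: {per_100k:.2f} publications per $100k, {per_fte:.2f} per FTE researcher")
```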
Another possibility, instead of surveying people who already know about MIRI (with the attendant selection worries), is to pay someone independent to get to know them. I know GiveWell made a fairly adverse review of MIRI's performance a few years ago; I'd be interested to hear what they think about them now. I'm unaware of 'academic auditors', but it might not be unduly costly to commission domain experts to have a look at the relevant issues. Someone sceptical of MIRI might suggest that this function is usually performed by academia at large, and that MIRI's relatively weak connection to academia in these technical fields is a black mark against it (albeit one I know they are working to correct).
undefined @ 2016-02-19T19:22 (+3)
A survey like this is probably a good idea, although it might not give us any evidence that isn't already publicly available. A non-AI risk expert already has quite a few indicators about MIRI's quality:
- It has gotten several dozen papers accepted to conferences.
- Some of these papers have a decent number of citations; many have around five. (You can find citation counts on Google Scholar, but I don't know a good way to get this information other than manually searching for each paper; see the sketch after this list.) Many of the citations are by other MIRI papers, and most are by people at MIRI, FHI, CSER or associated groups, probably because these are the only groups doing real work on AI risk.
- MIRI regularly collaborates with other organizations or individuals working on AI risk, which suggests that these people value MIRI's contributions.
- Stuart Russell, one of the world's leading AI researchers, sits on MIRI's advisory board and appears to have plans to collaborate with them.
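On the citation point above, here is a minimal sketch of how rough citation counts could be pulled programmatically rather than by hand. It queries the Semantic Scholar search API instead of Google Scholar (which has no official API), so its counts will differ from Scholar's, and the paper titles are placeholders to be swapped for MIRI's actual publication list.

```python
# Rough sketch: look up citation counts via the public Semantic Scholar API.
# Counts will not match Google Scholar exactly; titles below are placeholders.
import requests

API_URL = "https://api.semanticscholar.org/graph/v1/paper/search"

def citation_count(title):
    """Return the citation count of the top search result for a title, or None."""
    resp = requests.get(
        API_URL,
        params={"query": title, "fields": "title,citationCount", "limit": 1},
        timeout=30,
    )
    resp.raise_for_status()
    results = resp.json().get("data", [])
    return results[0]["citationCount"] if results else None

if __name__ == "__main__":
    # Placeholder titles: substitute the papers listed at intelligence.org/all-publications/
    for title in ["<MIRI paper title 1>", "<MIRI paper title 2>"]:
        print(title, "->", citation_count(title))
```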
If we did a survey like this one, it would probably be largely redundant with the evidence we already have. The people surveyed would need to be AI risk researchers, which pretty much means a small handful of people at MIRI, FHI, FLI, etc. Lots of these people already collaborate with MIRI and cite MIRI papers. Still, we might be able to learn something from hearing their explicit opinions about MIRI, although I don't know what.
undefined @ 2016-02-19T17:16 (+2)
"MIRI currently spends around $2 million dollars a year - including some highly skilled labour that is probably underpriced"
Their 2014 financials on https://intelligence.org/transparency/ say their total expenditures in 2014 were $948k. Their 2015 financials aren't up yet, and I think they did expand in 2015, but I don't think you can claim this unremarked. This is not a neutral error; if you make them look twice as big as they are, then you also make them look half as efficient.
undefined @ 2016-02-19T18:03 (+2)
I simply took a look at their latest fundraising page:
"although we may still slow down or accelerate our growth based on our fundraising performance, our current plans assume a budget of roughly $1,825,000 per year."
https://intelligence.org/2015/12/01/miri-2015-winter-fundraiser/
So hopefully it is indeed not a neutral error.
undefined @ 2016-02-19T18:17 (+5)
Ok, I admit I didn't think to check there. Arguing over the semantics of what "currently spends" means would be pointless, and I recognize that the remark was made in the context of estimating how MIRI's future budget would be affected. But in a discussion about evaluating past performance, I do think it's important not to anchor people's expectations on a budget they don't have yet.
undefined @ 2016-02-19T12:19 (+2)
Why wouldn't we just expect them to publish in peer reviewed journals?
undefined @ 2016-02-19T16:38 (+6)
AI researchers don't usually publish in peer-reviewed journals; they present at conferences. MIRI has presented lots of papers at conferences.
See here: https://intelligence.org/all-publications/
Over the past few years, MIRI has published a couple dozen conference papers and a handful of journal articles.
undefined @ 2016-02-19T18:06 (+2)
I have nothing against that specifically, but publishing in peer-reviewed journals is very costly and slow. Most MIRI funders would think journals are currently biased against the relevant research, and that is one thing MIRI is trying to change. Knowing they are publishing papers also wouldn't speak to the strategy.