Sunishchal Dev comments on Bounty: Diverse hard tasks for LLM agents

Sunishchal Dev Dec 24, 2023, 2:38 AM
LW: 11 AF: 5
0
AF
Thanks for the detailed instructions for the program! Just a few clarifications before I dive in:
1. The README file’s airtable links for task idea & specification submission seem to be the same. Did you mean to paste a different link for task ideas?
2. Are the example task definitions in the PDF all good candidates for implementation? Is there any risk of doing duplicate work if someone else chooses to do the same implementation as me?
3. If I want to do an implementation that isn’t in the examples list, is it a good idea to first submit it as an idea and wait for approval before working on the specification & implementation?
4. Are we allowed to use an LLM to automatically score a task? This seems useful for tasks with fuzzy outputs like answers to research questions. If so, would it need to be a locally hosted LLM like LLAMA? I imagine using an API-based model like GPT-4 would be susceptible to an external dependency that could change in reliability and possibly leak data.
- Beth Barnes Dec 26, 2023, 2:19 AM
  LW: 9 AF: 2
  0
  AF Parent
  Great questions, thank you!
  
  1. Yep, good catch. Should be fixed now.
  
  2. I wouldn’t be too worried about it, but very reasonable to email us with the idea you plan to start working on.
  
  3. I think fine to do specification without waiting for approval, and reasonable to do implementation as well if you feel confident it’s a good idea, but feel free to email us to confirm first.
  4. That’s a good point! I think using an API-based model is fine for now—because the scoring shouldn’t be too sensitive to the exact model used, so should be fine to sub it out for another model later. Remember that it’s fine to have human scoring also.
  - Sunishchal Dev Dec 26, 2023, 6:38 AM
    LW: 5 AF: 1
    0
    AF Parent
    Thanks, this is helpful!
    I noticed a link in the template.py file that I don’t have access to. I imagine this repo is internal only, so could you provide the list of permissions as a file in the starter pack?
    # search for Permissions in https://github.com/alignmentrc/mp4/blob/v0/shared/src/types.ts
    - Beth Barnes Dec 28, 2023, 10:23 PM
      LW: 6 AF: 1
      0
      AF Parent
      Ah, sorry about that. I’ll update the file in a bit. Options are “full_internet”, “http_get”. If you don’t pass any permissions the agent will get no internet access. The prompt automatically adjusts to tell the agent what kind of access it has.

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer