rain8dome9

Karma: 18

rain8dome9 May 10, 2025, 7:40 AM
1 point
0
on: D&D.Sci Tax Day: Adventurers and Assessments Evaluation & Ruleset
Started to work on this puzzle but gave up. Still want to show what I made. This picture is supposed to shows how tax increases as one item type increases. Each color line is an initial set of other items enumerated under ‘key’ (“Cockatrice” “Dragon” “Lich” “Zombie”). There is a picture like this for every taxable item type.

rain8dome9 May 10, 2025, 6:27 AM
2 points
0
on: There’s more low-hanging fruit in interdisciplinary work thanks to LLMs
Please give an example of the output that the LLM produces that is so useful.

rain8dome9 May 7, 2025, 2:05 AM
1 point
0
on: The Best Textbooks on Every Subject
For years, my self-education was stupid and wasteful. … Teaching Company courses,
I wonder if this is still true. I really liked Language Families of the World lectures by a professor from Colombia University. It is the best audiobook by the Great Courses I have found so far. Listening felt more like intelligent entertainment than study but that could be remedied with secondary materials.

rain8dome9 Mar 16, 2025, 2:14 AM
4 points
0
on: A Bear Case: My Predictions Regarding AI Progress
Could you describe the experiment you ran on all theses models? Like ‘if there are three boxes side by side in a line and each can hold one item and the red triangle is not in the middle and the blue circle is not in the box next to the box with a red triangle in it where is the green circle? ’. Chatgpt was not able to solve logic puzzles a year ago and can do it now.

rain8dome9 Jan 10, 2025, 1:12 AM
3 points
2
in reply to: Dagon’s comment on: What’s the best metric for measuring quality of life?
That said, the dimensions of quality that the FDA concerns itself with (including physical functioning, self-reported pain, and other easily- and not-easily-measured things) is likely close enough to “improves quality of life” that it’s not necessary to have a new direction.
Athletic performance. Cognitive performance. Work performance. Also ability to accomplish the things needed in every day life to have uh fun..

rain8dome9 Nov 28, 2024, 8:23 PM
1 point
0
in reply to: Michael Latowicki’s comment on: Missing forecasting tools: from catalogs to a new kind of prediction market
Then wolfram alpha?

rain8dome9 Nov 14, 2024, 12:16 AM
1 point
0
on: Gears vs Behavior
I thinks its worth mentioning that there are two levels of black box models too. ML can memorize the expected value at each set of variables (at 1 rmp crank wheel rotates at 2 rpm) or it can ‘generalize’ and, for this example, tell us that the wheel rotates at 2x speed of crank. To some extent ‘ML generalization’ provides good ‘out of distribution’ predictions.

rain8dome9 Nov 13, 2024, 9:54 PM
4 points
0
on: Missing forecasting tools: from catalogs to a new kind of prediction market
There is no “Wikipedia for predictive models” that I know of. No big repository to easily share and find predictive scientific models other than the relevant domain’s scientific literature, which is not optimized for these tasks: it is not organized by the variables being predicted, it is not generally available as reusable and modular software components, it is usually not focused on predictive work, some of it is paywalled, etc.
Have you tried www.openml.org?

rain8dome9 Nov 12, 2024, 9:10 PM
3 points
0
on: The Median Researcher Problem
Prototypical example: imagine a scientific field in which the large majority of practitioners have a very poor understanding of statistics, p-hacking, etc. Then lots of work in that field will be highly memetic despite trash statistics, blatant p-hacking, etc. Sure, the most competent people in the field may recognize the problems, but the median researchers don’t, and in aggregate it’s mostly the median researchers who spread the memes.
Complicated analysis (like going far beyond p-values) is easy for anyone to see and it is evidence of effort. Complex analysis usually coocurs with thoroughness so fewer mistakes. Complicated analysis coocurs with many concurrent tests so less need to produce positive results so less p-hacking. Consequently, there is a fairly simple solution to researchers with mediocre statistical skills gaining too much trust: more plots! Anyway, I find correlation graphs and multiple comparison impressive. Also I am usually more skilled in data analysis than the subject of a paper so can more easily verify that.

rain8dome9 Nov 12, 2024, 7:49 PM
1 point
0
on: Resolving von Neumann-Morgenstern Inconsistent Preferences
Is this a paper? Has it been published anywhere?

rain8dome9 Apr 10, 2024, 8:31 PM
1 point
0
on: Toward a Broader Conception of Adverse Selection
Relevant quote from Dragonfired by J. Zachary Pike. “Brokers make money by knowing key information; they make fortunes by ensuring that other brokers remain unaware or unsure of the same information until after critical trades.”

rain8dome9 Mar 21, 2024, 8:33 PM
1 point
0
on: Using axis lines for good or evil
In ggplot (R statistical language) the defaults include a subtle grid and no axes. They also put in the extra random space.
Here is some code in case someone else using R wants to try out things discussed here:
library(ggplot2)
qplot(wt, mpg, data = mtcars, colour = factor(cyl)) +
theme(axis.line.x = element_line(color=”black”, size = 0),
axis.line.y = element_line(color=”black”, size = 1)) +
scale_x_continuous(expand = c(0, 0), limits = c(0,8)) +
scale_y_continuous(expand = c(0, 0), limits = c(0,36))

rain8dome9 Dec 17, 2023, 3:03 AM
7 points
0
in reply to: niplav’s comment on: Please Bet On My Quantified Self Decision Markets
Might be able to use Multi-Armed Bandit-like sampling for this, even? Hm…
Effects may take time and may require time to build up to detectable levels. This is why Winters increased the length of each intervention till they lasted some weeks. If the placebo causes a different self report rating then its a bad placebo and should be Blinded out but if it causes a psychological improvement then why not use it?
so non-X days will be more likely measured as being high in X-effect. But that’d mean that X days are more likely followed by non-X, which with random order is not the case.
Yes but it will still make the effect size much less.
Could you elaborate on this a bit
Lag and build up is mentioned above. Training effect is when you get better at something just by doing it, so later interventions look better. At the same time there may be drift of self report. In other words effect of slowly growing change on memory making user think there is no change. For all these reasons plot the time series with time on X results on Y and make each point the color of intervention or placebo. Do not connect the dots with lines but do make a smooth loess-like line. You will be able to see some of the issues if they occur. Some more on all the issues.

rain8dome9 Dec 14, 2023, 12:06 AM
3 points
0
on: Please Bet On My Quantified Self Decision Markets
The more important an effect is usually the stronger it is so starting many of the experiments but for a short time might yield results much faster. May be possible to overlap the non blinded experiments and run many at the same time with varying periodicity so the same interventions do not always happen on top of each other.
Your statistical method is similar to two sample t test right? Well that does not account for several possible issues of time series and dependence between data points of one variable. Lag and training effects for example. So be sure to control all other possible independent variables and plot the data timeline and when you do do not connect data points with lines!

rain8dome9 Dec 10, 2023, 5:17 AM
3 points
0
on: Please Bet On My Quantified Self Decision Markets
In all experiments, I will be using the statistical method detailed here, code for it here, unless someone points out that I’m doing my statistics wrong.
Links lead no nowhere?

rain8dome9 Nov 2, 2023, 2:29 AM
1 point
0
in reply to: niplav’s comment on: rain8dome9′s Shortform
Will you try running the two notebooks on your data? I am starving for feedback and attention.

rain8dome9 Oct 30, 2023, 11:52 PM
1 point
0
on: rain8dome9′s Shortform
Really thorough statistical analysis of Anki (flashcard app) data
rpubs.com/rain8/1100036 Its a work in progress with only two steps finished. Not exactly an addon because its in R not Py. So far the project does many little things like find bugs in user’s collection, describe the growth of their collection and text mining. Ultimate goal is to hopefully be able to use anki as continuous cognitive tester and allow users to learn about and optimize their memorization process. Instructions to run on your own data : github
I am not sure data in anki could really be used as a continuous cognitive health test. Probably requires removing lots of artifacts and other influences and then finding outside influence that definitely relates to cognition. Lit review.

rain8dome9 Aug 26, 2023, 7:42 PM
1 point
0
on: Feedbackloop-first Rationality
I am willing to be a test subject. Evidence that I am serious is I have 119k reviews on Anki and am analyzing the data hoping it will be a psychometric test.

rain8dome9 Jun 26, 2023, 9:49 PM
2 points
0
in reply to: mcint’s comment on: My resentful story of becoming a medical miracle
https://wiki.openhumans.org/wiki/Finding_relations_between_variables_in_time_series This is the link I meant to post.

rain8dome9 Apr 17, 2023, 9:34 PM
2 points
0
on: ACX/LW Meetup
Thank you that was enlightening.

rain8dome9

Really thorough statistical analysis of Anki (flashcard app) data