I'm way out of my depth on some, superficially quite simple, data analysis. I'd be happy to ship some beers (or mint tea/charitable giving/whatever) to anyone who can give me some structured help please?
I have some rows of data (about 125). Each row is a geographic population (like a county or city); collectively it's the population of England.
I have categorised each into one of three sets, depending on a pervailing policy in that area. Set A= places with policy A; B= policy B and C= no policy in place.
I have a rate/100,000 of population/year of an event for each row.
Imagine for example, house fires where some places provide free smoke alarms, others you pay and others are silent on the matter (it's not this, but it's an easier example to explain).
There's variaiton within each policy set. I can get mean (population weighted) and standard deviation. I can show that A, B and C are all statistically different from each other (at p<0.01 or better). So policy makes a clear difference.
I also have a poor correlation of the event with deprivation score (R^2 =~0.25). More deprived areas; more fires.
What I want to work out is *how much* of the variation in the total population is accounted for by policy, how much by deprivation , and how much by "other things". I have no idea how to do this.
I'm kinda daydreaming that someone will say "send me the Excel (125 rows; population, category, rate) and some beer and I can tell you"....or even just pointers about what tools to try. Thanks for your help!