Write a query to print the sum of all total investment values in 2016 (TIV_2016), to a scale of 2 decimal places, for all policy holders who meet the following criteria:
Input Format:
The insurance table is described as follows:
| Column Name | Type | |-------------|---------------| | PID | INTEGER(11) | | TIV_2015 | NUMERIC(15,2) | | TIV_2016 | NUMERIC(15,2) | | LAT | NUMERIC(5,2) | | LON | NUMERIC(5,2) |
where PID is the policyholder's policy ID, TIV_2015 is the total investment value in 2015, TIV_2016 is the total investment value in 2016, LAT is the latitude of the policy holder's city, and LON is the longitude of the policy holder's city.
Sample Input
| PID | TIV_2015 | TIV_2016 | LAT | LON | |-----|----------|----------|-----|-----| | 1 | 10 | 5 | 10 | 10 | | 2 | 20 | 20 | 20 | 20 | | 3 | 10 | 30 | 20 | 20 | | 4 | 10 | 40 | 40 | 40 |
Sample Output
| TIV_2016 | |----------| | 45.00 |
Explanation
The first record in the table, like the last record, meets both of the two criteria. The TIV_2015 value '10' is as the same as the third and forth record, and its location unique. The second record does not meet any of the two criteria. Its TIV_2015 is not like any other policyholders. And its location is the same with the third record, which makes the third record fail, too. So, the result is the sum of TIV_2016 of the first and last record, which is 45.
GROUP BY
and COUNT
[Accepted]Intuition
\nTo decide whether a value in a column is unique or not, we can use GROUP BY
and COUNT
.
Algorithm
\nCheck whether the value of a record\'s TIV_2015 is unique, if it is not unique, and at the same time, its location (LAT, LON) pair is unique, then this record meeting the criteria. So it should be counted in the sum.
\nMySQL
\nSELECT\n SUM(insurance.TIV_2016) AS TIV_2016\nFROM\n insurance\nWHERE\n insurance.TIV_2015 IN\n (\n SELECT\n TIV_2015\n FROM\n insurance\n GROUP BY TIV_2015\n HAVING COUNT(*) > 1\n )\n AND CONCAT(LAT, LON) IN\n (\n SELECT\n CONCAT(LAT, LON)\n FROM\n insurance\n GROUP BY LAT , LON\n HAVING COUNT(*) = 1\n )\n;\n
\n\nTips: Concat the LAT and LON as a whole to represent the location information.
\n
Note: These two criteria should be met without an order, so if you attempt to filter data using criteria #1 first and then criteria #2, you will get a wrong result.
\nTaking the sample input as an example, the data set will be as following after taking the first criteria.
\nPID | \nTIV_2015 | \nTIV_2016 | \nLAT | \nLON | \n
---|---|---|---|---|
1 | \n10 | \n5 | \n10 | \n10 | \n
3 | \n10 | \n30 | \n20 | \n20 | \n
4 | \n10 | \n40 | \n40 | \n40 | \n
Then, the second criteria cannot filter any records on this data set. So the result is 75(5+30+40), which is obviously wrong since the location of record with PID \'3\' is actually the same with the record having been filtered by the first criteria.
\nPID | \nTIV_2015 | \nTIV_2016 | \nLAT | \nLON | \n
---|---|---|---|---|
2 | \n20 | \n20 | \n20 | \n20 | \n
3 | \n10 | \n30 | \n20 | \n20 | \n