mysql - SQL - 连接表列中最常见的值

我有下面描述的三个表:

Area (Id, Description)

City(Id, Name)

Problem(Id, City, Area, Definition):
 City references City (Id), Area references Area (Id)

我想找到每个城市(名称)的问题中出现最频繁的区域(描述)值。

示例:

Area
Id   Description
1      Support
2      Finance  

City
Id      Name
1      Chicago
2      Boston

Problem
Id  City  Area  Definition
1     1     2       A
2     1     2       B
3     1     1       C
4     2     1       D

期望的输出:

 Name         Description
 Chicago        Finance
 Boston         Support

这是我尝试过但没有成功的方法:

SELECT Name,
       Description
FROM
  (SELECT *
   FROM Problem AS P,
        City AS C,
        Area AS A
   WHERE C.Id = P.City
     AND A.Id = P.Area ) AS T1
WHERE Description =
    (SELECT Description
     FROM
       (SELECT *
        FROM Problem AS P,
             City AS C,
             Area AS A
        WHERE C.Id = P.City
          AND A.Id = P.Area ) AS T2
     WHERE T1.Name = T2.Name
     GROUP BY Description
     ORDER BY Count(Name) DESC LIMIT 1 )
GROUP BY Name,
         Description

谢谢!

最佳答案

这可能是解决您的问题的最短方法:

select c.Name, a.Description
from City c
cross join Area a
where a.Id = (
    select p.Area
    from Problem p
    where p.City = c.Id
    group by p.Area
    order by count(*) desc, p.Area asc
    limit 1
)

我们使用 CROSS JOIN 将每个城市与每个区域组合起来。但我们只选择给定城市的 Problem 表中计数最高的 Area，这是在相关子查询中确定的。如果一个城市的两个区域具有相同的最高计数，则将选择按字母顺序排在第一位的区域(order by ... p.Area asc)。

结果:

|    Name | Description |
|---------|-------------|
|  Boston |     Support |
| Chicago |     Finance |

这是另一个更复杂的解决方案，其中包括计数。

select c.Name, a.Description, city_area_maxcount.mc as problem_count
from (
    select City, max(c) as mc
    from (
        select p.City, p.Area, count(*) as c
        from problem p
        group by p.City, p.Area
    ) city_area_count
    group by City
) city_area_maxcount
join (
    select p.City, p.Area, count(*) as c
    from problem p
    group by p.City, p.Area
) city_area_count
    on  city_area_count.City = city_area_maxcount.City
    and city_area_count.c = city_area_maxcount.mc
join City c on c.Id = city_area_count.City
join Area a on a.Id = city_area_count.Area

这里使用了两次别名为city_area_maxcount的子查询(我希望mysql可以缓存结果)。如果您将其视为一个表，那么这将是一个常见的查找每组顶部值的行问题。如果一个城市的两个区域的最高计数相同，则两个区域都会被选择。

结果:

|    Name | Description | problem_count |
|---------|-------------|---------------|
|  Boston |     Support |             1 |
| Chicago |     Finance |             2 |

演示:http://sqlfiddle.com/#!9/c66a5/2

关于mysql - SQL - 连接表列中最常见的值，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/40597379/

mysql - SQL - 连接表列中最常见的值

上一篇：php - 如何根据数据库中最后一个ID的下拉选择自动生成ID号

下一篇：PHP:将表行作为表单中的隐藏元素传递