python - 多个 MySQL JOIN 和重复的单元格

标签 python mysql sql pandas

我有两个 MySQL 查询,它们给出了我正在寻找的结果。我最终想将这两个查询组合起来生成一个表,但我陷入了困境。

查询 1:

SELECT scoc.isr, outcome_concept_id, concept_name as outcome_name
FROM standard_case_outcome AS sco INNER JOIN concept AS c
ON sco.outcome_concept_id = c.concept_id
INNER JOIN standard_case_outcome_category AS scoc
ON scoc.isr = sco.isr
WHERE scoc.outc_code = 'CA'

结果 1:

RESULT OF QUERY 1 (TRUNCATED)

查询 2:

SELECT scoc.isr, drug_seq, concept_id, concept_name as drug_name
FROM standard_case_drug AS scd INNER JOIN concept AS c
ON scd.standard_concept_id = c.concept_id
INNER JOIN standard_case_outcome_category AS scoc
ON scoc.isr = scd.isr
WHERE scoc.outc_code = 'CA'

结果 2:

RESULT OF QUERY 2 (TRUNCATED)

期望的结果: DESIRED RESULT

我很确定我可以弄清楚如何使用Python/pandas来做到这一点,但我想知道是否有(a)一种方法可以做到这一点MySQL(b)使用MySQL来做这件事有什么好处。

** 如果您好奇,this is the entire dataset .

这是相关表的数据库结构:

# Dump of table concept
# ------------------------------------------------------------

CREATE TABLE `concept` (
  `concept_id` int(11) NOT NULL,
  `concept_name` varchar(255) NOT NULL,
  `domain_id` varchar(20) NOT NULL,
  `vocabulary_id` varchar(20) NOT NULL,
  `concept_class_id` varchar(20) NOT NULL,
  `standard_concept` varchar(1) DEFAULT NULL,
  `concept_code` varchar(50) NOT NULL,
  `valid_start_date` date NOT NULL,
  `valid_end_date` date NOT NULL,
  `invalid_reason` varchar(1) DEFAULT NULL,
  PRIMARY KEY (`concept_id`),
  UNIQUE KEY `idx_concept_concept_id` (`concept_id`),
  KEY `idx_concept_code` (`concept_code`),
  KEY `idx_concept_vocabluary_id` (`vocabulary_id`),
  KEY `idx_concept_domain_id` (`domain_id`),
  KEY `idx_concept_class_id` (`concept_class_id`)
) ENGINE=InnoDB DEFAULT CHARSET=latin1;



# Dump of table standard_case_drug
# ------------------------------------------------------------

CREATE TABLE `standard_case_drug` (
  `primaryid` varchar(512) DEFAULT NULL,
  `isr` varchar(512) DEFAULT NULL,
  `drug_seq` varchar(512) DEFAULT NULL,
  `role_cod` varchar(512) DEFAULT NULL,
  `standard_concept_id` int(11) DEFAULT NULL,
  KEY `idx_standard_case_drug_primary_id` (`primaryid`(255),`drug_seq`(255)),
  KEY `idx_standard_case_drug_isr` (`isr`(255),`drug_seq`(255)),
  KEY `idx_standard_case_drug_standard_concept_id` (`standard_concept_id`)
) ENGINE=InnoDB DEFAULT CHARSET=latin1;



# Dump of table standard_case_outcome
# ------------------------------------------------------------

CREATE TABLE `standard_case_outcome` (
  `primaryid` varchar(512) DEFAULT NULL,
  `isr` varchar(512) DEFAULT NULL,
  `pt` varchar(512) DEFAULT NULL,
  `outcome_concept_id` int(11) DEFAULT NULL,
  `snomed_outcome_concept_id` int(11) DEFAULT NULL,
  KEY `idx_standard_case_outcome_primary_id` (`primaryid`(255)),
  KEY `idx_standard_case_outcome_isr` (`isr`(255)),
  KEY `idx_standard_case_outcome_outcome_concept_id` (`outcome_concept_id`),
  KEY `idx_standard_case_outcome_snomed_outcome_concept_id` (`snomed_outcome_concept_id`)
) ENGINE=InnoDB DEFAULT CHARSET=latin1;



# Dump of table standard_case_outcome_category
# ------------------------------------------------------------

CREATE TABLE `standard_case_outcome_category` (
  `primaryid` varchar(512) DEFAULT NULL,
  `isr` varchar(512) DEFAULT NULL,
  `outc_code` varchar(512) DEFAULT NULL COMMENT 'Code for a patient outcome (See table below) CODE MEANING_TEXT ----------------DE Death LT Life-ThreateningHO Hospitalization - Initial or ProlongedDS DisabilityCA Congenital AnomalyRI Required Intervention to Prevent PermanentImpairment/DamageOT Other Serious (Important Medical Event) NOTE: The outcome from the latest version of a case is provided. If there is more than one outcome, the codes willbe line listed.',
  `snomed_concept_id` int(11) DEFAULT NULL,
  KEY `idx_standard_case_outcome_category_primary_id` (`primaryid`(255)),
  KEY `idx_standard_case_outcome_category_isr` (`isr`(255)),
  KEY `idx_standard_case_outcome_category_snomed_concept_id` (`snomed_concept_id`,`outc_code`(255))
) ENGINE=InnoDB DEFAULT CHARSET=latin1;

最佳答案

这应该会得到你想要的结果。

SELECT
    `scoc`.`isr`                AS `isr`,
    `sco` .`outcome_concept_id` AS `outcome_concept_id`,
    `c1`  .`concept_name`       AS `outcome_name`,
    `scd` .`drug_seq`           AS `drug_seq`,
    `scd` .`concept_id`         AS `concept_id`,
    `c2`  .`concept_name`       AS `drug_name`
FROM
    `standard_case_outcome` AS `sco`
INNER JOIN
    `concept` AS `c1`
    ON
        `sco`.`outcome_concept_id` = `c1`.`concept_id`
LEFT JOIN
    `standard_case_drug` AS `scd`
    ON
        `sco`.`isr` = `scd`.`isr`
INNER JOIN
    `concept` AS `c2`
    ON
        `scd`.`outcome_concept_id` = `c2`.`concept_id`
INNER JOIN
    `standard_case_outcome_category` AS `scoc`
    ON
        `scoc`.`isr` = `sco`.`isr`
WHERE
    `scoc`.`outc_code` = 'CA'

编辑

请注意,我省略了 concept 表,因为您没有从中选择任何内容,也没有用它过滤结果。

第二次编辑

已更新以包含概念表。更新的问题表明,SELECT 中实际上需要它。

第三次编辑

需要分别为scoscd选择concept.name

关于python - 多个 MySQL JOIN 和重复的单元格,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/40033167/

相关文章:

python - 嵌套 for 循环 n 次以将列表的子列表添加到自身

python - darkflow - 属性错误 : 'NoneType' object has no attribute 'find'

sql - 如何在一个查询中显示与所有反馈相关的所有标签

php - 如何在 where 条件下比较多个 id

python - 如何为经过身份验证的用户添加登录后信息?

Python 3 石头剪刀布循环

php - SQL MAX 与 WHERE 子句一起使用

php - 重置现有表中的 id 值(不删除记录)

mysql - 有序随机性

SQL查询查找员工周年纪念日