python - 可变数据类型的 Django 模型

我想做的是一个跟踪个人记录的数据库。该模型即将完成，但我面临着存储不同类型记录的一些困难。

有时间、重量、重复/圈数、距离的记录...因此，有不同类型的数据:时间、小数、整数...

首先，我为每种类型的数据创建了一个表(或 django 中的类)。一个表为时间，其他为体重(十进制)等等。

但我想知道是否有更好的解决方案，只为所有记录保留一张表。

我的分离表/模型的代码(因此，每种类型我都有一个类，总共 5 个模型):(这很好用，但我必须预先选择适当的模型来插入数据。)

class PRWeight(model.Models):  # there are PRDistance, PRLaps, PRHeight, PRTime
    user = FK(User)
    exercise = FK(Exercise)
    date = datefield()
    weight = decimalfield()  # integer, integer, decimal, time

    class Meta:
        unique_together = [user, exercise, date,]

或者我可以做这样的事情，如果它是好的或者有更好的解决方案:

class PR(models.Model):
    user = FK(User)
    exercise = FK(Exercise)
    date = datefield()
    metric = FK(Metric)  # choose between time, weight, height...
                         # the aspect beeing mesured
    record =             # how can I call the right field type?

或者我可以用blank=True替换其他五个字段的“记录”字段

class PR(models.Model):
    user = FK(User)
    exercise = FK(Exercise)
    date = datefield()
    metric = FK(Metric)  # choose between time, weight, height, distance...
                         # the aspect beeing mesured
                         # not necessary in this approach
    wheight = decimalfield(blank=True)
    height = decimalfield(blank=True)
    time = timefield(blank=True)
    distance = integerfield(blank=True)
    laps = integerfield(blank=True)

我正在寻找一个简单的解决方案。到目前为止，我倾向于选择最后一个示例，因为它很简单，但是填写表单的用户可能会犯错误......

最佳答案

表包含使某些语句(由列名参数化)为真的行。表应该只包含列，只要您可以使用它们发出单个语句即可。

将许多但相关的列放在一起

如果(用户、锻炼、日期)始终具有体重和日期，则使用一个表格。

-- user [user] in exercise [exercise] on [date] weighed [weight] kg and was [height] m tall
PRstuff(user,exercise,date,weight,height)

但是，如果(用户、锻炼、日期)可能有体重但没有日期，则需要单独的表格。

-- user [user] in exercise [exercise] on [date] lifted [weight] kg
PRlift(user,exercise,date,weight)
-- User [user] in exercise [exercise] on [date] jumped [height] m
PRjump(user,exercise,date,height)

条件列很复杂

可以有一个像第三个示例一样的语句/表:

--     user [user] did [exercise] on [date]
-- AND ( lifted != blank and they lifted [weight] kg OR lifted = blank and they didn't lift )
-- AND ( jumped != blank and they jumped [height] m OR jumped = blank and they didn't jump )
PR(user,exercise,date,weight,height)

但是正如您所看到的，您的查询语句(表语句的组合)和 SQL 表达式(表的组合)变得复杂。基本上，在查询中，无论如何，您都必须不断将该表分割成单独的非条件表版本。

不要使用记录类型的列

有时我们可能有一个列，其类型由固定部分组成。但是，如果您想使用 SQL 中的逻辑条件来查询各个部分，那么您应该将各个部分放入表的列中。

-- user [user] in exercise [exercise] on [date] weighed [record.weight] kg and was [record.height] m tall
PRrecord(user,exercise,date,record)

SELECT * FROM PRrecord
WHERE PRrecord.record.weight = 100 -- really, record_dot(PRrecord.record,'weight')=100

这里第一个点是数据库表操作，第二个点是编程语言记录操作。 DBMS 无法优化您的查询，因为它优化表操作，而不是数据类型操作。基本上，它必须获取一大堆不查看记录值的行，然后调用记录运算符点，然后调用字段相等，然后丢弃大量行。

SELECT * FROM PR
WHERE PR.weight = 100

现在，DBMS 可以将字段相等性结合到优化获取行的方式中，因为您只使用了表版本的 dot。

不要使用容器类型的列

有时我们可能有一个列，其类型由相似部分的集合组成。但是，如果您想使用 SQL 中的逻辑条件来查询部件，那么您应该创建一个新表。新表有一个针对该特定集合的 PK 列，而旧表有一列是新 PK 的 FK。

-- user [user] in exercise [exercise] on [date] and their set of lifted weights in kg is [weights]
PRlifts(user,exercise,date,weights)

SELECT user FROM PRlifts l
WHERE l.name = 'Fred' AND set_has_member(l.weights,200)
AND ??? no two lifts were the same weight ???

不好。请注意该语句是多么复杂。此外，DBMS 无法优化查询，因为 set_has_member 不是表操作。更糟糕的是，您甚至无法查询某些条件，您必须编写非查询循环代码。

SELECT user FROM PRlift l
WHERE l.user = 'Fred' AND l.weight = 200
AND NOT EXISTS(
    SELECT weight
    FROM Prlift l1, PRlift l2
    WHERE l1.user = l.user AND l2.user = l.user AND l1.weight = l2.weight
)

现在 DBMS 可以优化并消除循环。

(请注意，如果这是

WHERE string_length(l1.name) = string_length(l2.name)

然后 DBMS 可以通过名称长度列进一步优化。但通常 DBMS 具有字符串和某些其他类型的特殊知识，并且或多或少会进行优化，就好像某些列存在与某些运算符的值相对应一样。事实上，DBMS 可以了解记录和集合类型，但您仍然无法拥有简单的语句和无循环查询。)

关于python - 可变数据类型的 Django 模型，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/24107755/

python - 可变数据类型的 Django 模型

上一篇：python - Cython 安装警告

下一篇：python - 单个表上的 SQLAlchemy 多对多关系错误 : NoReferencedTableError