SQL Coldfusion - 搜索重复名称时消除单词 THE

标签 sql coldfusion sql-server-2012 coldfusion-9 cfml

我有一个流程可以创建可能重复的公司列表。问题是“ABC Company, Inc.”。和“ABC Company, Inc.”都在德克萨斯州达拉斯可能是重复的,但我不会按照我的标准找到它们。我已经删除了前 4 个字符(如果它们是“the ”),但我还需要检查正确的 5 个字符(如果它们是“Inc.”)。

我有一个创建列名称的 View 。前缀“the”已被删除;

    SELECT     CASE WHEN LEFT(name, 4) = 'The ' THEN RIGHT(name, (len(name) - 4)) ELSE name END AS thename, CASE WHEN CHARINDEX(' ', ltrim(rtrim(Name))) 
                          = 0 THEN ltrim(Name) WHEN CHARINDEX(' ', ltrim(Name)) = 1 THEN ltrim('b') ELSE SUBSTRING(ltrim(Name) + ' x', 1, CHARINDEX(' ', ltrim(Name))) END AS subname, 
                          CHARINDEX(' ', LTRIM(Name)) AS wordcheck, Name, Address_Line_1, City AS Company_City, State AS Company_State, Zip, Area_Code, Phone, Status_Flag, ID, 
                          Not_Dupe_Flag, DUNS, Temp_Check_Dupes_Flag, Parent_Company_Number, Special_Display, 
                          CASE WHEN c.parent_company_number = 0 THEN c.id ELSE c.parent_company_number END AS parent
    FROM         dbo.Companies AS c

然后我在查询中使用该 View 来查找重复项;

        <cfquery name="qResults" datasource="#request.dsnlive#" timeout="200">
            SELECT b.ID,
            Thename,
            substring(TheName,1,(CHARINDEX(' ',TheNAME,1))) as subName,
            name,
            b.address_line_1,
            b.zip,
            b.company_state,
            b.company_city,
            b.area_code,
            b.phone,
            b.Special_Display,
            isnull(not_dupe_flag,'False') as not_dupe_flag,
            isnull(Temp_Check_Dupes_Flag,'False') as Temp_Check_Dupes_Flag,
            b.id as bID,
            b.duns
            FROM dbo.vw_Comp_Details_withFirstWord as b
            WHERE isnull(b.status_flag,'') != 'D'
            and b.ID <> #arguments.CompNum#
                and isnull(b.Temp_Check_Dupes_Flag,'False') = 'False'
            <cfif arguments.IncludeDunsOnly eq 0>
                <cfif arguments.FirstWord>
                    AND b.subName = '#arguments.CompanySubName#'
                <cfelse>
                    AND (substring(dbo.KeepAlphaNumCharacters(Thename),1,#val(arguments.WordLength)#) = substring('#arguments.CompanyName#',1,#val(arguments.WordLength)#)
                    or differnce(soundex(Thename),soundex('#arguments.CompanyName#')) > 2)
                </cfif>
                AND (
                ( company_city = '#arguments.City#'
                 AND Isnull(company_city, '') > '' )
                 AND ( b.parent != #val(arguments.Parent)#
                     AND Isnull(b.parent, '0') > 0 )
                     )
                <cfif arguments.IncludeDuns>
                AND (
                ( REPLACE(LTRIM(REPLACE(b.duns, '0', ' ')), ' ', '0') = '#val(arguments.Duns)#'
                 AND REPLACE(LTRIM(REPLACE(b.duns, '0', ' ')), ' ', '0') > ' '
                 AND #val(arguments.Duns)# > 0 )
                    or  REPLACE(LTRIM(REPLACE(b.duns, '0', ' ')), ' ', '0') = ' '
                    )
                </cfif>

            <cfelse>
                and (REPLACE(LTRIM(REPLACE(b.duns, '0', ' ')), ' ', '0') = '#val(arguments.Duns)#')
            </cfif>


        </cfquery>

现在我需要添加代码来删除后缀“Inc.”。但我似乎无法想出最终的逻辑,该列包含不带前缀“The”和后缀“Inc.”的名称

最佳答案

我想分享我的 question 几天前。这是在 Postgres 中完成的,但我确信您可以找到将字符串拆分为 RDBMS 行的等效方法。

您要做的就是拆分字符串并删除有问题的字符串,例如 TheInc

<强> SQL Fiddle Demo

| ID |   token |
|----|---------|
|  1 |     The |
|  1 |     ABC |
|  1 | Company |
|  1 |    Inc. |
|  2 |     ABC |
|  2 | Company |
|  2 |    Inc. |
|  3 |     ABC |
|  3 | Company |

然后你走另一条路并加入剩下的strings together postgres 使用 string_agg() MSsql 使用 XML PATH

关于SQL Coldfusion - 搜索重复名称时消除单词 THE,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/32780028/

相关文章:

php - 在存储到数据库之前添加到用户输入值

mysql - LEFT JOIN 表和 REGEXP 特定字符的 SUM 值

mysql - 加载数据文件错误: data too short for column

excel - 有没有办法使用 ColdFusion SpreadsheetSetHeader/Footer 函数在页眉/页脚中添加 Excel 页码

coldfusion - ColdFusion 中整数的树形图

javascript - Coldfusion Websockets "User is typing..."功能

sql-server-2012 - 缺少 Microsoft Excel 作为 SQL Server 数据导入工具中的选项

mysql - 在 codeigniter 中使用 Where 和 Where 条件更新 SQL

sql - 波斯语全文索引停止列表

sql-server - 在 CONCAT 中使用逐份打印