qaq终于在凌晨刷完leetcode高频SQL50题。有道题做了两天，太难受了。

这儿说下我觉得比较难的一道题。标题链接：1164.指定日期的产品价格

题干信息

产品数据表:Products

+---------------+---------+
| Column Name   | Type    |
+---------------+---------+
| product_id    | int     |
| new_price     | int     |
| change_date   | date    |
+---------------+---------+
(product_id, change_date) 是此表的主键（具有唯一值的列组合）。
这张表的每一行分别记录了 某产品 在某个日期 更改后 的新价格。

编写一个解决方案，找出在2019-08-16这一天全部产品的价格，假设所有产品在修改前的价格都是10 。

以恣意次序 返回成果表。成果格局如下例所示。

示例 1:

输入：
Products 表:
+------------+-----------+-------------+
| product_id | new_price | change_date |
+------------+-----------+-------------+
| 1          | 20        | 2019-08-14  |
| 2          | 50        | 2019-08-14  |
| 1          | 30        | 2019-08-15  |
| 1          | 35        | 2019-08-16  |
| 2          | 65        | 2019-08-17  |
| 3          | 20        | 2019-08-18  |
+------------+-----------+-------------+
输出：
+------------+-------+
| product_id | price |
+------------+-------+
| 2          | 50    |
| 1          | 35    |
| 3          | 10    |
+------------+-------+

题干解析

在16号这一天的价格核算的逻辑不难，难点在于怎么求得两个跟8月16日最近的日期。

错误思路

一开始我想的是，通过日期核算，减去2019-08-16这天的日期，然后依据日期差的绝对值依照产品ID分组升序排序，终究每款产品前面两行的值便是离16日最近的日期。

select
    product_id,
    new_price,
    change_date, daydiff,
    row_number() over(partition by product_id order by abs(daydiff) asc) as rn
from (select
        product_id,
        new_price,
        change_date,
        datediff(change_date, "2019-08-16") as daydiff
    from Products) as t

这个主意在上面示例数据是可行的。可是假如每款产品存在多行与8月16日日期相近的数据的话，上述SQL就会取得错误的日期。而且这样没有办法判别rn=1仍是rn=2的日期较大（关于16日而言）。

由于下面的数据与实例的数据不同，所以粘贴一下下面的代码测试数据

那么这儿换一种思路，需求求解大于16日的日期中最小的日期，以及小于16日的日期中最大的日期，这两个日期，方便后续进行价格的核算。同样，也是与16日的日期进行相减核算日期差，可是这儿分隔来核算，即大于16日的日期跟小于16日的日期分隔两个SQL进行求解

-- 核算出大于2019-08-16的最小的日期
select
    product_id,
    new_price,
    change_date,
    datediff(change_date, "2019-08-16") as daydiff,
    row_number() over(partition by product_id order by datediff(change_date, "2019-08-16") asc) as rn
from Products
where datediff(change_date, "2019-08-16")>=0
;

由于这儿过滤掉了小于16日的日期，且数据量较少，所以看到的数据只要2行

-- 核算出小于2019-08-16的最大的日期
select
    product_id,
    new_price,
    change_date,
    datediff("2019-08-16", change_date) as daydiff,
    row_number() over(partition by product_id order by datediff("2019-08-16", change_date) asc) as rn
from Products
where datediff("2019-08-16", change_date)>=0
;

由于我两个SQL都用了大于等于，所以假如存在16日的数据，这两个SQL都会查询出来，这个对后面的判别愈加简单。还有一点便是，id为3的产品由于只要小于16日的数据，所以只要在第二个SQL才查询出来，也便是说存在部分产品只要16日前后修改正一次价格，在后面的SQL中需求用full outer join，可是很不巧，MySQL是没有这个full outer join的，可是可以用left outer join union right outer join来完成。（注意这儿有必要要用union进行去重）

关于16日的价格，这儿给出几种判别逻辑：

16日当天修改正价格
只要一行数据，日期小于16号，则取最新日期
只要一行数据，日期大于16号，则取变价前的值
两个日期都大于16号，则取变价前的值
日期一大一小，则取小的值
最大的日期都小于16日，则取最新的变价
最小的也大于16日，则取变价前的值

解题

先求解16日前后最近的日期

-- 核算出大于2019-08-16的最小的日期
with tmp as (select
    product_id,
    new_price,
    change_date,
    datediff(change_date, "2019-08-16") as daydiff,
    row_number() over(partition by product_id order by datediff(change_date, "2019-08-16") asc) as rn
from Products
where datediff(change_date, "2019-08-16")>=0),
tmp2 as (
-- 核算出小于2019-08-16的最大的日期
select
    product_id,
    new_price,
    change_date,
    datediff("2019-08-16", change_date) as daydiff,
    row_number() over(partition by product_id order by datediff("2019-08-16", change_date) asc) as rn
from Products
where datediff("2019-08-16", change_date)>=0)
;

然后将上述两个暂时表做full outer join

-- 核算出大于2019-08-16的最小的日期
with tmp as (select
    product_id,
    new_price,
    change_date,
    datediff(change_date, "2019-08-16") as daydiff,
    row_number() over(partition by product_id order by datediff(change_date, "2019-08-16") asc) as rn
from Products
where datediff(change_date, "2019-08-16")>=0),
tmp2 as (
-- 核算出小于2019-08-16的最大的日期
select
    product_id,
    new_price,
    change_date,
    datediff("2019-08-16", change_date) as daydiff,
    row_number() over(partition by product_id order by datediff("2019-08-16", change_date) asc) as rn
from Products
where datediff("2019-08-16", change_date)>=0)
select
        ta.product_id,
        ta.price1,
        tb.price2,
        ta.date1,
        tb.date2
    from 
        (select product_id, new_price as price1, change_date as date1 from tmp where rn=1) as ta
    left outer join
        (select product_id, new_price as price2, change_date as date2 from tmp2 where rn=1) as tb
    on ta.product_id=tb.product_id
union
select
        tb.product_id,
        ta.price1,
        tb.price2,
        ta.date1,
        tb.date2
    from 
        (select product_id, new_price as price1, change_date as date1 from tmp where rn=1) as ta
    right outer join
        (select product_id, new_price as price2, change_date as date2 from tmp2 where rn=1) as tb
    on ta.product_id=tb.product_id
;

成果如下图所示：

然后对上述表进行求解即可：

-- 核算出大于2019-08-16的最小的日期
with tmp as (select
    product_id,
    new_price,
    change_date,
    datediff(change_date, "2019-08-16") as daydiff,
    row_number() over(partition by product_id order by datediff(change_date, "2019-08-16") asc) as rn
from Products
where datediff(change_date, "2019-08-16")>=0),
tmp2 as (
-- 核算出小于2019-08-16的最大的日期
select
    product_id,
    new_price,
    change_date,
    datediff("2019-08-16", change_date) as daydiff,
    row_number() over(partition by product_id order by datediff("2019-08-16", change_date) asc) as rn
from Products
where datediff("2019-08-16", change_date)>=0)
select
    product_id,
    case
        when date1='2019-08-16' then price1  -- 等于16号
        when date2='2019-08-16' then price2
        when date2 is null and date1<'2019-08-16' then price1  -- 只要一行数据，日期小于16号，则取最新日期
        when date2 is null and date1>'2019-08-16' then 10  -- 只要一行数据，日期大于16号，则取变价前的值
        when date1>date2 and date2>'2019-08-16' then 10  -- 两个日期都大于16号，则取变价前的值
        when date1>date2 and date1>'2019-08-16' and date2<'2019-08-16' then price2  -- 一大一小，则取小的值
        when date1<'2019-08-16' and date2<'2019-08-16' then price1  -- 最大的日期都小于16日，则取最新的变价
        when date2>'2019-08-16' then 10  -- 最小的也大于16日，则取变价前的值
        when date1 is null and date2<'2019-08-16' then price2
        else null 
    end as price
from
    (select
            ta.product_id,
            ta.price1,
            tb.price2,
            ta.date1,
            tb.date2
        from 
            (select product_id, new_price as price1, change_date as date1 from tmp where rn=1) as ta
        left outer join
            (select product_id, new_price as price2, change_date as date2 from tmp2 where rn=1) as tb
        on ta.product_id=tb.product_id
    union
    select
            tb.product_id,
            ta.price1,
            tb.price2,
            ta.date1,
            tb.date2
        from 
            (select product_id, new_price as price1, change_date as date1 from tmp where rn=1) as ta
        right outer join
            (select product_id, new_price as price2, change_date as date2 from tmp2 where rn=1) as tb
        on ta.product_id=tb.product_id) as tc
;

这儿采用的是case进行处理。那么假如是用if进行判别呢？（这儿不建议用）首要if语句没有办法将所有状况写到一列中，每种判别逻辑需求单独一列，后续在用ifnull将成果合并为一列，进程繁琐，而且可读性差，修改逻辑的话代码可维护性差！！！这儿粘贴下错误的示例：

这段代码在数据量少的状况下是可以执行成功的，如上述的测试数据

这儿查看下中间状况的数据，即终究成果前的状况

由于这个数据量太少了，关于判别逻辑来说，所以会存在很多的空列。一起也说明了case关于if的优越性！！！所以能用case的状况下尽量用case，会使代码全体的可读性以及观赏性上一个层次的。哈哈

每日SQL一练#20231030

题干信息

题干解析

错误思路

解题

作者信息

推广

每日SQL一练#20231030

题干信息

题干解析

错误思路

解题

相关文章

实时增量学习在云音乐直播推荐系统中的工程实践

独立开发最重要的还是产品要打出差异化

ChatGPT 背后的“功臣”——RLHF 技术详解

火山引擎边缘云，助力业务敏捷创新

作者信息

推广