c# - 我可以使用流解压缩和反序列化文件吗?

标签 c# stream json.net gzipstream

我的应用程序使用 Json.Net 序列化一个对象,压缩生成的 JSON,然后将其保存到文件中。此外,应用程序可以从这些文件之一加载对象。这些对象的大小可能达到数十 Mb,由于现有代码创建大型字符串和字节数组的方式,我担心内存使用情况:-

public void Save(MyClass myObject, string filename)
{
    var json = JsonConvert.SerializeObject(myObject);
    var bytes = Compress(json);
    File.WriteAllBytes(filename, bytes);
}

public MyClass Load(string filename)
{    
    var bytes = File.ReadAllBytes(filename);
    var json = Decompress(bytes);
    var myObject = JsonConvert.DeserializeObject<MyClass>(json);
}

private static byte[] Compress(string s)
{
    var bytes = Encoding.Unicode.GetBytes(s);

    using (var ms = new MemoryStream())
    {
        using (var gs = new GZipStream(ms, CompressionMode.Compress))
        {
            gs.Write(bytes, 0, bytes.Length);
            gs.Close();
            return ms.ToArray();
        }
    }
}

private static string Decompress(byte[] bytes)
{
    using (var msi = new MemoryStream(bytes))
    {
        using (var mso = new MemoryStream())
        {
            using (var gs = new GZipStream(msi, CompressionMode.Decompress))
            {
                gs.CopyTo(mso);
                return Encoding.Unicode.GetString(mso.ToArray());
            }
        }
    } 
}

我想知道 Save/Load 方法是否可以用流代替?我找到了将流与 Json.Net 一起使用的示例,但我正在努力了解如何适应额外的压缩内容。

最佳答案

JsonSerializer有方法从 JsonTextReader 序列化和 StreamWriter ,两者都可以在任何类型的流之上创建,包括 GZipStream。使用它们,您可以创建以下扩展方法:

public static partial class JsonExtensions
{
    // Buffer sized as recommended by Bradley Grainger, https://faithlife.codes/blog/2012/06/always-wrap-gzipstream-with-bufferedstream/
    // But anything smaller than 85,000 bytes should be OK, since objects larger than that go on the large object heap.  See:
    // https://learn.microsoft.com/en-us/dotnet/standard/garbage-collection/large-object-heap
    const int BufferSize = 8192;
    // Disable writing of BOM as per https://datatracker.ietf.org/doc/html/rfc8259#section-8.1
    static readonly Encoding DefaultEncoding = new UTF8Encoding(false);

    public static void SerializeToFileCompressed(object value, string path, JsonSerializerSettings settings = null)
    {
        using (var fs = new FileStream(path, FileMode.Create, FileAccess.Write, FileShare.Read))
            SerializeCompressed(value, fs, settings);
    }

    public static void SerializeCompressed(object value, Stream stream, JsonSerializerSettings settings = null)
    {
        using (var compressor = new GZipStream(stream, CompressionMode.Compress))
        using (var writer = new StreamWriter(compressor, DefaultEncoding, BufferSize))
        {
            var serializer = JsonSerializer.CreateDefault(settings);
            serializer.Serialize(writer, value);
        }
    }

    public static T DeserializeFromFileCompressed<T>(string path, JsonSerializerSettings settings = null)
    {
        using (var fs = new FileStream(path, FileMode.Open, FileAccess.Read, FileShare.Read))
            return DeserializeCompressed<T>(fs, settings);
    }

    public static T DeserializeCompressed<T>(Stream stream, JsonSerializerSettings settings = null)
    {
        using (var compressor = new GZipStream(stream, CompressionMode.Decompress))
        using (var reader = new StreamReader(compressor))
        using (var jsonReader = new JsonTextReader(reader))
        {
            var serializer = JsonSerializer.CreateDefault(settings);
            return serializer.Deserialize<T>(jsonReader);
        }
    }
}

参见 Performance Tips: Optimize Memory Usage在 Json.NET 文档中。

关于c# - 我可以使用流解压缩和反序列化文件吗?,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/32943899/

相关文章:

c# - Linq 返回 IEnumerable<MyObject>,如何返回 MyListObject?

c# - Wcf 服务返回流但不会关闭

c# - 反序列化 Microsoft.OData.Edm.Date

c# - 使用 JsonIgnoreAttribute 标记私有(private) setter

c# - 使用c#从json数据中获取值

c# - XPath - 返回子节点但不返回后代

c# - 如何从 url 获取 json 字符串?

c# - 是否有用于获取原始 Properties.Settings 的 API?

python - Azure Speech SDK 使用 python 从流中将语音转换为文本

http - Flutter http获得完成百分比?